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METHODS OF DIAGNOSIS OF LUNG CANCER, COMPOSITIONS AND METHODS 
OF SCREENING FOR MODULATORS OF LUNG CANCER 

5 

CROSS-REFERENCES TO RELATED APPLICATIONS 

This application is related to USSN 60/284,770, filed April 18, 2001; USSN 
60/290,492, filed May 10, 2001; USSN 60/334,370, filed November 29, 2001; USSN 
60/339,245, filed November 9, 2001; USSN 60/350,666, filed November 13, 2001; and 
1 0 USSN 60/xxx,xxx, filed April 12, 2002 (Docket OMNI-002P); each of which is incorporated 
herein by reference in its entirety. 

FIELD OF THE INVENTION 
The invention relates to the identification of nucleic acid and protein expression 
15 profiles and nucleic acids, products, and antibodies thereto that are involved in lung cancer; 
and to the use of such expression profiles and compositions in diagnosis and therapy of lung 
cancer. The invention further relates to methods for identifying and using agents and/or 
targets that inhibit lung cancer or related conditions. 

20 BACKGROUND OF THE INVENTION 

Lung cancer is the second most commonly occurring cancer in the United States and 
is the leading cause of cancer-related death. It is estimated that there are over 160,000 new 
cases of lung cancer in the United States every year. Of those who are diagnosed with lung 
cancer, 86 percent will die within five years. Lung cancer is the most common visceral 

25 cancer in men and accounts for nearly one third of all cancer deaths in both men and women. 
In fact, lung cancer accounts for 7% of all deaths, due to any cause, in both men and women. 

Smoking is the primary cause of lung cancer, with more than 80% of lung cancers 
resulting from smoking. About 400 to 500 separate gaseous substances are present in the 
smoke of a non-filter cigarette. The most noteworthy substances include nitrogen oxides, 

30 hydrogen cyanide, formaldehyde, benzene, and toluene. The particles present in cigarette 
smoke contain at least 3,500 individual compounds such as nicotine, tobacco alkaloids 
(nornicotine, anatabine, anabasine), polycyclic aromatic hydrocarbons (e.g., benzo(a)pyrene, 
B(a)P), naphthalenes, aromatic amines, phenols, and tobacco-specific nitrosamines. 
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Tobacco-specific nitrosamines are fonned during tobacco curing and processing, and 

are suspected of causing lung cancer in humans. In rodent studies, regardless of the where or 

how it is applied, the tobacco-specific nitrosamine known as NNK produces lung adenomas 

and lung adenocarcinomas. The tobacco-specific nitrosamine known as NNAL also produces 

5 lung adenocarcinomas in rodents. 

Many of the chemicals found in cigarette smoke also affect the nonsmoker inhaling 

"secondhand" or sidestream smoke. Indeed, the smoke inhaled by non-smokers has a 

chemical composition similar to the smoke inhaled by smokers, but, importantly, the 

concentrations of the carcinogenic tobacco-specific nitrosamines are present in higher 

10 concentrations in second hand smoke. For this and other reasons, "passive smoking" is an 

important cause of lung cancer, causing as many as 3,000 lung cancer deaths in nonsmokers 

each year. 

In addition to smoking, other factors thought to be causes of lung cancer include on- 
the-job exposure to carcinogens such as asbestos and uranium, exposure to chemical hazards 

15 such as radon, polycyclic aromatic hydrocarbons, chromium, nickel, and inorganic arsenic, 
genetic factors, and diet. 

Histological classification of various lung cancers define the types of cancer that 
begin in the lung. See, e.g., Travis, et al. (1999) Histological Typing of L ung and Pleural 
Tumours (International Histological Classification of Tumours, No 1. Four major cell types 

20 make up more than 88% of all primary lung neoplasms. These are: squamous or epidermoid 
carcinoma, small cell (also called oat cell) carcinoma, adenocarcinoma, and large cell (also 
called large cell anaplastic) carcinoma. The remainder include undifferentiated carcinomas, 
carcinoids, bronchial gland tumors, and other rarer types. The various cell types have 
different natural histories and responses to therapy, and, thus, a correct histologic diagnosis is 

25 the first step of effective treatment. 

Small cell lung cancer (SCLC) accounts for 18-25% of all lung cancers, and occurs 
less frequently than non-small cell lung cancers, and generally spread to distant organs more 
rapidly than non-small cell lung cancer. In general, at the time of presentation small cell lung 
cancers have already spread beyond the beyond the bounds where surgery and curative intent 

30 can be undertaken. Hoever, if identified early enough, these cancers are often responsive to 
chemotherapy and thoracic radiation treatment. 

Non-small cell lung cancers (NSCLC) are the more frequently occurring form of lung 
cancer. They comprise squamous cell carcinoma, adenocarcinoma, and large cell carcinoma 
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and account for more than 75% of all lung cancers. Non-small cell tumors that are localized 
at the time of presentation can sometimes be cured with surgery and/or radiotherapy, but 
usually are not identified until significant metastasis has occurred, which are typically not 
very responsive to surgical, chemotherapy, or radiation treatment . 
5 The screening of asymptomatic persons at high risk for lung cancer has often proven 

ineffective. In general, only 5 to 15 percent of lung cancer patients have their disease 
detected while they are asymptomatic. Of course, early detection and treatment are critical 
factors in the fight against lung cancer. The average survival rate is 49% for those whose 
cancer is detected early, before the cancer has spread from the lung. Lung cancer often 

1 0 spreads outside of the lung, and it may have spread to the bones or brain by the time it is 
diagnosed. While the prognosis may be better for lung cancers that are detected early, 
because of the lack ofv effective curative treatments, early detection does not necessarily alter 
the total death rate from lung cancer. 

Thus, methods for diagnosis and prognosis of lung cancer and effective treatment of 

1 5 lung cancer would be desirable. Accordingly, provided herein are methods that can be used 
in diagnosis and prognosis of lung cancer. Further provided are methods that can be used to 
screen candidate therapeutic agents for the ability to modulate, e.g., treat, lung cancer. 
Additionally, provided herein are molecular targets and compositions for therapeutic 
intervention in lung disease and other metastatic cancers. 

20 

SUMMARY OF THE INVENTION 
The present invention provides nucleotide sequences of genes that are up- and down- 
regulated in lung cancer cells. Such genes are useful for diagnostic purposes, and also as 
targets for screening for therapeutic compounds that modulate lung cancer, such as 

25 antibodies. The methods of detecting nucleic acids of the invention or their encoded proteins 
can be used for a number of purposes. Examples include early detection of lung cancers, 
monitoring and early detection of relapse following treatment of lung cancers, monitoring 
response to therapy of lung cancers, determining prognosis of lung cancers, directing therapy 
of lung cancers, selecting patients for postoperative chemotherapy or radiation therapy, 

30 selecting therapy, determining tumor prognosis, treatment, or response to treatment, and early 
detection of precancerous lesions of the lung. Examples of benign or precancerous lesions 
include: atelectasis, emphysema, brochitis, chronic obstructive pulmonary disease, fibrosis, 
hypersensitivity pneumonitis (HP), interstitial pulmonary fibrosis (IPF), asthma, and 
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bronchiectasis. Other aspects of the invention will become apparent to the skilled artisan by 

the following description of the invention. . 

In one aspect, the present invention provides a method of detecting a lung cancer- 
associated transcript in a cell from a patient, the method comprising contacting a biological 
5 sample from the patient with a polynucleotide that selectively hybridizes to a sequence at 
least 80% identical to a sequence as shown in Tables 1 A-16. Alternatively, the sample may 
be contacted with a specific binding reagent, e.g., antibody. 

In one embodiment, the polynucleotide selectively hybridizes to a sequence at least 
95% identical to a sequence as shown in Tables 1 A-l 6. In another embodiment, the 
10 polynucleotide comprises a sequence as shown in Tables 1 A-16. 

In one embodiment, the biological sample is a tissue sample, or a body fluid. In 
another embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent label. In 
one embodiment, the polynucleotide is immobilized on a solid surface. In one embodiment, 
15 the patient is undergoing a therapeutic regimen to treat lung cancer. In another embodiment, 
the patient is suspected of having lung cancer. In one embodiment, the patient is a primate, 
e.g., a human. 

In one embodiment, the method further comprises the step of amplifying nucleic acids 
before the step of contacting the biological sample with the polynucleotide. 

20 In another aspect, the present invention provides a method of monitoring the efficacy 

of a therapeutic treatment of lung cancer, the method comprising the steps of: (i) providing a 
biological sample from a patient undergoing the therapeutic treatment; and (ii) determining 
the level of a lung cancer-associated transcript in the biological sample by contacting the 
biological sample with a polynucleotide that selectively hybridizes to a sequence at least 80% 

25 identical to a sequence as shown in Tables 1 A-16, thereby monitoring the efficacy of the 
therapy. Or the sample may be evaluated for protein, e.g., contacting the sample with an 
antibody. 

In one embodiment, the method further comprises the step of: (iii) comparing the 
level of the lung cancer-associated transcript to a level of the lung cancer-associated 
30 transcript in a biological sample from the patient prior to, or earlier in, the therapeutic 
treatment. Or the sample may be evalated for comparison of protein. 

In another aspect, the present invention provides a method of monitoring the efficacy 
of a therapeutic treatment of lung cancer, the method comprising the steps of: (i) providing a 
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biological sample from a patient undergoing the therapeutic treatment; and (ii) determining 

the level of a lung cancer-associated antibody in the biological sample by contacting the 

biological sample with a polypeptide encoded by a polynucleotide that selectively hybridizes 

to a sequence at least 80% identical to a sequence as shown in Tables 1 A-16, wherein the 

5 polypeptide specifically binds to the lung cancer-associated antibody, thereby monitoring the 

efficacy of the therapy. 

In one embodiment, the method further comprises the step of: (hi) comparing the 

level of the lung cancer-associated antibody to a level of the lung cancer-associated antibody 

in a biological sample from the patient prior to, or earlier in, the therapeutic treatment. 

10 In another aspect, the present invention provides a method of monitoring the efficacy 

of a therapeutic treatment of lung cancer, the method comprising the steps of: (i) providing a 
biological sample from a patient undergoing the therapeutic treatment; and (ii) determining 
the level of a lung cancer-associated polypeptide in the biological sample by contacting the 
biological sample with an antibody, wherein the antibody specifically binds to a polypeptide 

15 encoded by a polynucleotide that selectively hybridizes to a sequence at least 80% identical 
to a sequence as shown in Tables 1 A- 16, thereby monitoring the efficacy of the therapy. 

In one embodiment, the method further comprises the step of: (iii) comparing the 
level of the lung cancer-associated polypeptide to a level of the lung cancer-associated 
polypeptide in a biological sample from the patient prior to, or earlier in, the therapeutic 

20 treatment. In one aspect, the present invention provides an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1A-16. In one embodiment, an 
expression vector or cell comprises the isolated nucleic acid. In one aspect, the present 
invention provides an isolated polypeptide which is encoded by a nucleic acid molecule 
having polynucleotide sequence as shown in Tables 1 A-16. 

25 In another aspect, the present invention provides an antibody that specifically binds to 

an isolated polypeptide which is encoded by a nucleic acid molecule having polynucleotide 
sequence as shown in Tables 1 A-16. In one embodiment, the antibody is conjugated to an 
effector component, e.g., a fluorescent label, a radioisotope or a cytotoxic chemical. In one 
embodiment, the antibody is an antibody fragment. In another embodiment, the antibody is 

30 humanized. 

In one aspect, the present invention provides a method of detecting lung cancer in a a 
patient, the method comprising contacting a biological sample from the patient with an 
antibody or protein as described herein. 
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In another aspect, the present invention provides a method of detecting antibodies 

specific to a lung cancer gene in a patient, the method comprising contacting a biological . 

sample from the patient with a polypeptide encoded by a nucleic acid comprises a sequence 

from Tables 1 A-16. 

5 In another aspect, the present invention provides a method for identifying a compound 

that modulates a lung cancer-associated polypeptide, the method comprising the steps of: (i) 
contacting the compound with a lung cancer-associated polypeptide, the polypeptide encoded 
by a polynucleotide that selectively hybridizes to a sequence at least 80% identical to a 
sequence as shown in Tables 1 A-16; and (ii) determining the functional effect of the 

1 0 compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic effect, or a 
chemical effect. In one embodiment, the polypeptide is expressed in a eukaryotic host cell or 
cell membrane. In another embodiment, the polypeptide is recombinant. In one 
embodiment, the functional effect is determined by measuring ligand binding to the 

15 polypeptide. 

In another aspect, the present invention provides a method of inhibiting proliferation 
or another critical process of a lung cancer-associated cell to treat lung cancer in a patient, the 
method comprising the step of administering to the subject a therapeutically effective amount 
of a compound identified as described herein. In one embodiment, the compound is an 
20 antibody. 

In another aspect, the present invention provides a drug screening assay comprising 
the steps of: (i) administering a test compound to a mammal having lung cancer or a cell 
isolated therefrom; (ii) comparing the level of gene expression of a polynucleotide that 
selectively hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 
25 1 A-16 in a treated cell or mammal with the level of gene expression of the polynucleotide in 
a control cell or mammal, wherein a test compound that modulates the level of expression of 
the polynucleotide is a candidate for the treatment of lung cancer. 

In one embodiment, the control is a mammal with lung cancer or a cell therefrom that 
has not been treated with the test compound. In another embodiment, the control is a normal 
30 cell or mammal, or a non-malignant lung disease. 

In another aspect, the present invention provides a method for treating a mammal 
having lung cancer comprising administering a compound identified by the assay described 
herein. 
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In another aspect, the present invention provides a pharmaceutical composition for 

treating a mammal having lung cancer, the composition comprising a compound identified by 

the assay described herein and a physiologically acceptable excipient 

5 DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the objects outlined above, the present invention provides novel 
methods for diagnosis and treatment of lung disease or cancer, as well as methods for 
screening for compositions which modulate lung cancer. 'Treatment, monitoring, detection 
or modulation of lung disease or cancer" includes treatment, monitoring, detection, or 

10 modulation of lung disease in those patients who have lung disease (whether malignant or 
non-malignant, e.g., emphysema, bronchitis, or fibrosis) as well as patients with lung cancers 
in which gene expression from a gene in Tables 1A-16 is increased or decreased, indicating 
that the subject is more likely to have disease. In particular,while these targets are identified 
primarily from lung cancer samples, these same targets are likely to be similarly found in 

1 5 analyses of other medical conditions. These other conditions may result from similar 
pathological processes which affect similar tissues, e.g., lung cancer, small cell lung 
carcinoma (oat cell carcinoma), non-small cell carcinomas (e.g., squamous cell carcinoma, 
adenocarcinoma, large cell lung carcinoma, carcinoid, granulomatous), fibrosis (idiopathic 
pulmonary fibrosis (IPF), hypersensitivity pneumonitis (HP), interstitial pneumonitis, 

20 nonspecific idiopathic pneumonitis (NSIP)), chronic obstructive pulmonary disease (COPD, 
e.g., emphysema, chronic bronchitis), asthma, bronchiectasis, and esophageal cancer. See, 
e.g., the NCI webpage and USSN 60/347,349 and USSN 60/xxx,xxx (docket LFBR-001-1P, 
filed March 29, 2002), each of which is incorporated herein by reference. The treatment may 
be of lung cancer or related condition itself, or treatment of metastasis. 

25 In particular, identification of markers selectively expressed on these cancers allows 

for use of that expression in diagnostic, prognostic, or therapeutic methods. As such, the 
invention defines various compositions, e.g., nucleic acids, polypeptides, antibodies, and 
small molecule agonists/antagonists, which will be useful to selectively identify those 
markers. For example, therapeutic methods may take the form of protein therapeutics which 

30 use the marker expression for selective localization or modulation of function (for those 
markers which have a causative disease effect), for vaccines, identification of binding 
partners, or antagonism, e.g., using antisense or RNAi. The markers may be useful for 
molecular characterization of subsets of lung diseases, which subsets may actually require 
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very different treatments. Moreover, the markers may also be important in related diseases to 
the specific cancers, e.g., which affect similar tissubs in non-malignant diseases, or have 
similar mechanisms of induction/maintenance. Metastatic processes or characteristics may 
also be targeted. Diagnostic and prognostic uses are made available, e.g., to subset related 
5 but distinct diseases, or to determine treatment strategy. The detection methods may be based 
upon nucleic acid, e.g., PCR or hybridization techniques, or protein, e.g., ELISA, imaging, 
IHC, etc. The diagnosis may be qualitative or quantitative, and may detect increases or 
decreases in expression levels. 

Tables 1 A- 1 6 provide unigene cluster identification numbers for the nucleotide 

10 sequence of genes that exhibit increased or decreased expression in lung cancer samples. The 
tables also provide an exemplar accession number that provides a nucleotide sequence that is 
part of the unigene cluster. In Table 1 A, genes marked as "target 1" or "target 2" are 
particularly useful as therapeutic targets. Genes marked as "target 3" are particularly useful 
as diagnostic markers. Genes marked as "chron" are upregulated in chronically diseased lung 

15 (e.g., emphysema, bronchitis, fibrosis) relative to lung tumors and normal tissue. In certain 
analyses, the ratio for the "chron" category was determined using the 70th percentile of 
chronically diseases lung samples divided by the 90th percentile of normal lung samples. 
The ratio for the targets was determined using the 70th percentile of lung tumor samples 
divided by the 90th percentile of normal lung samples. 

20 

Definitions 

The term "lung cancer protein" or "lung cancer polynucleotide" or "lung cancer- 
associated transcript" refers to nucleic acid and polypeptide polymorphic variants, alleles, 
mutants, and interspecies homologs that: (1) have a nucleotide sequence that has greater than 

25 about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or greater nucleotide sequence identity, 
preferably over a region of over a region of at least about 25, 50, 100, 200, 500, 1000, or 
more nucleotides, to a nucleotide sequence of or associated with a unigene cluster of Tables 
1 A-16; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an immunogen 

30 comprising an amino acid sequence encoded by a nucleotide sequence of or associated with a 
unigene cluster of Tables 1A-16, and conservatively modified variants thereof; (3) 
specifically hybridize under stringent hybridization conditions to a nucleic acid sequence, or 
the complement thereof of Tables 1A-16 and conservatively modified variants thereof; or (4) 

8 
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have an amino acid sequence that has greater than about 60% amino acid sequence identity, 

65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 

or 99% or greater amino sequence identity, preferably over a region of over a region of at 

least about 25, 50, 100, 200, 500, 1000, or more amino acid, to an amino acid sequence 

5 encoded by a nucleotide sequence of or associated with a unigene cluster of Tables 1 A-16. A 

polynucleotide or polypeptide sequence is typically from a mammal including, but not 

limited to, primate, e.g., human; rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or 

other mammal. A "lung cancer polypeptide" and a u lung cancer polynucleotide/' include 

both naturally occurring or recombinant forms. 

10 A "full length" lung cancer protein or nucleic acid refers to a lung cancer polypeptide 

or polynucleotide sequence, or a variant thereof, that contains the elements normally 
contained in one or more naturally occurring, wild type lung cancer polynucleotide or 
polypeptide sequences. The "full length" may be prior to, or after, various stages of post- 
translational processing or splicing, including alternative splicing. 

15 "Biological sample" as used herein is a sample of biological tissue or fluid that 

contains nucleic acids or polypeptides, e.g., of a lung cancer protein, polynucleotide, or 
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g., 
humans, or rodents, e.g., mice, and rats. Biological samples may also include sections of 
tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes, 

20 archival materials, blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc. 
Biological samples also include explants and primary and/or transformed cell cultures 
derived from patient tissues. A biological sample is typically obtained from a eukaryotic 
organism, most preferably a mammal such as a primate, e.g., chimpanzee or human; cow; 
dog; cat; a rodent, e.g., guinea pig, rat, mouse; rabbit; or other mammal; or a bird; reptile; 

25 fish. Livestock and domestic animals are of interest. 

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing the 

30 methods of the invention in vivo. Archival tissues or materials, having treatment or outcome 
history, will be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more nucleic 
acids or polypeptide sequences, refer to two or more sequences or subsequences that are the 
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same or have a specified percentage of amino acid residues or nucleotides that are the same 

(e.g., about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 

95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and 

aligned for maximum correspondence over a comparison window or designated region) as 

5 measured using, e.g., a BLAST or BLAST 2.0 sequence comparison algorithms with default 

parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI 

web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to 

be "substantially identical." This definition also refers to, or may be applied to, the 

complement of a test sequence. The definition also includes sequences that have deletions 

10 and/or insertions, substitutions, and naturally occurring, e.g., polymorphic or allelic variants, 

and man-made variants. As described below, the preferred algorithms can account for gaps 

and the like. Preferably, identity exists over a region that is at least about 25 amino acids or 

nucleotides in length, or more preferably over a region that is 50-100 amino acids or 

nucleotides in length. 

15 For sequence comparison, typically one sequence acts as a reference sequence, to 

which test sequences are compared. When using a sequence comparison algorithm, test and 
reference sequences are entered into a computer, subsequence coordinates are designated, if 
necessary, and sequence algorithm program parameters are designated. Preferably, default 
program parameters can be used, or alternative parameters can be designated. The sequence 

20 comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of 
contiguous positions selected from the group consisting typically of from 20 to 600, usually 
about 50 to about 200, more usually about 100 to about 150 in which a sequence may be 

25 compared to a reference sequence of the same number of contiguous positions after the two 
sequences are optimally aligned. Methods of alignment of sequences for comparison are 
well-known in the art. Optimal alignment of sequences for comparison can be conducted, 
e.g., by the local homology algorithm of Smith and Waterman (1981) Adv. Apnl. Math. 
2:482, by the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 

30 48:443, by the search for similarity method of Pearson and Lipman (1 988) Proc. Nat'l. Acad. 
Sci. USA 85:2444, by computerized implementations of these algorithms (GAP, BESTFIT, 
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer 
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Group, 575 Science Dr., Madison, WI), or by manual alignment and visual inspection (see, 

e.g., Ausubel, et al. (eds. 1995 and supplementsV Curient Protocols in Mo lecular Biology. 

Preferred examples of algorithms that are suitable for determining percent sequence 

identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, which are 

5 described in Altschul, et al (1977) Nuc. Acids Res. 25:3389-3402 and Altschul, et al. (1990) 

J. Mol. BioL 215:403-410. BLAST and BLAST 2.0 are used, with the parameters described 

herein, to determine percent sequence identity for the nucleic acids and proteins of the 

invention. Software for performing BLAST analyses is publicly available through the 

National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This 

1 0 algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short 
words of length W in the query sequence, which either match or satisfy some positive-valued 
threshold score T when aligned with a word of the same length in a database sequence. T is 
referred to as the neighborhood word score threshold (Altschul, et al, supra). These initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing 

1 5 them. The word hits are extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are calculated using, e.g., 
for nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid 
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word 

20 hits in each direction are halted when: the cumulative alignment score falls off by the 

quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 

25 uses as defaults a wordlength (W) of 1 1 , an expectation (E) of 1 0, M=5, N=-4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff and Henikoff (1989) Proc. Natl Acad. Sci. USA 89:10915) alignments (B) of 
50, expectation (E) of 10, M=5* N=-4, and a comparison of both strands. 

30 The BLAST algorithm also performs a statistical analysis of the similarity between- 

two sequences (see, e.g., Karlin and Altschul H 993^ Proc. Naf L Acad. Sci. USA 90:5873- 
5787). One measure of similarity provided by the BLAST algorithm is the smallest sum 
probability (P(N)), which provides an indication of the probability by which a match between 



11 



WO 02/086443 PCT/US02/12476 . . 

two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid 
is considered similar to a reference sequence if the smallest sum probability in a comparison 
of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably 
less than about 0.01, and mostpreferably less than about 0.001. Log values may be negative 
5 large numbers, e.g., 5, 10,20,30, 40,40,70,90, 110, 150, 170, etc. 

An indication that two nucleic acid sequences are substantially identical is that the 
polypeptide encoded by the first nucleic acid is immunologically cross reactive with the 
antibodies raised against the polypeptide encoded by the second nucleic acid. Thus, a 
polypeptide is typically substantially identical to a second polypeptide, e.g., where the two 
10 peptides differ only by conservative substitutions. Another indication that two nucleic acid 
sequences are substantially identical is that the two molecules or their complements hybridize 
to each other under stringent conditions. Yet another indication that two nucleic acid 
sequences are substantially identical is that the same primers can be used to amplify the 
sequences. 

15 A "host cell" is a naturally occurring cell or a transformed cell that contains an 

expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture 

20 Collection catalog or web site, www.atcc.org). 

The terms "isolated," '"purified," or "biologically pure" refer to material that is 
substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance liquid 

25 chromatography. A protein or nucleic acid that is the predominant species present in a 

preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
some open reading frames that naturally flank the gene and encode proteins other than protein 
encoded by the gene. The term "purified" in some embodiments denotes that a nucleic acid 
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means 

30 that the nucleic acid or protein is at least about 85% pure, more preferably at least 95% pure, 
and most preferably at least 99% pure. "Purify" or "purification" in other embodiments 
means removing at least one contaminant or component from the composition to b£ purified. 
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In this sense, purification does not require that the purified compound be homogeneous, e.g., 
100% pure. 

The terms "polypeptide," "peptide" and "protein" are used interchangeably herein to 
refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which 
5 one or more amino acid residue is an artificial chemical mimetic of a corresponding naturally 
occurring amino acid, as well as to naturally occurring amino acid polymers, those containing 
modified residues, and non-naturally occurring amino acid polymer. 

The tenn "amino acid" refers to naturally occurring and synthetic amino acids, as well 
as amino acid analogs and amino acid mimetics that function similarly to the naturally 

10 occurring amino acids. Naturally occurring amino acids are those encoded by the genetic 
code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and O-phosphoserine. Amino acid analogs refer to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, e.g., an a carbon that is 
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 

1 5 norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have 
modified R groups (e.g., norleucine) or modified peptide backbones, but retain some basic 
chemical structure as a naturally occurring amino acid. Amino acid mimetics refer to 
chemical compounds that have a structure that is different from the general chemical 
structure of an amino acid, but that function similarly to another amino acid. 

20 Amino acids may be referred to herein by either their commonly known three letter 

symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

"Conservatively modified variants" applies to both amino acid and nucleic acid 

25 sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode 

30 most proteins. For instance, the codons GCA, GCC, GCG, and GCU each encode the amino 
acid alanine. Thus, at each position where an alanine is specified by a codon, the codon can 
be altered to another of the corresponding codons described without altering the encoded 
polypeptide. Such nucleic acid variations are "silent variations," which are one species of 
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conservatively modified variations. Every nucleic acid sequence Herein which encodes a 

polypeptide also describes silent variations of the nucleic acid. In certain contexts each 

codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and 

TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a 

5 functionally similar molecule. Accordingly, a silent variation of a nucleic acid which 

encodes a polypeptide is implicit in a described sequence with respect to the expression 

product, but not necessarily with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual substitutions, 

deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which 

10 alters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded 
sequence is a "conservatively modified variant" where the alteration results in the substitution 
of an amino acid with a chemically similar amino acid. Conservative substitution tables 
providing functionally similar amino acids are well known in the art. Such conservatively 
modified variants are in addition to and do not exclude polymorphic variants, interspecies 

15 homologs, and alleles of the invention. Typically conservative substitutions include for one 
another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine 
(N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L), Methionine 
(M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 7) Serine (S), 
Threonine (T); and 8) Cysteine (C), Methionine (M) (see, e.g., Creighton, Proteins (1984)). 

20 Macromolecular structures such as polypeptide structures can be described in terms of 

various levels of organization. For a general discussion of this organization, see, e.g., 
Alberts, et al. (1994) Molecular Biology of the Cell (3 rd ed.) and Cantor and Schimmel (1980) 
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules . "Primary 
structure" refers to the amino acid sequence of a particular peptide. "Secondary structure" 

25 refers to locally ordered, three dimensional structures within a polypeptide. These structures 
are commonly known as domains. Domains are portions of a polypeptide that often form a 
compact unit of the polypeptide and are typically 25 to approximately 500 amino acids long. 
Typical domains are made up of sections of lesser organization such as stretches of 0-sheet 
and oc-helices. 'Tertiary structure" refers to the complete three dimensional structure of a 

30 polypeptide monomer. "Quaternary structure" refers to the three dimensional structure 
formed, usually by the noncovalent association of independent tertiary units. Anisotropic 
terms are also known as energy terms. 



14 



WO 02/086443 PCT/US02/12476 
"Nucleic acid'* or "oligonucleotide" or -'polynucleotide" or grammatical equivalents 

used herein means at least two nucleotides covalently linked together. Oligonucleotides are 

typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more nucleotides in length, up 

to about 100 nucleotides in length. Nucleic acids and polynucleotides are a polymers of any 

5 length, including longer lengths, e.g., 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 10,000, 

etc. A nucleic acid of the present invention will generally contain phosphodiester bonds, 

although in some cases, nucleic acid analogs are included that may have at least one different 

linkage, e.g., phosphoramidate, phosphorothioate, phosphorodithioate, or O- 

methylphophoroamidite linkages (see Eckstein (1992) Oligonucleotides and Analogues: A 

10 Practical Approach Oxford University Press); and peptide nucleic acid backbones and 
linkages. Other analog nucleic acids include those with positive backbones; non-ionic 
backbones, and noh-ribose backbones, including those described in U.S. Patent Nos. 
5,235,033 and 5,034,506, and Chapters 6 and 7, in Sanghui and Cook, eds. Carbohydrate 
Modifications in Antisense Research. ASC Symposium Series 580. Nucleic acids containing 

15 one or more carbocyclic sugars are also included within one definition of nucleic acids. 

Modifications of the ribose-phosphate backbone may be done for a variety of reasons, e.g., to 
increase the stability and half-life of such molecules in physiological environments or as 
probes on a biochip. Mixtures of naturally occurring nucleic acids and analogs can be made; 
alternatively, mixtures of different nucleic acid analogs, and mixtures of naturally occurring 

20 nucleic acids and analogs may be made. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide nucleic 
acid analogs. These backbones are substantially non-ionic under neutral conditions, in 
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
This results in two advantages. First, the PNA backbone exhibits improved hybridization 

25 kinetics. PNAs have larger changes in the melting temperature (T m ) for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4° C drop in T m for an 
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9° C. 
Similarly, due to their non-ionic nature, hybridization of the bases attached to these 
backbones is relatively insensitive to salt concentration. In addition, PNAs are not degraded 

30 by cellular enzymes, and thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or contain 
portions of both double stranded or single stranded sequence. As will be appreciated by those 
in the art, the depiction of a single strand also defines the sequence of the complementary 
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strand; thus the sequences described herein also provide the complement of the sequence. 

The nucleic acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the 

nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and combinatioiis 

of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine 

5 hypoxanthine, isocytdsine, isoguanine, etc. "Transcript typically refers to a naturally 

occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA. As used herein, the term 

"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified 

nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes non- 

naturally occurring analog structures. Thus, e.g., the individual units of a peptide nucleic 

1 0 acid, each containing a base, are referred to herein as a nucleoside. 

A "label" or a "detectable moiety" is a composition detectable by spectroscopic, 
photochemical, biochemical, immunochemical, physiological, chemical, or other physical 
means. For example, useful labels include 32 P, fluorescent dyes, electron-dense reagents, 
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins 

1 5 or other entities which can be made detectable, e.g., by incorporating a radiolabel into the 
peptide or used to detect antibodies specifically reactive with the peptide. The labels may be 
incorporated into the cancer nucleic acids, proteins, and antibodies. Many methods known in 
the art for conjugating the antibody to the label may be employed, including those methods 
described by Hunter, et al. (1962) Nature 144:945; David, et al. (1974) Biochemistry 

20 13:1014-1021; Pain, et al. (1981) J. Immunol. Meth. . 40:219-230; and Nygren (1982) I 
Histochem. and Cvtochem. 30:407-412. 

An "effector" or "effector moiety" or "effector component" is a molecule that is 
bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 

25 The "effector" can be a variety of molecules including, e.g., detection moieties including 

radioactive compounds, fluorescent compounds, an enzyme or substrate, tags such as epitope 
tags, a toxin; activatable moieties, a chemotherapeutic agent; a lipase; an antibiotic; or a 
radioisotope emitting "hard" e.g., beta radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 

30 covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, method 



16 



WG.02/086443 . . . " PCT/US02/12476 

using high affinity interactions may achieve the same results where one of a pair of binding 

partners binds to the other, e.g., biotin, streptavidin. 

As used herein a "nucleic acid probe or oligonucleotide" is a nucleic acid capable of 

binding to a target nucleic acid of complementary sequence through one or more types of 

5 chemical bonds, usually through complementary base pairing, e.g., through hydrogen bond 

formation. As used herein, a probe may include natural (i.e., A, G, C, or T) or modified bases 

(7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be joined by a 

linkage other than a phosphodiester bond, preferably one that does not functionally interfere 

with hybridization. Thus, e.g., probes may be peptide nucleic acids in which the constituent 

1 0 bases are joined by peptide bonds rather than phosphodiester linkages. Probes may bind 

target sequences lacking complete complementarity with the probe sequence depending upon 
the stringency of the hybridization conditions. The probes are preferably directly labeled, 
e.g., with isotopes, chromophores, lumiphores, chromogens, or indirectly labeled, e.g., with 
biotin to which a streptavidin complex may later bind. By assaying for the presence or 

1 5 absence of the probe, one can detect the presence or absence of the select sequence or 

subsequence. Diagnosis or prognosis may be based at the genomic level, or at the level of 
RNA or protein expression. 

The term "recombinant* ' when used with reference, e.g., to a cell, or nucleic acid, 
protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by 

20 the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic 
acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., recombinant 
cells express genes that are not found within the native (non-recombinant) form of the cell or 
express native genes that are otherwise abnormally expressed, under expressed or not 
expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid, 

25 originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using 
polymerases and endonucleases, in a form not normally found in nature. In this manner, 
operably linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear 
form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 

30 understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, i.e., using the in vivo cellular machinery of the 
host cell rather than in vitro manipulations; however, such nucleic acids, once produced 
recombinantly, although subsequently replicated non-recombinantly, are still considered 
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recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein 

made using recombinant techniques, i.e., through the expression of a recombinant nucleic 
acid as depicted above. 

The term "heterologous" when used with reference to portions of a nucleic acid 
5 indicates that the nucleic acid comprises two or more subsequences that are not normally 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
coding region from another source. Similarly, a heterologous protein will often refer to two 
10 or more subsequences that are not found in the same relationship to each other in nature (e.g., 
a fusion protein). 

A "promoter" is typically an array of nucleic acid control sequences that direct 
transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic acid 
sequences near the start site of transcription, such as, in the case of a polymerase II type 

1 5 promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 
active under environmental or developmental regulation. The term "operably linked" refers 

20 to a functional linkage between a nucleic acid expression control sequence (such as a 

promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
e.g., wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated recombinantly or 

25 synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
transcribed in operable linkage to a promoter. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, 

30 duplexing, or hybridizing of a molecule selectively to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
total cellular or library DNA or RNA). 
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The phrase "stringent hybridization conditions" refers to conditions under which a 

probe will hybridize to its target subsequence, typically in a complex mixture of nucleic 

acids, but to essentially no other sequences. Stringent conditions are sequence-dependent and 

will be different in different circumstances. Longer sequences hybridize specifically at 

5 higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 

"Overview of principles of hybridization and the strategy of nucleic acid assays" inTijssen 

(1993) Techniques in Biochemistry and Molecular Biolo gv-Hvbridization with Nucleic 

Probes (vol. 24) Elsevier. Generally, stringent conditions are selected to be about 5-10° C 

lower than the thermal melting point (T m ) for the specific sequence at a defined ionic strength 

10 pH. The T^ is the temperature (under defined ionic strength, pH, and nucleic concentration) 
at which 50% of the probes complementary to the target hybridize to the target sequence at 
equilibrium (as the target sequences are present in excess, at T m , 50% of the probes are 
occupied at equilibrium). Stringent conditions will be those in which the salt concentration is 
less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or 

1 5 other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C for short probes (e.g., 
10 to 50 nucleotides) and at least about 60° C for long probes (e.g., greater than 50 
nucleotides). Stringent conditions may also be achieved with the addition of destabilizing 
agents such as formamide. For selective or specific hybridization, a positive signal is 
typically at least two times background, preferably 10 times background hybridization. 

20 Exemplary stringent hybridization conditions are often: 50% formamide, 5x SSC, and 1% 
SDS, incubating at 42° C, or, 5x SSC, 1% SDS, incubating at 65° C, with wash in 0.2x SSC, 
and 0. 1% SDS at 65° C. For PCR, a temperature of about 36° C is typical for low stringency 
amplification, although annealing temperatures may vary between about 32° C and 48° C 
depending on primer length. For high stringency PCR amplification, a temperature of about 

25 62° C is typical, although high stringency annealing temperatures can range from about 50° C 
to about 65° C, depending on the primer length and specificity. Typical cycle conditions for 
both high and low stringency amplifications include a denaturation phase of 90° C - 95° C for 
0.5 - 2 min., an annealing phase lasting 0.5 - 2 min, and an extension phase of about 72° C 
for 1 - 2 min. Protocols and guidelines for low and high stringency amplification reactions 

30 are provided, e.g., in Innis, et al.(1990) PCR Protocols. A Guide to Methods and 
Ap plications . 

Nucleic acids that do not hybridize to each other under stringent conditions are still 
substantially identical if the polypeptides which they encode are substantially identical. This 
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occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy 

permitted by the genetic code. In such cases, the nucleic acids typically hybridize under 
moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 
5 1% SDS at 37° C, and a wash in IX SSC at 45° C. A positive hybridization is at least twice 
background. Alternative hybridization and wash conditions can be utilized to provide 
conditions of similar stringency. Additional guidelines for determining hybridization 
parameters are provided in numerous reference, e.g., Ausubel, et al. (ed.) Current Protocols in 
Molecular Biology Lippincott. 

10 The phrase "functional effects" in the context of assays for testing compounds that 

modulate activity of a lung cancer protein includes the determination of a parameter that is 
indirectly or directly under the influence of the lung cancer protein or nucleic acid, e.g., a 
physiological, enzymatic, functional, physical, or chemical effect, such as the ability to 
decrease lung cancer. It includes ligand binding activity; cell viability, cell growth on soft 

15 agar; anchorage dependence; contact inhibition and density limitation of growth; cellular 
proHferation; cellular transformation; growth factor or serum dependence; tumor specific 
marker levels; invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and 
protein expression in cells undergoing metastasis, and other characteristics of lung cancer 
cells. "Functional effects" include in vitro, in vivo, and ex vivo activities. 

20 By "determining the functional effect" is meant assaying for a compound that 

increases or decreases a parameter that is indirectly or directly under the influence of a lung 
cancer protein sequence, e.g., physiological, functional, enzymatic, physical, or chemical 
effects. Such functional effects can be measured by many means known to those skilled in 
the art, e.g., changes in spectroscopic characteristics (e.g., fluorescence, absorbance, 

25 refractive index), hydrodynamic (e.g., shape), chromatographic, or solubility properties for 
the protein, measuring inducible markers or transcriptional activation of the lung cancer 
protein; measuring binding activity or binding assays, e.g., binding to antibodies or other 
ligands, and measuring cellular proliferation. Determination of the functional effect of a 
compound on lung cancer can also be performed using lung cancer assays known to those of 

30 skill in the art such as an in vitro assays, e.g., cell growth on soft agar; anchorage 

dependence; contact inhibition and density limitation of growth; cellular prohferation; 
cellular transformation; growth factor or serum dependence; tumor specific marker levels; 
invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and protein 
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expression in cells undergoing metastasis, and other characteristics of lung cancer cells. The 
functional effects can be evaluated by many means known to those skilled in the art, e.g., 
microscopy for quantitative or qualitative measures of alterations in morphological features, 
measurement of changes in RNA or protein levels for lung cancer-associated sequences, 
5 measurement of RNA stability, identification of downstream or reporter gene expression ' 
(CAT, luciferase, (3-gal, GFP, and the like), e.g., via chemUuminescence, fluorescence, 
colorimetric reactions, antibody binding, inducible markers, and ligand binding assays. 

'Inhibitors", "activators", and ''modulators" of lung cancer polynucleotide and 
polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules or 

10 compounds identified using in vitro and in vivo assays of lung cancer polynucleotide and 

polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block 
activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the 
activity or expression of lung cancer proteins, e.g., antagonists. Antisense or inhibitory 
nucleic acids may seem to inhibit expression and subsequent function of the protein. 

15 "Activators" are compounds that increase, open, activate, facilitate, enhance activation, 
sensitize, agonize, or up regulate lung cancer protein activity. Inhibitors, activators, or 
modulators also include genetically modified versions of lung cancer proteins, e.g., versions 
with altered activity, as well as naturally occurring and synthetic ligands, antagonists, 
agonists, antibodies, small chemical molecules and the like. Such assays for inhibitors and 

20 activators include, e.g., expressing the lung cancer protein in vitro, in cells, or cell 

membranes, applying putative modulator compounds, and then determining the functional 
effects on activity, as described above. Activators and inhibitors of lung cancer can also be 
identified by incubating lung cancer cells with the test compound and determining increases 
or decreases in the expression of 1 or more lung cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 

25 25, 30, 40, 50 or more lung cancer proteins, such as lung cancer proteins encoded by the 
sequences set out in Tables 1A-16. 

Samples or assays comprising lung cancer proteins that are treated with a potential 
activator, inhibitor, or modulator are compared to control samples without the inhibitor, 
activator, or modulator to examine the extent of inhibition. Control samples (untreated with 

30 inhibitors) are assigned a relative protein activity value of 100%. Inhibition of a polypeptide 
is achieved when the activity value relative to the control is about 80%, preferably 50%, more 
preferably 25-0%. Activation of a lung cancer polypeptide is achieved when the activity 
value relative to the control (untreated with activators) is 1 10%, more preferably 150%, more 
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preferably 200-500% (i.e., two to five fold higher relative to the control), more preferably 

1000-3000% higher. 

The phrase "changes in cell growth" refers to any change in cell growth and 
proliferation characteristics in vitro ox in vivo, such as cell viability, formation of foci, 

5 anchorage independence, semi-solid or soft agar growth, changes in contact inhibition and 
density limitation of growth, loss of growth factor or serum requirements, changes in cell 
morphology, gaining or losing immortalization, gaining or losing tumor specific markers, 
ability to form or suppress tumors when injected into suitable animal hosts, and/or 
immortalization of the cell. See, e.g., Freshney (1994) Culture of Animal Cells a Manual of 

10 Basic Technique pp. 231-241 (3 rd ed.). 

'Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 
"Cancer cells," "transformed" cells, or "transformation" in tissue culture, refers to 
spontaneous or induced phenotypic changes that do not necessarily involve the uptake of new 
genetic material. Although transformation can arise from infection with a transforming vims 

15 and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise 

spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene. 
Transformation is associated with phenotypic changes, such as immortalization of cells, 
aberrant growth control, nonmorphological changes, and/or malignancy (see, Freshney 
(1994) Culture of Animal Cells a Manual of B asic Technique (3 rd ed.)). 

20 "Antibody" refers to a polypeptide comprising a framework region from an 

immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region 
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 

25 gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgD, and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul, 
Fundamental Immunology . 

An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. Each 

30 tetramer is composed of two identical pairs of polypeptide chains, each pair having one 
"light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each 
chain defines a variable region of about 100 to 1 10 or more amino acids primarily responsible 
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for antigen recognition. The terms variable light chain (V L ) and variable heavy chain (Vh) 

refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized 

fragments produced by digestion with various peptidases. Thus, e.g., pepsin digests an 

5 antibody below the disulfide linkages in the hinge region to produce F(ab)' 2> a dimer of Fab 
which itself is a light chain joined to V H -C H 1 by a disulfide bond. The F(ab)' 2 may be 
reduced under mild conditions to break the disulfide linkage in the hinge region, thereby 
converting the F(ab)' 2 dimer into an Fab' monomer. The Fab' monomer is essentially Fab 
with part of the hinge region (see Paul (ed. 1999) Fundamental Immunology (4th ed.). While 

10 various antibody fragments are defined in terms of the digestion of an intact antibody, one of 
skill will appreciate that such fragments may be synthesized de novo either chemically or by 
using recombinant DNA methodology. Thus, the term antibody, as used herein, also includes 
antibody fragments either produced by the modification of whole antibodies, or those 
synthesized de novo using recombinant DNA methodologies (e.g., single chain Fv) or those 

15 identified using phage display libraries (see, e.g., McCafferty, et al. (1990) Nature 348:552- 
554). 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 
antibodies, many technique known in the art can be used (see, e.g., Kohler and Milstein 
(1975) Nature 256:495-497; Kozbor, et al. (1983) Immunology Today 4:72; Cole, et al. 

20 (1985), pp. 77-96 in Monoclonal Antibodies and Cancer Therapy: Coligan (1991 and 
supplements) Current Protocols in Tmmunoloev : Harlow and Lane (1988) Antibodies, A 
T.flhnratorv Manual: and Goding (1986) Monoclonal A ntibodies: Principles and Practice (2d 
ed.)). Techniques for the production of single chain antibodies (U.S. Patent 4,946,778) can 
be adapted to produce antibodies to polypeptides of this invention. Also, transgenic mice, or 

25 other organisms such as other mammals, may be used to express humanized antibodies. 
Alternatively, phage display technology can be used to identify antibodies and heteromeric 
Fab fragments that specifically bind to selected antigens (see, e.g., McCafferty, et al. (1990) 
Nature 348:552-554; Marks, et al. (1992) Biotechnology 10:779-783). 

A "chimeric antibody" is an antibody molecule in which, e.g, (a) the constant region, 

30 or a portion thereof, is altered, replaced, or exchanged so that the antigen binding site 
(variable region) is linked to a constant region of a different or altered class, effector 
function, and/or species, or an entirely different molecule which confers new properties to the 
chimeric antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the 
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variable region, or a portion thereof, is altered, replaced, or exchanged with a variable region 

having a different or altered antigen specificity. 



Identification of lung cancer-associated sequences 

5 In one aspect, the expression levels of genes are determined in different patient 

samples for which diagnosis information is desired, to provide expression profiles. An 
expression profile of a particular sample is essentially a "fingerprint" of the state of the 
sample; while two states may have any particular gene similarly expressed, the evaluation of 
a number of genes simultaneously allows the generation of a gene expression profile that is 

10 characteristic of the state of the cell. That is, normal tissue may be distinguished from 
cancerous or metastatic cancerous tissue, or metastatic cancerous tissue can be compared 
with tissue from surviving cancer patients. By comparing expression profiles of tissue in 
known different lung cancer states, information regarding which genes are important 
(including both up- and down-regulation of genes) in each of these states is obtained. 

1 5 Molecular profiling may distinguish subtypes of a currently collective disease designation, 
e.g., different forms of lung cancer (chronic disease, adenocarcinoma, etc.) 

The identification of sequences that are differentially expressed in lung cancer versus 
non-lung cancer tissue allows the use of this information in a number of ways. For example, 
a particular treatment regime may be evaluated: does a chemotherapeutic drug act to down- 

20 regulate lung cancer, and thus tumor growth or recurrence, in a particular patient. 

Alternatively, a treatment step may induce other markers which may be used as targets to 
destroy tumor cells. Similarly, diagnosis and treatment outcomes may be done or confirmed 
by comparing patient samples with the known expression profiles. Malignant diseasemay be 
compared to non-malignant conditions. Metastatic tissue can also be analyzed to determine 

25 the stage of lung cancer in the tissue, or origin of primary tumor, e.g., metastasis from a 

remote primary site. Furthermore, these gene expression profiles (or individual genes) allow 
screening of drug candidates with an eye to mimicking or altering a particular expression 
profile; e.g., screening can be done for drugs that suppress the lung cancer expression profile. 
This may be done by making biochips comprising sets of the important lung cancer genes, 

30 which can then be used in these screens. PGR methods may be applied with selected primer 
pairs, and analysis may be of RNA or of genomic sequences. These methods can also be. 
done on the protein basis; that is, protein expression levels of the lung cancer proteins can be 
evaluated for diagnostic purposes or to screen candidate agents. In addition, the lung cancer 
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. nucleic acid sequences.can be administered for gene therapy purposes, including the 

administration of antisense nucleic acids, or the lung cancer proteins (including antibodies 

and other modulators thereof) administered as therapeutic drugs or as protein or DNA 

vaccines. 

5 Thus the present invention provides nucleic acid and protein sequences that are 

differentially expressed in lung cancer relative to normal tissues and/or non-malignant lung 
disease, or in different types of lung disease, herein termed "lung cancer sequences." As 
outlined below, lung cancer sequences include those that are up-regulated (i.e., expressed at a 
higher level) in lung cancer, as well as those that are down-regulated (i.e., expressed at a 

10 lower level). In a preferred embodiment, the lung cancer sequences are from humans; 
however, as will be appreciated by those in the art, lung cancer sequences from other 
organisms may be useful in animal models of disease and drug evaluation; thus, other lung 
cancer sequences are provided, from vertebrates, including mammals, including rodents (rats, 
mice, hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows, 

15 horses, etc.) and pets (dogs, cats, etc.). Lung cancer sequences from other organisms may be 
obtained using the techniques outlined below. 

Lung cancer sequences can include both nucleic acid and amino acid sequences. As 
will be appreciated by those in the art and is more fully outlined below, lung cancer nucleic 
acid sequences are useful in a variety of applications, including diagnostic applications, 

20 which will detect naturally occurring nucleic acids, as well as screening applications; e.g., 
biochips comprising nucleic acid probes or PCR microtiter plates with selected probes to the 
lung cancer sequences can be generated. 

A lung cancer sequence can be initially identified by substantial nucleic acid and/or 
amino acid sequence; homology to the lung cancer sequences outlined herein. Such 

25 homology can be based upon the overall nucleic acid or amino acid sequence, and is 

generally determined as outlined below, e.g., using homology programs or hybridization 
conditions. 

For identifying lung cancer-associated sequences, the lung cancer screen typically 
includes comparing genes identified in different tissues, e.g., normal and cancerous tissues, 
30 cancer and non-malignant conditions, non-malignant conditions and normal tissues, or tumor 
tissue samples from patients who have metastatic disease vs. non metastatic tissue. Other 
suitable tissue comparisons include comparing lung cancer samples with metastatic cancer 
samples from other cancers, such as, breast, other gastrointestinal cancers, prostate, ovarian, 
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etc. Samples of, non metastatic disease tissue and tissue undergoing metastasis are applied to 

biochips comprising nucleic acid probes. The samples are fust microdissected, if applicable, 
and treated as is known in the art for the preparation of mRNA. Suitable biochips are 
commercially available, e.g., from Afiymetrix, Santa Clara, CA. Gene expression profiles as 
5 described herein are generated and the data analyzed. 

In one embodiment, the genes showing changes in expression as between normal and 
disease states are compared to genes expressed in other normal tissues, preferably normal 
lung, but also including, and not limited to colon, heart, brain, liver, breast, kidney, muscle, 
prostate, small intestine, large intestine, spleen, bone, and/or placenta. In a preferred 
1 0 embodiment, those genes identified during the lung cancer screen that are expressed in 
significant amounts in other tissues (e.g., essential organs) are removed from the profile, 
although in some embodiments, this is not necessary (e.g., where organs may be dispensible 
at a later stage of life). That is, when screening for drugs, it is usually preferable that the 
target expression be disease specific, to minimize possible side effects on other organs. 
15 In a preferred embodiment, lung cancer sequences are those that are up-regulated in 

lung cancer; that is, the expression of these genes is higher in cancerous tissue than in normal 
lung or other tissue. "Up-regulation" as used herein means, when the ratio is presented as a 
number greater than one, that the ratio is greater than one, preferably 1.5 or greater, more 
preferably 2.0 or greater. Another embodiment is directed to sequences up-regulated in non- 
20 malignant conditions relative to normal. Unigene cluster identification numbers and 

accession numbers herein are for the GenBank sequence database and the sequences of the 
accession numbers are hereby expressly incorporated by reference. GenBank is known in the 
art, see, e.g., Benson, DA, et ai (1998) Nucleic Acids Research 26:1-7 and 
http://www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, e.g., 
25 European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ). 
Another embodiment is directed to sequences up-regulated in non-malignant conditions 
relative to normal. In some situations, the sequences may be derived from assembly of 
available sequences or be predicted from genomic DNA using exon prediction algorithms, 
such as FGENESH (Salamov and Solovyev (2000) Genome Res. 10:516-522). In other 
30 situations, sequences have been derived from cloning and sequencing of isolated nucleic 
acids. 

In another preferred embodiment, lung cancer sequences are those that are dojvn- . 
regulated in the lung cancer; that is, the expression of these genes is lower in cancerous tissue 
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or normal lung or other tissue. 'Down-regulation" as used herein means, when the ratio is 

presented as a number greater than one, that the ratio is greater than one, preferably 1 .5 or 

greater, more preferably 2.0 or greater, or, when the ratio is presented as a number less than 

one, that the ratio is less than one, preferably 0.5 or less, more preferably 0.25 or less. 

5 

Informatics 

The ability to identify genes that are over or under expressed in lung cancer can 
additionally provide high-resolution, high-sensitivity datasets which can be used in the areas 
of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure, 

10 biosensor development, and other related areas. For example, the expression profiles can be 
used in diagnostic or prognostic evaluation of patients with lung cancer. Or as another 
example, subcellular toxicoiogical information can be generated to better direct drug structure 
and activity correlation (see Anderson (1998) Pharmaceutical Proteomics: Targets, 
Mechanism, and Function, paper presented at the IBC Proteomics conference, Coronado, CA 

15 (June 11-12, 1998)). Subcellular toxicoiogical information can also be utilized in a biological 
sensor device to predict the likely toxicoiogical effect of chemical exposures and likely 
tolerable exposure thresholds (see U.S. Patent No. 5,81 1,231). Similar advantages accrue 
from datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, 
saccharides, lipids, drugs, and the like). 

20 Thus, in another embodiment, the present invention provides a database that includes 

at least one set of assay data. The data contained in the database is acquired, e.g., using array 
analysis either singly or in a library format. The database can be in a form in which data can 
be maintained and transmitted, but is preferably an electronic database. The electronic 
database of the invention can be maintained on any electronic device allowing for the storage 

25 of and access to the database, such as a personal computer, but is preferably distributed on a 
wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence data is for 
clarity of illustration only. It will be apparent to those of skill in the art that similar databases 
can be assembled for assay data acquired using an assay of the invention. 

30 The compositions and methods for identifying and/or quantitating the relative and/or 

absolute abundance of a variety of molecular and macromolecular species from a biological 
sample representing lung cancer, i.e., the identification of lung cancer-associated sequences 
described herein, provide an abundance of information, which can be correlated with 
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pathological conditions, predisposition to disease, drug testing, therapeutic monitoring, gene- 
disease causal linkages, identification of correlates of immunity and physiological status,, 
among others. Although the data generated from the assays of the invention is suited for 
manual review and analysis, in a preferred embodiment, data processing using high-speed 

5 computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is known 
in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational database 
system for storing biomolecular sequence information in a manner that allows sequences to 
be catalogued and searched according to one or more protein function hierarchies. U.S. 

10 Patent 5,953,727 discloses a relational database having sequence records containing 
information in a format that allows a collection of partial-length DNA sequences to be 
catalogued and searched according to association with one or more sequencing projects for 
obtaining full-length sequences from the collection of partial length sequences. U.S. Patent 
5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 

15 sequence similar to a sequence data item in a gene database based on the degree of similarity 
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 
in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 

20 dimensional database comprising a functionality for multi-dimensional data analysis 
described as on-line analytical processing (OLAP), which entails the consolidation of 
projected and actual data according to more than one consolidation path or dimension. U.S. 
Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 

25 fields stored in a hierarchical topological map which can be viewed as a tree structure or as 
the merger of two or more such tree structures. 

See also Mount, et al. (2001) Bioinformatics; Durbin, et al. (eds., 1999) Biological 
Sequence Analysis: Probabilistic Models of Pr »t*im and Nucleic Acids (; Baxevanis and 
Oeullette (eds., 1998) Rininformatics: A Practical Guide to the Analysis of Genes and 

30 Proteins) : Rashidi and Buehler (1 999) ftminformatics: Basic Applications in Biological 
Science and Medicine ; Setubal, et al. (eds 1997) Introduction to Computational Molecular 
Biology ; Misener and Krawetz (eds, 2000) Bioinformatics- Methods and Protocols; Higgins 
and Taylor (eds., 2000) Rimnformatics : Sequence. Structure, and Databanks: A Practical 
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Approach: Brown f200n Bioinfonnatics: A Biologist's Guide to Biocomputing and the 

Internet : Han and Kamber (2000) Data Mining: Concents and Techniques (2000); and 

Waterman fl995^ Introduction to Computational Biolo gv: Mans. Sequences, and Genomes. 

The present invention provides a computer database comprising a computer and 

5 software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 

with data specifying the source of the target-containing sample from which each sequence 

specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing sample 

is from a control tissue sample known to be free of pathological disorders. In a variation, at 

1 0 least one of the sources is a known pathological tissue specimen, e.g., a neoplastic lesion or 
another tissue specimen to be analyzed for lung cancer. In another variation, the assay 
records cross-tabulate one or more of the following parameters for each target species in a 
sample: (1) a unique identification code, which can include, e.g., a target molecular structure 
and/or characteristic separation coordinate (e.g., electrophoretic coordinates); (2) sample 

1 5 source; and (3) absolute and/or relative quantity of the target species present in the sample. 

The invention also provides for the storage and retrieval of a collection of target data 
in a computer data storage apparatus, which can include magnetic disks, optical disks, 
magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, magnetic 
bubble memory devices, and other data storage devices, including CPU registers and on-CPU 

20 data storage arrays. Typically, the target data records are stored as a bit pattern in an array of 
magnetic domains on a magnetizable medium or as an array of charge states or transistor gate 
states, such as an array of cells in a DRAM device (e.g., each cell comprised of a transistor 
and a charge storage area, which may be on the transistor). In one embodiment, the invention 
provides such storage devices, and computer systems built therewith, comprising a bit pattern 

25 encoding a protein expression fingerprint record comprising unique identifiers for at least 10 
target data records cross-tabulated with target source. 

When the target is a peptide or nucleic acid, the invention preferably provides a 
method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 

30 or retrieved from a computer storage device or database and at least one other sequence. The 
comparison can include a sequence analysis or comparison algorithm or computer program 
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may 
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be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences 

determined from a polypeptide or nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM-compatible 

(DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format (e.g., Linux, 

5 SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or hard (fixed, 

Winchester) disk drive, comprising a bit pattern encoding data from an assay of the invention 

in a file format suitable for retrieval and processing in a computerized sequence analysis, 

comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computing devices 

10 linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, ISDN 

line, wireless network, optical fiber, or other suitable signal transmission medium, whereby at 
least one network device (e.g., computer, disk array, etc.) comprises a pattern of magnetic 
domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM cells) 
composing a bit pattern encoding data acquired from an assay of the invention. 

1 5 The invention also provides a method for transmitting assay data that includes 

generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
database comprising a plurality of assay results obtained by the method of the invention. 

20 In a preferred embodiment, the invention provides a computer system for comparing a 

query target to a database containing an array of data structures, such as an assay result 
obtained by the method of the invention, and ranking database targets based on the degree of 
identity and gap weight to the target data. A central processor is preferably initialized to load 
and execute the computer program for alignment and/or comparison of the assay results. 

25 Data for a query target is entered into the central processor via an I/O device. Execution of 
the computer program results in the central processor retrieving the assay data from the data 
file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to secondary 
memory, which is typically random access memory (e.g., DRAM, SRAM, SGRAM, or 

30 SDRAM). Targets are ranked according to the degree of correspondence between a selected 
assay characteristic (e.g., binding to a selected affinity moiety) and the same characteristic of 
the query target and results are output via an I/O device. For example, a central processor 
can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, PA-8000, SPARC, 
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MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or public domain 

molecular biology software package (e.g., UWGCG Sequence Analysis Software, Darwin); a 
data file can be an optical or magnetic disk, a data server, a memory device (e.g., DRAM, 
SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, etc.); an I/O device can 
5 be a terminal comprising a video display and a keyboard, a modem, an ISDN terminal 

adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or other suitable I/O 
device. 

The invention also preferably provides the use of a computer system, such as that 
described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 
10 collection of peptide sequence specificity records obtained by the methods of the invention, 
which may be stored in the computer; (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 

15 Characteristics of lung cancer-associated proteins 

Lung cancer proteins of the present invention may be classified as secreted proteins, 
transmembrane proteins or intracellular proteins. In one embodiment, the lung cancer protein 
is an intracellular protein. Intracellular proteins may be found in the cytoplasm and/or in the 
nucleus. Intracellular proteins are involved in all aspects of cellular function and replication 

20 (including, e.g., signaling pathways); aberrant expression of such proteins often results in 
unregulated or disregulated cellular processes (see, e.g., Alberts (ed. 1994) Molecular 
Biology of the Cell (3d ed.). For example, many intracellular proteins have enzymatic 
activity such as protein kinase activity, protein phosphatase activity, protease activity, 
nucleotide cyclase activity, polymerase activity and the like. Intracellular proteins also serve 

25 as docking proteins that are involved in organizing complexes of proteins, or targeting 

proteins to various subcellular localizations, and are involved in maintaining the structural 
integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence in the 
proteins of one or more structural motifs for which defined functions have been attributed. In 

30 addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SHE) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner, PTB domains, which are distinct from SH2 



31 



WO 02/086443 PCT/US02/12476 
domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich 

targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 

few, have been shown to mediate protein-protein interactions. Some of these may also be 

involved in binding to phospholipids or other second messengers. As will be appreciated by 

5 one of ordinary skill in the art, these motifs can be identified on the basis of amino acid 

sequence; thus, an analysis of the sequence of proteins may provide insight into both the 

enzymatic potential of the molecule and/or molecules with which the protein may associate. 

One useful database is Pfam (protein families), which is a large collection of multiple 

sequence alignments and hidden Markov models covering many common protein domains. 

1 0 Versions are available via the internet from Washington University in St. Louis, the Sanger 

Center in England, and the Karolinska Institute in Sweden (see, e.g., Bateman, et al (2000) 

Nuc. Acids Res. 28:263-266- Sonnhammer, et al. (1997) Proteins 28:405-420; Bateman, et al. 

(1999) Nuc. Acids Res. 27:260-262; and Sonnhammer, et al. (1998) Nuc. Acids Res. 26:320- 

322). 

15 In another embodiment, the lung cancer sequences are transmembrane proteins. 

Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. They may 
have an intracellular domain, an extracellular domain, or both. The intracellular domains of 
such proteins may have a number of functions including those already described for 
intracellular proteins. For example, the intracellular domain may have enzymatic activity 

20 and/or may serve as a binding site for additional proteins. Frequently the intracellular 

domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation 
of tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domain 
containing proteins. 

25 Transmembrane proteins may contain from one to many transmembrane domains. 

For example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl cyclases 
and receptor serine/threonine protein kinases contain a single transmembrane domain. 
However, various other proteins including channels, pumps, and adenylyl cyclases contain 
numerous transmembrane domains. Many important cell surface receptors such as G protein 

30 coupled receptors (GPCRs) are classified as "seven transmembrane domain" proteins, as they 
contain 7 membrane spanning regions. Characteristics of transmembrane domains include 
approximately 1 7 consecutive hydrophobic amino acids that may be followed by charged 
amino acids. Therefore, upon analysis of the amino acid sequence of a particular protein, the 
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localization and number of transmembrane domains within the protein may be predicted (see, 

e.g., PSORT web site http://psort.nibb.ac.jp/). 

The extracellular domains of transmembrane proteins are diverse; however, conserved 

motifs are found repeatedly among various extracellular domains. Conserved structure 

5 and/or functions have been ascribed to different extracellular motifs. Many extracellular 

domains are involved in binding to other molecules. In one aspect, extracellular domains are 

found on receptors. Factors that bind the receptor domain include circulating ligands, which 

may be peptides, proteins, or small molecules such as adenosine and the like. For example, 

growth factors such as EGF, FGF, and PDGF are circulating growth factors that bind to their 

10 cognate receptors to initiate a variety of cellular responses. Other factors include cytokines, 
mitogenic factors, hormones, neurotrophic factors and the like. Extracellular domains also 
bind to cell-associated molecules. In this respect, they may mediate cell-cell interactions. 
Cell-associated ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol 
(GPI) anchor, or may themselves be transmembrane proteins. Extracellular domains may 

1 5 also associate with the extracellular matrix and contribute to the maintenance of the cell 
structure. 

Lung cancer proteins that are transmembrane are particularly preferred in the present 
invention as they are readily accessible targets for extracellular immunotherapeutics, as are 
described herein In addition, as outlined below, transmembrane proteins can be also useful 

20 in imaging modalities. Antibodies may be used to label such readily accessible proteins in 
situ or in histological analysis. Alternatively, antibodies can also label intracellular proteins, 
in which case analytical samples are typically permeablized to provide access to intracellular 
proteins. In addition, some membrane proteins can be processed to release a soluble protein, 
or to expose a residual fragment. Released soluble proteins may be useful diagnostic 

25 markers, processed residual protein fragments may be useful lung markers of disease. 

It will also be appreciated by those in the art that a transmembrane protein can be 
made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble can be made to be 
secreted through recombinant means by adding an appropriate signal sequence. 

30 In another embodiment, the lung cancer proteins are secreted proteins; the secretion of 

which can be either constitutive or regulated. These proteins may have a signal peptide or 
signal sequence that targets the molecule to the secretory pathway. Secreted proteins are 
involved in numerous physiological events; e.g., if circulating, they often serve to transmit 



33 



WO 02/086443 PCT/US02/12476 
signals to various other cell types. The secreted protein may function in an autocrine manner 

(acting on the cell that secreted the factor), a paracrine manner (acting on cells in close 

proximity to the cell that secreted the factor), an endocrine manner (acting on cells at a 

distance, e.g., secretion into the blood stream), or exocrine (secretion, e.g., through . a duct or 

5 to adjacent epithelial surface as sweat glands, sebaceous glands, pancreatic ducts, lacrimal 

glands, mammary glands, sax producing glands of the ear, etc.). Thus secreted molecules 

often find use in modulating or altering numerous aspects of physiology. Lung cancer 

proteins that are secreted proteins are particularly preferred in the present invention as they 

serve as good targets for diagnostic markers, e.g., for blood, plasma, serum, or stool tests. 

1 0 Those which are enzymes may be antibody or small molecule targets. Others may be useful 

as vaccine targets, e.g., via CTL mechanisms. 

Use of lung cancer nucleic acids 

As described above, lung cancer sequence is initially identified by substantial nucleic 
1 5 acid and/or amino acid sequence homology or linkage to the lung cancer sequences outlined 
herein. Such homology can be based upon the overall nucleic acid or amino acid sequence, 
and is generally determined as outlined below, using either homology programs or 
hybridization conditions. Typically, linked sequences on a mRNA are found on the same 
molecule. 

20 The lung cancer nucleic acid sequences of the invention, e.g., the sequences in Tables 

1A-16, can be fragments of larger genes, i.e., they are nucleic acid segments. "Genes" in this 
context includes coding regions, non-coding regions, and mixtures of coding and non-coding 
regions. Accordingly, as will be appreciated by those in the art, using the sequences provided 
herein, extended sequences, in either direction, of the lung cancer genes can be obtained, 

25 using techniques well known in the art for cloning either longer sequences or the full length 
sequences; see Ausubel, et al., supra. Much can be done by informatics and many sequences 
can be clustered to include multiple sequences corresponding to a single gene, e.g., systems 
such as UniGene (see, http://www.ncbi.nlm.nih.govAJniGene/). 

Once a lung cancer nucleic acid is identified, it can be cloned and, if necessary, its 

30 constituent parts recombined to form the entire lung cancer nucleic acid coding regions or the 
entire mRNA. sequence. Once isolated from its natural source, e.g., contained within a 
plasmid or other vector or excised therefrom as a linear nucleic acid segment, the 
recombinant lung cancer nucleic acid can be further-used as a probe to identify and isolate 
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other lung cancer nucleic acids, e.g., extended coding regions. It can also be used as a 
''precursor" nucleic acid to make modified or variant lung cancer nucleic acids and proteins. 

The lung cancer nucleic acids of the present invention are used in several ways. In a 
first embodiment, nucleic acid probes to the lung cancer nucleic acids are made and attached 
5 to biochips to be used in screening and diagnostic methods, as outlined below, or for 
administration, e.g., for gene therapy, RNAi, vaccine, and/or antisense applications. 
Alternatively, the lung cancer nucleic acids that include coding regions of lung cancer 
proteins can be put into expression vectors for the expression of lung cancer proteins, again 
for screening purposes or for administration to a patient. 

10 In a preferred embodiment, nucleic acid probes to lung cancer nucleic acids (both the 

nucleic acid sequences outlined in the figures and/or the complements thereof) are made. 
The nucleic acid probes attached to the biochip are designed to be substantially 
complementary to the lung cancer nucleic acids, i.e., the target sequence (either the target 
sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that 

1 5 hybridization of the target sequence and the probes of the present invention occurs. As 

outlined below, this complementarity need not be perfect; there may be any number of base 
pair mismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the number of mutations 
is so great that no hybridization can occur under even the least stringent of hybridization 

20 conditions, the sequence is not a complementary target sequence. Thus, by "substantially 
complementary" herein is meant that the probes are sufficiently complementary to the target 
sequences to hybridize under appropriate reaction conditions, particularly high stringency 
conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single and 

25 partially double stranded. The strandedness of the probe is dictated by the structure, 

composition, and properties of the target sequence. In general, the nucleic acid probes range 
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally 
complements of ORFs or whole genes are not used. In some embodiments, nucleic acids of 

30 lengths up to hundreds of bases can be used. 

In a preferred embodiment, more than one probe per sequence is used, with either 
overlapping probes or probes to different sections of the target being used. That is, two, 
three, four or more probes, with three being preferred, are used to build in a redundancy for a 
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particular target. The probes can be overlapping (i.e., have some sequence in common), or 
separate. In some cases, PCR primers may be used to .amplify signal for higher sensitivity. 

As will be appreciated by those in the art, nucleic acids can be attached or 
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
5 equivalents herein is meant the association or binding between the nucleic acid probe and the 
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent By "non- 
covalent binding" and grammatical equivalents herein is typically meant one or more of 
electrostatic, hydrophilic, and hydrophobic interactions. Included in non-covalent binding is 

10 the covalent attachment of a molecule, such as, streptavidin to the support and the non- 
covalent binding of the biotinylated probe to the streptavidin. By "covalent binding" and 
grammatical equivalents herein is meant that the two moieties, the solid support and the 
probe, are attached by at least one bond, including sigma bonds, pi bonds and coordination 
bonds. Covalent bonds can be formed directly between the probe and the solid support or can 

15 be formed by a cross linker or by inclusion of a specific reactive group on either the solid 
support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 

In general, the probes are attached to a biochip in a wide variety of ways, as will be 
appreciated by those in the art. As described herein, the nucleic acids can either be 

20 synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or 
other grammatical equivalents herein is meant a material that can be modified for the 
attachment or association of the nucleic acid probes and is amenable to at least one detection 

25 method. Often the substrate may contain discrete individual sites appropriate for ndivitual 
partitioning and identification. As will be appreciated by those in the art, the number of 
possible substrates are very large, and include, but are not limited to, glass and modified or 
functionalized glass, plastics (including acrylics, polystyrene and copolymers of styrene and 
other materials, polypropylene, polyethylene, polybutylene, polyurethanes, Teflon, etc.), 

30 polysaccharides, nylon or nitrocellulose, resins, silica or silica-based materials including 
silicon and modified silicon, carbon, metals, inorganic glasses, plastics, etc. In general, the 
substrates allow optical detection and do not appreciably fluoresce. A preferred substrate is 
described in US application entitled Reusable Low Fluorescent Plastic Biochip, U.S. 
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Application Serial No. 09/270,214, filed March 15, 1999, herein incoiporated by reference in 

its entirety. 

Generally the substrate is planar, although as will be appreciated by those in the art, 
other configurations of substrates may be used as well. For example, the probes may be 
5 placed on the inside surface of a tube, for flow-through sample analysis to minimize sample 
volume. Similarly, the substrate may be flexible, such as a flexible foam, including closed 
cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g., 

10 the biochip is derivatized with a chemical functional group including, but not limited to, 
amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being 
particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, e.g., using linkers as are known in the art; e.g., 

1 5 homo-or hetero-bifimctional linkers as are well known (see 1 994 Pierce Chemical Company 
catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases, 
additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be 
used. 

In this embodiment, oligonucleotides are synthesized, and then attached to the surface 
20 of the solid support. Either the 5' or 3' terminus may be attached to the solid support, or 

attachment may be via linkage to an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very strong, 

yet non-covalent. For example, biotinylated oligonucleotides can be made, which bind to 

surfaces covalently coated with streptavidin, resulting in attachment. 
25 Alternatively, the oligonucleotides may be synthesized on the surface, as is known in 

the art. For example, photoactivation techniques utilizing photopolymerization compounds 

and techniques are used. In a preferred embodiment, the nucleic acids can be synthesized in 

situ, using known photolithographic techniques, such as those described in WO 95/251 16; 

WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references cited within, all of 
30 which are expressly incorporated by reference; these methods of attachment form the basis of 

the Affymetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression level of 

lung cancer-associated sequences. These assays are typically performed in conjunction with 
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reverse transcription. In such assays, a lung cancer-associated nucleic acid sequence acts as a 

template in an amplification reaction (e.g., Polymerase Chain Reaction, or PCR). In a 

quantitative amplification, the amount of amplification product will be proportional to the 

amount of template in the original sample. Comparison to appropriate controls provides a 

5 measure of the amount of lung cancer-associated RNA. Methods of quantitative 

amplification are well known to those of skill in the art. Detailed protocols for quantitative 

V np r y^r\A^ * * j in Tnnf gj *t al (1 QQO) PCR Protocols. A Guide to Methods and 

Applications . 

In some embodiments, a TaqMan based assay is used to measure expression. 

10 TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot itself be 
extended due to a blocking agent at the 3 ' end. When the PCR product is amplified in 
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3' 

15 quenching agent, thereby resulting in an increase in fluorescence as a function of 

amplification (see, e.g., literature provided by Perkin-Elmer, e.g., www2.perkin-elmer.com). 

Other suitable amplification methods include, but are not limited to, ligase chain 
reaction (LCR) (see Wu and Wallace (1989) Genomics 4:560, Landegren, et al. (1988) 
Science 241:1077, and Barringer, et al. (1990) Gene 89:117), transcription amplification 

20 (Kwoh, et al. (1989) Proc. Natl. Acad. Sci. USA 86:1173), self-sustained sequence 

replication (Guatelli, et al. (1990) Proc. Nat. Acad. Sci. USA 87:1874), dot PCR, and linker 
adapter PCR, etc. 

Expression of lung cancer proteins from nucleic acids 

25 In a preferred embodiment, lung cancer nucleic acids, e.g., encoding lung cancer 

proteins, are used to make a variety of expression vectors to express lung cancer proteins 
which can then be used in screening assays, as described below. Expression vectors and 
recombinant DNA technology are well known to those of skill in the art (see, e.g., Ausubel, 
supra, and Fernandez and Hoeffler (eds 1999) Gene Expression Systems) and are used to 

30 express proteins. The expression vectors may be either self-replicating extrachromosomal 
vectors or vectors which integrate into a host genome. Generally, these expression vectors 
include transcriptional and translational regulatory nucleic acid operably linked to the nucleic 
acid encoding the lung cancer protein. The term "control sequences*' refers to DNA 
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sequences used for the expression of an operably linked coding sequence in a particular host 

organism. Control sequences that are suitable for prokaryotes, e.g., include a promoter, 

optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are known to 

utilize promoters, polyadenylation signals, and enhancers. 

5 Nucleic acid is "operably linked" when it is placed into a functional relationship with 

another nucleic acid sequence. For example, DNA for a presequence or secretory leader is 

operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in 

the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding 

sequence if it affects the transcription of the sequence; or a ribosome binding site is operably 

10 linked to a coding sequence if it is positioned so as to facilitate translation. Generally, 
"operably linked" means that the DNA sequences being linked are contiguous, and, in the 
case of a secretory leader, contiguous and in reading phase. However, enhancers do not have 
to be contiguous. Linking is typically accomplished by ligation at convenient restriction 
sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are used in 

1 5 accordance with conventional practice. Transcriptional and translational regulatory nucleic 
acid will generally be appropriate to the host cell used to express the lung cancer protein. 
Numerous types of appropriate expression vectors, and suitable regulatory sequences are 
known in the art for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may include, but are 

20 not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop 
sequences, translational start and stop sequences, and enhancer or activator sequences. In a 
preferred embodiment, the regulatory sequences include a promoter and transcriptional start 
and stop sequences. 

Promoter sequences may be either constitutive or inducible promoters. The promoters 
25 may be either naturally occurring promoters or hybrid promoters. Hybrid promoters, which 
combine elements of more than one promoter, are also known in the art, and are useful in the 
present invention. 

In addition, an expression vector may comprise additional elements. For example, the 
expression vector may have two replication systems, thus allowing it to be maintained in two 
30 organisms, e.g., in mammalian or insect cells for expression and in a prokaryotic host for 
cloning and amplification. Furthermore, for integrating expression vectors, the expression 
vector often contains at least one sequence homologous to the host cell genome, and 
preferably two homologous sequences which flank the expression construct. The integrating 
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vector may be directed to a specific locus in the host cell by selecting the appropriate 

homologous sequence for inclusion in the vector. Constructs for integrating vectors are well 
known in the art (e.g., Fernandez and Hoeffler, supra). 

In addition, in a preferred embodiment, the expression vector contains a selectable 
5 marker gene to allow the selection of transformed host cells. Selection genes are well known 
in the art and will vary with the host cell used. 

The lung cancer proteins of the present invention are usually produced by culturing a 
host cell transformed with an expression vector containing nucleic acid encoding a lung 
cancer protein, under the appropriate conditions to induce or cause expression of the lung 

10 cancer protein. Conditions appropriate for lung cancer protein expression will vary with the 
choice of the expression vector and the host cell, and will be easily ascertained by one skilled 
in the art through routine experimentation or optimization. For example, the use of 
constitutive promoters in the expression vector will require optimizing the growth and 
proliferation of the host cell, while the use of an inducible promoter requires the appropriate 

15 growth conditions for induction. In addition, in some embodiments, the timing of the harvest 
is important. For example, the baculoviral systems used in insect cell expression are lytic 
viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fimgi, and insect and 
animal cells, including mammalian cells. Of particular interest are Saccharomyces cerevisiae 

20 and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, Neurospora, BHK, 
CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial cells), THP1 cells (a 
macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the lung cancer proteins are expressed in mammalian 
cells. Mammalian expression systems are also known in the art, and include retroviral and 

25 adenoviral systems. Of particular use as mammalian promoters are the promoters from 
mammalian viral genes, since the viral genes are often highly expressed and have a broad 
host range. Examples include the SV40 early promoter, mouse mammary tumor virus LTR 
promoter, adenovirus major late promoter, herpes simplex virus promoter, and the CMV 
promoter (see, e.g., Fernandez and Hoeffler, supra). Typically, transcription termination and 

30 polyadenylation sequences recognized by mammalian cells are regulatory regions located 3 * 
to the translation stop codon and thus, together with the promoter elements, flank the coding 
sequence. Examples of transcription terminator and polyadenylation signals include those 
derived form SV40. 
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The methods of introducing exogenous nucleic acid into mammalian hosts, as well as 

other hosts, is well known in the art, and will vary with the host cell used. Techniques 

include dextran-mediated transfection, calcium phosphate precipitation, polybrene mediated 

transfection, protoplast fusion, electroporation, viral infection, encapsulation of the 

polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei. 

In a preferred embodiment, lung cancer proteins are expressed in bacterial systems. 

Promoters from bacteriophage may also be used and are known in the art. In addition* 

synthetic promoters and hybrid promoters are also useful; e.g., the tac promoter is a hybrid of 

the trp and lac promoter sequences. Furthermore, a bacterial promoter can include naturally 

occurring promoters of non-bacterial origin that have the ability to bind bacterial RNA 

polymerase and initiate transcription. In addition to a functioning promoter sequence, an 

efficient ribosome binding site is desirable. The expression vector may also include a signal 

peptide sequence that provides for secretion of the lung cancer protein in bacteria. The 

protein is either secreted into the growth media (gram-positive bacteria) or into the 

periplasmic space, located between the inner and outer membrane of the cell (gram-negative 

bacteria). The bacterial expression vector may also include a selectable marker gene to allow 

for the selection of bacterial strains that have been transformed. Suitable selection genes 

include genes which render the bacteria resistant to drugs such as ampicillin, 

chloramphenicol, erythromycin, kanamycin, neomycin and tetracycline. Selectable markers 

also include biosynthetic genes, such as those in the histidine, tryptophan and leucine 

biosynthetic pathways. These components are assembled into expression vectors. Expression 

vectors for bacteria are well known in the art, and include vectors for Bacillus subtilis, E. 

coli, Streptococcus cremoris, and Streptococcus lividans, among others (e.g., Fernandez and 

Hoeffler, supra). The bacterial expression vectors are transformed into bacterial host cells 

using techniques well known in the art, such as calcium chloride treatment, electroporation, 

and others. 

In one embodiment, lung cancer proteins are produced in insect cells. Expression 
vectors for the transformation of insect cells, and in particular, baculovirus-based expression 
vectors, are well known in the art. 

In a preferred embodiment, lung cancer protein is produced in yeast cells. Yeast 
expression systems are well known in the art, and include expression vectors for 
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha, 
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Kluyveromyces fiagilis and K. lactis, Pichia guillerimondii, and P. pastoris, 

Schizosaccharomyces pombe, and Yarrowia lipolytica. 

The lung cancer protein may also be made as a fusion protein, using techniques well 

known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the desired epitope 

5 is small, the lung cancer protein may be fused to a carrier protein to form an immunogen. 

Alternatively, the lung cancer protein may be made as a fusion protein to increase expression 

for affinity purification purposes, or for other reasons. For example, when the lung cancer 

protein is a lung cancer peptide, the nucleic acid encoding the peptide may be linked to other 

nucleic acid for expression purposes. 

10 In a preferred embodiment, the lung cancer protein is purified or isolated after 

expression. Lung cancer proteins may be isolated or purified in a variety of appropriate 
ways. Standard purification methods include electrophoretic, molecular, immunological and 
chromatographic techniques, including ion exchange, hydrophobic, affinity, and reverse- 
phase HPLC chromatography, and chromatofocusing. For example, the lung cancer protein 

1 5 may be purified using a standard anti-lung cancer protein antibody column. Ultrafiltration 
and diafiltration techniques, in conjunction with protein concentration, are also useful. For 
general guidance in suitable purification techniques, see Scopes (1982) Protein Purification . 
The degree of purification necessary will vary depending on the use of the lung cancer 
protein. In some instances no purification will be necessary. 

20 Once expressed and purified if necessary, the lung cancer proteins and nucleic acids 

are useful in a number of applications. They may be used as immunoselection reagents, as 
vaccine reagents, as screening agents, therapeutic entities, for production of antibodies, as 
transcription or translation inhibitors, etc. 

25 Variants of lung cancer proteins 

In one embodiment, the lung cancer proteins are derivative or variant lung cancer 
proteins as compared to the wild-type sequence. That is, as outlined more fully below, the 
derivative lung cancer peptide will often contain at least one amino acid substitution, deletion 
or insertion, with amino acid substitutions being particularly preferred. The amino acid 
30 substitution, insertion or deletion may occur at a particular residue within the lung cancer 
peptide. 

Also included within one embodiment of lung cancer proteins of the present invention 
are amino acid sequence variants. These variants typically fall into one or more of three 
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classes: substitutional, insertiorial or deletional variants. These vanants ordinarily are 

prepared by site specific mutagenesis of nucleotides in the DNA encoding the lung cancer 

protein, using cassette or PCR mutagenesis or other techniques, to produce DNA encoding 

the variant, and thereafter expressing the DNA in recombinant cell culture as outlined above. 

5 However, variant lung cancer protein fragments having up to about 100-150 residues maybe 

prepared by in vitro synthesis. Amino acid sequence variants are characterized by the 

predetermined nature of the variation, a feature that sets them apart from naturally occurring 

allelic or interspecies variation of the lung cancer protein amino acid sequence. The variants 

typically exhibit a similar qualitative biological activity as the naturally occurring analogue, 

10 although variants can also be selected which have modified characteristics as will be more 
fully outlined below. 

While the site or region for introducing an amino acid sequence variation is often 
predetermined, the mutation per se need not be predetermined. For example, in order to 
optimize the performance of a mutation at a given site, random mutagenesis may be 

1 5 conducted at the target codon or region and the expressed lung cancer variants screened for 
the optimal combination of desired activity. Techniques exist for making substitution 
mutations at predetermined sites in DNA having a known sequence, e.g., Ml 3 primer 
mutagenesis and PCR mutagenesis. Screening of mutants is often done using assays of lung 
cancer protein activities. 

20 Amino acid substitutions are typically of single residues; insertions usually will be on 

the order of from about 1 to 20 amino acids, although considerably larger insertions may be 
occasionally tolerated. Deletions generally range from about 1 to about 20 residues, although 
in some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to arrive 

25 at a final derivative. Generally these changes are done on a few amino acids to minimize the 
alteration of the molecule. Larger changes may be tolerated in certain circumstances. When 
small alterations in the characteristics of a lung cancer protein are desired, substitutions are 
generally made in accordance with the amino acid substitution chart provided in the 
definition section. 

30 Variants typically exhibit essentially the same qualitative biological activity and will 

elicit the same immune response as a naturally-occurring analog, although variants also are 
selected to modify the characteristics of lung cancer proteins as needed. Alternatively, the 
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variant may be designed or reorganized such that a biological activity of the lung cancer - 

protein is altered. For example, glycosylation sites may be added, altered, or removed. 

Covalent modifications of lung cancer polypeptides are included within the scope of 

this invention. One type of covalent modification includes reacting targeted amino acid 

5 residues of a lung cancer polypeptide with an organic derivatizing agent that is capable of 

reacting with selected side chains or the N-or C-terminal residues of a lung cancer 

polypeptide. Derivatization with Afunctional agents is useful, for instance, for crosslinking 

lung cancer polypeptides to a water-insoluble support matrix or surface for use in a method 

for purifying anti-lung cancer polypeptide antibodies or screening assays, as is more fully 

1 0 described below. Commonly used crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2- 
phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g., esters with 4-azidosalicylic 
acid, homobifunctional imidoesters, including disuccinimidyl esters such as 3,3'- 
dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N-maleimido-1,8- 
octane and agents such as methyl-3-((p-azidophenyl)dithio)propipimidate. 

1 5 Other modifications include deamidation of glutaminyl and asparaginyl residues to 

the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and 
lysine, phosphorylation of hydroxyl groups of serinyl, threonyl or tyrosyl residues, 
methylation of the y-amino groups of lysine, arginine, and histidine side chains (Creighton 
(1983) Proteins: Structure and Molecular Properties, pp. 79-86), acetylation of the N-terminal 

20 amine, and amidation of any C-terminal carboxyl group. 

Another type of covalent modification of the lung cancer polypeptide encompassed by 
this invention is an altered native glycosylation pattern of the polypeptide. "Altering the 
native glycosylation pattern" is intended herein to mean adding to or deleting one or more 
carbohydrate moieties of a native sequence lung cancer polypeptide. Glycosylation patterns 

25 can be altered in many ways. For example the use of different cell types to express lung 
cancer-associated sequences can result in different glycosylation patterns. 

Addition of glycosylation sites to lung cancer polypeptides may also be accomplished 
by altering the amino acid sequence thereof. The alteration may be made, e.g., by the 
addition of, or substitution by, one or more serine or threonine residues to the native sequence 

30 lung cancer polypeptide (for O-linked glycosylation sites). The lung cancer amino acid 
sequence may optionally be altered through changes at the DNA level, particularly by 
mutating the DNA encoding the lung cancer polypeptide at preselected bases such that 
codons are generated that will translate into the desired amino acids. 
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Another means of increasing the number of carbohydrate moieties on the lung cancer 

polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. Such 

methods are described in the art, e.g., in WO 87/05330, and in Aplin and Wriston (1981) 

CRC Crit Rev. Bioehem.. pp. 259-306. 

5 Removal of carbohydrate moieties present on the lung cancer polypeptide may be 

accomplished chemically or enzymatically or by mutational substitution of codons encoding, 
for amino acid residues that serve as targets for glycosylate. Chemical deglycosylation 
techniques are known in the art and described/for instance, by HaMmuddin, et al. (1987) 
Amh TW.hem. Biophvs.. 259:52 and by Edge, et al. (1981) Anal. Biochem., 118:131. 

10 Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a 
variety of endo-and exo-glycosidases as described by Thotakura, et al. (1987) MeflL 
EnzymoL 138:350. 

Another type of covalent modification of lung cancer comprises linking the lung 
cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene 

15 glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Patent 
Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192, or 4,179,337. 

Lung cancer polypeptides of the present invention may also be modified in a way to 
form chimeric molecules comprising a lung cancer polypeptide fused to another, 
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric 

20 molecule comprises a fusion of a lung cancer polypeptide with a tag polypeptide which 
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the ammo-or carooxyl-terminus of the lung cancer polypeptide. The 
presence of such epitope-tagged forms of a lung cancer polypeptide can be detected using an 
antibody against the tag polypeptide. Also, provision of the epitope tag enables the lung 

25 cancer polypeptide to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a lung cancer polypeptide with an 
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the 
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

30 Various tag polypeptides and their respective antibodies are well known and examples 

include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; fflS6 and metal 
chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field, et al. (1988) MoL 
Cell. Biol. 8:2159-2165); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies 
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thereto (Evan, et al. (1985) Molecular and Cellular Biology 5:3610-3616); and the Herpes 

Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky, et al. (1 990) Protein 

Engineering 3(6):547-553). Other tag polypeptides include the Flag-peptide (Hopp, et al. 

(1988) BioTechnologv 6:1204-1210); the KT3 epitope peptide (Martin, et al. (1992) Science 

5 255:192-194); tubulin epitope peptide (Skinner, et al. (1991) J. Biol. Chem. 266:15163- 

15166); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth, et al. (1990) Proc. NatT 

Acad. Sci. USA 87:6393-6397). 

Also included are other lung cancer proteins of the lung cancer family, and lung 

cancer proteins from other organisms, which are cloned and expressed as outlined below. 

10 Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be used to 
find other related lung cancer proteins from primates or other organisms. As will be 
appreciated by those in the art, particularly useful probe and/or PCR primer sequences 
include unique areas of the lung cancer nucleic acid sequence. As is generally known in the 
art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with from 

15 about 20 to about 30 being preferred, and may contain inosine as needed. PCR reaction 
conditions are well known in the art (e.g., Innis, PCR Protocols, supra). 

Antibodies to lung cancer proteins 

In a preferred embodiment, when a lung cancer protein is to be used to generate 
20 antibodies, e.g., for immunotherapy or immunodiagnosis, the lung cancer protein should 
share at least one epitope or determinant with the full length protein. By "epitope" or 
"determinant" herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies 
made to a smaller lung cancer protein will be able to bind to the full-length protein, 
25 particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is, 
antibodies generated to a unique epitope show little or no cross-reactivity. 

Methods of preparing polyclonal antibodies are well known (e.g., Coligan, supra; and 
Harlow and Lane, supra). Polyclonal antibodies can be raised in a mammal, e.g., by one or 
more injections of an immunizing agent and, if desired, an adjuvant. Typically, the 
30 immunizing agent and/or adjuvant will be injected in the mammal by multiple subcutaneous 
or intraperitoneal injections. The immunizing agent may include a protein encoded by a 
nucleic acid of Tables 1 A-16 or fragment thereof or a fusion protein thereof. It may be useful 
to conjugate the immunizing agent to a protein known to be immunogenic in the mammal 
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being immunized. Immunogenic proteins include, e.g., keyhole limpet hemocyanin, serum 

albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Adjuvants include, e.g., 
Freund's complete adjuvant and MPL-TDM. adjuvant (monophosphoryl Lipid A, synthetic 
trehalose dicorynomycolate). The immunization protocol may be selected by one skilled in 
5 the art. 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies 
may be prepared using hybridoma methods, such as those described by Kohler and Milstein 
(1975) Nature 256:495. In a hybridoma method, a mouse, hamster, or other appropriate host 
animal, is typically immunized with an immunizing agent to elicit lymphocytes that produce 

10 or are capable of producing antibodies that will specifically bind to the immunizing agent. 
Alternatively, the lymphocytes may be immunized in vitro. The immunizing agent will 
typically include a polypeptide encoded by a nucleic acid of the tables, or fragment thereof, 
or a fusion protein thereof. Generally, either peripheral blood lymphocytes ("PBLs") are 
used if cells of human origin are desired, or spleen cells or lymph node cells are used if non- 

1 5 human mammalian sources are desired. The lymphocytes are then fused with an 

immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a 
hybridoma cell (Goding (1986) Monoclonal Antibodies: P rinciples and Practice, pp. 59-103 ). 
Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells 
of rodent, bovin, or primate origin. Usually, rat or mouse myeloma cell lines are employed. 

20 The hybridoma cells may be cultured in a suitable culture medium that preferably contains 
one or more substances that inhibit the growth or survival of the unfiised, immortalized cells. 
For example, if the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl 
transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will include 
hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the 

25 growth of HGPRT-deficient cells. > 

In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies are 
typically monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens or that have binding specificities for two 
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a 

30 protein encoded by a nucleic acid of the tables or a fragment thereof, the other one is for any 
other antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
multivalent reagents. 
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.. In a preferred embodiment, the antibodies to lung cancer protein are capable of 

reducing or eliminating a biological function of a lung cancer protein, in a naked fonn or 

conjugated to an effector moiety. That is, the addition of anti-lung cancer protein antibodies 

(either polyclonal or preferably monoclonal) to lung cancer tissue (or cells containing lung 

cancer) may reduce or eliminate the lung cancer. Generally, at least a 25% decrease in 

activity, growth, size or the like is preferred, with at least about 50% being particularly 

preferred and about a 95-100% decrease being especially preferred. 

. In a preferred embodiment the antibodies to the lung cancer proteins are humanized 

antibodies (e.g., Xenerex Biosciences, Medarex, Inc., Abgenix, Inc., Protein Design Labs, 

Inc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric molecules of 

immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', 

F(ab')2 or other antigen-binding subsequences of antibodies) which contain minim al 

sequence derived from non-human immunoglobulin. Humanized antibodies include human 

immunoglobulins (recipient antibody) in which residues from a complementary determining 

region (CDR) of the recipient are replaced by residues from a CDR of a non-human species 

(donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and 

capacity. In some instances, Fv framework residues of a human immunoglobulin are 

replaced by corresponding non-human residues. Humanized antibodies may also comprise 

residues which are found neither in the recipient antibody nor in the imported CDR or 

framework sequences. In general, a humanized antibody will comprise substantially all of at 

least one, and typically two, variable domains, in which all or substantially all of the CDR 

regions correspond to those of a non-human immunoglobulin and all or substantially all of 

the framework (FR) regions are those of a human immunoglobulin consensus sequence. A 

humanized antibody optimally also will typically comprise at least a portion of an 

immunoglobulin constant region (Fc), typically that of a human immunoglobulin (Jones, et 

al. (1986) Nature 321 :522-525; Riechmann, et al. (1988) Nature 332:323-329; and Presta 

(1992) Curr. Op. Struct. Biol. 2:593-596). Humanization can be performed following the 

method of Winter and co-workers (Jones, et al. (1986) Nature 321 :522-525; Riechmann, et al. 

(1988) Nature 332:323-327; Verhoeyen, et al. (1988) Science 239:1534-1536), by 

substituting rodent CDRs or CDR sequences for corresponding sequences of a human 

antibody. Accordingly, such humanized antibodies are chimeric antibodies (U.S. Patent No. 

4,816,567), wherein substantially less than an intact human variable domain has been 

substituted by corresponding sequence from a non-human species. 
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Human-like antibodies can also be produced using various techniques known in the 

art, including phage display libraries (Hoogenboom and Winter (1991) J. Mol. Biol. 227:381 

Marks, et al. (1991) J. Mol. Biol. 222:581). The techniques of Cole, et al. and Boerner, et al. 

are also available for the preparation of human monoclonal antibodies (Cole, et al. (1985) 

5 Monoclonal Antibodies and Cancer Therapy, p. 77 and Boerner, et al. (1 991) J. Immunol. 

147(l):86-95). Similarly, human antibodies can be made by introducing human 

immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous 

immunoglobulin genes have been partially or completely inactivated. Upon challenge, 

human antibody production is observed, which closely resembles that seen in humans in 

10 nearly all respects, including gene rearrangement, assembly, and antibody repertoire. This 
approach is described, e.g., inU.S; Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 
5,633,425; 5,661,016, and in the following scientific publications: Marks, et al. (1992) 
Bio/Technology 10:779-783; Lonberg, et al. (1994) Nature 368:856-859; Morrison (1994) 
Nature 368:812-13; Fishwild, et al. (1996) Nature Biotechnology 14:845-51; Neuberger 

15 (1996) Nature Biotechnology 14:826; and Lonberg and Huszar (1995) Intern. Rev. Immunol. 
13:65-93. 

By immunotherapy is meant treatment of lung cancer with an antibody raised against 
a lung cancer proteins. As used herein, immunotherapy can be passive or active. Passive 
immunotherapy as defined herein is the passive transfer of antibody to a recipient (patient). 

20 Active immunization is the induction of antibody and/or T-cell responses in a recipient 
(patient). Induction of an immune response is the result of providing the recipient with an 
antigen to which antibodies are raised. The antigen may be provided by injecting a 
polypeptide against which antibodies are desired to be raised into a recipient, or contacting 
the recipient with a nucleic acid capable of expressing the antigen and under conditions for 

25 expression of the antigen, leading to an immune response. 

In a preferred embodiment the lung cancer proteins against which antibodies are 
raised are secreted proteins as described above. Without being bound by theory, antibodies 
used for treatment, may bind and prevent the secreted protein from binding to its receptor, 
thereby inactivating the secreted lung cancer protein. 

30 In another preferred embodiment, the lung cancer protein to which antibodies are . 

raised is a transmembrane protein. Without being bound by theory, antibodies used for 
treatment may bind the extracellular domain of the lung cancer protein and prevent it from 
binding to other proteins, such as circulating ligands or cell-associated molecules. The 
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antibody may cause down-regulation of the transmembrane The 

antibody may be a competitive, non-competitive or uncompetitive inhibitor of protein binding 

to the extracellular domain of the lung cancer protein; The antibody may be an antagonist of 

the lung cancer protein or may prevent activation of a transmembrane lung cancer protein, or 

5 may induce or suppress a particular cellular pathway. In some embodiments, when the 

antibody prevents the binding of other molecules to the lung cancer protein, the antibody 

prevents growth of the cell. The antibody may also be used to target or sensitize the cell to 

cytotoxic agents, including, but not limited to TNF-a, TNF-P, IL-1, INF-y, and IL-2, or 

chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, methotrexate, 

10 and the like. In some instances the antibody may belong to a sub-type that activates serum 
complement when complexed with the transmembrane protein thereby mediating cytotoxicity 
or antigen-dependent cytotoxicity (ADCC). Thus, lung cancer may be treated by 
administering to a patient antibodies directed against the transmembrane lung cancer protein. 
Antibody-labeling may activate a co-toxin, localize a toxin payload, or otherwise provide 

15 means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated to an effector moiety. 
The effector moiety can be various molecules, including labeling moieties such as radioactive 
labels or fluorescent labels, or can be a therapeutic moiety. In one aspect the therapeutic 
moiety is a small molecule that modulates the activity of a lung cancer protein. In another 

20 aspect the therapeutic moiety may modulate an activity of molecules associated with or in 
close proximity to a lung cancer protein. The therapeutic moiety may inhibit enzymatic or 
signaling activity such as protease or collagenase activity associated with lung cancer. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic agent. In 
this method, targeting the cytotoxic agent to lung cancer tissue or cells results in a reduction 

25 in the number of afflicted cells, thereby reducing symptoms associated with lung cancer. 

Cytotoxic agents are numerous and varied and include, but are not limited to, cytotoxic drugs 
or toxins or active fragments of such toxins. Suitable toxins and their corresponding 
fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A chain, curcin, 
crotin, phenomycin, enomycin, saporin, auristatin, and the like. Cytotoxic agents also include 

30 radiochemicals made by conjugating radioisotopes to antibodies raised against lung cancer 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to. 
the antibody. Targeting the; therapeutic moiety to transmembrane lung cancer proteins not 
only serves to increase the local concentration of therapeutic moiety in the lung cancer 
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afflicted area, but also serves to reduce deleterious side effects that may be associated with 

the untargeted therapeutic moiety. 

In another preferred embodiment, the lung cancer protein against which the antibodies 
are raised is an intracellular protein. In this case, the antibody may be conjugated to a.protein- 
5 or other entity which facilitates entry into the cell. In one case, the antibody enters the cell by 
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to 
the individual or cell. Moreover, wherein the lung cancer protein can be targeted within a 
cell, i.e:, the nucleus, an antibody theretomay contain a signal for that target localization, i.e., 
a nuclear localization signal. 
10 The lung cancer antibodies of the invention specifically bind to lung cancer proteins. 

By "specifically bind" herein is meant that the antibodies bind to the protein with a Kd of at 
least about 0.1 mM, more usually at least about 1 pM, preferably at least about 0.1 or 
better, and most preferably, 0.01 pM or better. Selectivity of binding to the specific target 
and not to related other sequences is also important. 

15 

Detection of lung cancer sequence for diagnostic and therapeutic applications 

In one aspect, the RNA expression levels of genes are determined for different 
cellular states in the lung cancer phenotype. Expression levels of genes in normal tissue (e.g., 
not undergoing lung cancer), in lung cancer tissue (and in some cases, for varying severities 

20 of lung cancer that relate to prognosis, as outlined below), or in non-malignant disease are 
evaluated to provide expression profiles. A gene expression profile of a particular cell state 
or point of development is essentially a "fingerprint" of the state of the cell. While two states 
may have a particular gene similarly expressed, the evaluation of a number of genes 
simultaneously allows the generation of a gene expression profile that is reflective of the state 

25 of the cell. By comparing expression profiles of cells in different states, information 

regarding which genes are important (including both up- and down-regulationof genes) in 
each of these states is obtained. Then, diagnosis may be performed or confirmed to 
determine whether a tissue sample has the gene expression profile of normal or cancerous 
tissue. This will provide for molecular diagnosis of related conditions. 

30 "Differential expression," or grammatical equivalents as used herein, refers to 

qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can 
qualitatively have its expression altered, including an activation or inactivation, in, e.g., 
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normal versus lung cancer tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more states. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 
detectable by standard techniques. Some genes will be expressed in one state or cell type, but 

5 not in both. Alternatively, the difference in expression may be quantitative, e.g., in that 
expression is increased or decreased; i.e., gene expression is either upregulated, resulting in 
an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Affymetrix 

1 0 GeneChip™ expression arrays, Lockhart (1996) Nature Biotechnology 14: 1675-1 680, hereby 
expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is typically 
at least about 50%, more preferably at least about 100%, more preferably at least about 

15 150%, more preferably at least about 200%, with from 300 to at least 1000% being especially 
preferred. 

Evaluation may be at me gene transcript or me protein level. The amount of gene 
expression may be monitored using nucleic acid probes to the RNA or DNA equivalent of the 
gene transcript, and the quantification of gene expression levels, or, alternatively, the final 

20 gene product itself (protein) can be monitored, e.g., with antibodies to the lung cancer protein 
and standard immunoassays (ELISAs, etc.) or other techniques, including mass spectroscopy 
assays, 2D gel electrophoresis assays, etc. Proteins corresponding to lung cancer genes, e.g., 
those identified as being important in a lung cancer or disease phenotype, can be evaluated in 
a lung cancer diagnostic test. In a preferred embodiment, gene expression monitoring is 

25 performed simultaneously on a number of genes. 

The lung cancer nucleic acid probes may be attached to biochips as outlined herein for 
the detection and quantification of lung cancer sequences in a particular cell. The assays are 
further described below in the example. PCR techniques can be used to provide greater 
sensitivity. Multiple protein expression monitoring can be performed as well. Similarly, 

30 these assays may be performed on an individual basis as well. 

In a preferred embodiment nucleic acid's encoding the lung cancer protein are 
detected. Although DNA or RNA encoding the lung cancer protein may be detected, of 
particular interest are methods wherein an mRNA encoding a lung cancer protein is detected. 
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Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is complementary to 

and hybridizes with the mRNA and includes, but is not limited to, oligonucleotides, cDNA or 

RNA. Probes also should contain a detectable label, as defined herein. In one method the 

mRNA is detected after immobilizing the nucleic acid, to be examined on a solid support such 

5 as nylon membranes and hybridizing the probe with the sample. Following washing to 

remove the non-specifically bound probe, the label is detected. In another method detection 

of the mRNA is performed in situ. In this method permeabilized cells or tissue samples are 

contacted with a detectably labeled nucleic acid probe for sufficient time to allow the probe 

to hybridize with the target mRNA. Following washing to remove the non-specifically bound 

1 0 probe, the label is detected. For example a digoxygenin labeled riboprobe (RNA probe) that 
is complementary to the mRNA encoding a lung cancer protein is detected by binding the 
digoxygenin with an anti-digoxygenin secondary antibody and developed with nitro blue 
tetrazolium and 5-bromo-4-chloro-3-indoyl phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins as 

1 5 described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The lung cancer proteins, antibodies, nucleic acids, modified proteins and cells 
containing lung cancer sequences are used in diagnostic assays. This can be performed on an 
individual gene or corresponding polypeptide level. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 

20 techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, 1 lung cancer proteins, including intracellular, 
transmembrane, or secreted proteins, find use as markers of lung cancer, e.g., for prognostic 
or diagnostic purposes. Detection of these proteins in putative lung cancer tissue allows for 

25 detection, prognosis, or diagnosis of lung cancer or similar disease, and perhaps for selection 
of therapeutic strategy. In one embodiment, antibodies are used to detect lung cancer 
proteins. A preferred method separates proteins from a sample by electrophoresis on a gel 
(typically a denaturing and reducing protein gel, but may be another type of gel, including 
isoelectric focusing gels and the like). Following separation of proteins, the lung.cancer 

30 protein is detected, e.g., by immunoblotting with antibodies raised against the lung cancer 
protein. Methods of immunoblotting are well known to those of ordinary skill in the art 

In another preferred method, antibodies to the lung cancer protein find use in in situ 
imaging techniques, e.g., in histology (e.g., Asai (edl 1993) Methods in Cell giology: 
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Antibodies in Cell Biology, volume 37. In this method cells are contacted .with from one to 

many antibodies to the lung cancer protein(s). Following washing to remove non-specific 

antibody binding, the presence of the antibody or antibodies is detected. In one embodiment 

the antibody is detected by incubating with a secondary antibody that contains a detectable 

label, e.g., multicolor fluorescence or confocal imaging. In another method the primary 

antibody to the lung cancer protein(s) contains a detectable label, e.g., an enzyme marker that 

can act on a substrate. In another preferred embodiment each one of multiple primary 

antibodies contains a distinct and detectable label. This method finds particular use in . 

simultaneous screening for a plurality of lung cancer proteins. Many other histological 

imaging techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the ability 

to detect and distinguish emissions of different wavelengths. In addition, a fluorescence 

activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing lung cancer from 

blood, serum, plasma, stool, and other samples. Such samples, therefore, are useful as 

samples to be probed or tested for the presence of lung cancer proteins. Antibodies can be 

used to detect a lung cancer protein by previously described immunoassay techniques 

including ELISA, immunoblotting (western blotting), immunoprecipitation, BIACORE 

technology and the like. Conversely, the presence of antibodies may indicate an immune 

response against an endogenous lung cancer protein or vaccine. 

In a preferred embodiment, in situ hybridization of labeled lung cancer nucleic acid 

probes to tissue arrays is done. For example, arrays of tissue samples, including lung cancer 

tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, supra) is then 

performed. When comparing the fingerprints between an individual and a standard, the 

skilled artisan can make a diagnosis, a prognosis, or a prediction based on the findings. It is 

further understood that the genes which indicate the diagnosis may differ from those which 

indicate the prognosis and molecular profiling of the condition of the cells may lead to 

distinctions between responsive or refractory conditions or may be predictive of outcomes. 

In a preferred embodiment, the lung cancer proteins, antibodies, nucleic acids, 

modified proteins and cells containing lung cancer sequences are used in prognosis assays. 

As above, gene expression profiles can be generated that correlate to lung cancer, clinical, 

pathological, or other information, in terms of long term prognosis. Again, this may be done 

on either a protein or gene level, with the use of genes being preferred. Single or multiple 
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genes may be useful in various combinations. As above, lung cancer probes may be attached 

to biochips for the detection and quantification of lung cancer sequences in a tissue or patient. 

The assays proceed as outlined above for diagnosis. PGR method may provide more ■ 

sensitive and accurate quantification. 

5 

Assays for therapeutic compounds 

In a preferred embodiment, the proteins, nucleic acids, and antibodies as described 
herein are used in drug screening assays. The lung cancer proteins, antibodies, nucleic acids, 
modified proteins and cells containing lung cancer sequences are used in drug screening 

10 assays or by evaluating the effect of drug candidates on a "gene expression profile" or 
expression profile of polypeptides. In a preferred embodiment, the expression profiles are 
used, preferably in conjunction with high throughput screening techniques to allow 
monitoring for expression profile genes after treatment with a candidate agent (e.g., 
Zlokarnik, et aL (1998) Science 279:84-8; Heid (1996) Genome Res. 6:986-94. 

15 In a preferred embodiment, the lung cancer proteins, antibodies, nucleic acids, 

modified proteins and cells containing the native or modified lung cancer proteins are used in 
screening assays. That is, the present invention provides novel methods for screening for 
compositions which modulate the lung cancer phenotype or an identified physiological 
function of a lung cancer protein. As above, this can be done on an individual gene level or 

20 by evaluating the effect of drug candidates on a "gene expression profile'*. In a preferred 

embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, supra. 

Having identified differentially expressed genes herein, a variety of assays may be. 

25 performed. In a preferred embodiment, assays may be run on an individual gene or protein 
level. That is, having identified a particular gene with altered regulation in lung cancer, test 
compounds can be screened for the ability to modulate gene expression or for binding to the 
lung cancer protein. "Modulation" thus includes an increase or a decrease in gene 
expression. The preferred amount of modulation will depend on the original change of the . 

30 gene expression in normal, versus tissue undergoing lung cancer, with changes of at least 
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or 
greater. Thus, if a gene exhibits a 4-fold increase in lung cancer tissue compared to normal 
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in lung 
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cancer tissue compared to normal tissue often provides a target value of a 10-fold increase in 

expression to be induced by the test compound. \ • 

The amount of gene expression may be monitored using nucleic acid probes and the 
quantification of gene expression levels, or, alternatively, the gene product itself can be 
5 monitored, e.g., through the use of antibodies to the lung cancer protein and standard 
immunoassays. Proteomics and separation techniques may also allow quantification of 
expression. 

In a preferred embodiment, gene or protein expression monitoring of a number of 
entities, i.e., an expression profile, is monitored simultaneously. Such profiles will typically 
10 involve a plurality of those entities described herein. 

In this embodiment, the lung cancer nucleic acid probes are attached to biochips as 
outlined herein for the detection and quantification of lung cancer sequences in a particular 
cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plate, may be used 
with dispensed primers in desired wells. A PCR reaction can then be performed and analyzed 
15 for each well. 

Expression monitoring can be performed to identify compounds that modify the 
expression of one or more lung cancer-associated sequences, e.g., a polynucleotide sequence 
set out in the tables. Generally, in a preferred embodiment, a test compound is added to the 
cells prior to analysis. Moreover, screens are also provided to identify agents that modulate 

20 lung cancer, modulate lung cancer proteins, bind to a lung cancer protein, or interfere with 
the binding of a lung cancer protein and an antibody, substrate, or other binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes a molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or. 

25 indirectly alter the lung cancer phenotype or the expression of a lung cancer sequence, e.g., a 
nucleic acid or protein sequence. In preferred embodiments, modulators alter expression 
profiles of nucleic acids or proteins provided herein. In one embodiment, the modulator 
suppresses a lung cancer phenotype, e.g., to a normal or non-malignant tissue fingerprint. In 
another embodiment, a modulator induces a lung cancer phenotype. Generally, a plurality of 

30 assay mixtures are run in parallel with different agent concentrations to obtain a differential 
' response to the various concentrations. Typically, one of these concentrations serves as a 
negative control, i.e., at zero concentration or below the level of detection. 
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In one aspect, a modulator will neutralize the effect of a lung cancer protein. By 
"neutralize" is meant that activity of a protein and the consequent effect on the cell is 
inhibited or blocked. 

In certain embodiments, combinatorial libraries of potential modulators will be 
5 screened for an ability to bind to a lung cancer polypeptide or to modulate activity. 

Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound*') with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
and activity of those variant compounds. Often, high throughput screening (HTS) methods 
1 0 are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve providing a 
library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 
15 display a desired characteristic activity. The compounds thus identified can serve as 

conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical compounds 
generated by either chemical synthesis or biological synthesis by combining a number of 
chemical "building blocks" such as reagents. For example, a linear combinatorial chemical 
10 library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of chemical 
building blocks called amino acids in every possible way for a given compound length (i.e., 
the number of amino acids in a polypeptide compound). Millions of chemical compounds 
can be synthesized through such combinatorial mixing of chemical building blocks (Gallop, 
et al. (1994) I Med. Chem. 37f9V1233-1251 \ 
15 Preparation and screening of combinatorial chemical libraries is well known to those 

of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka (1991) Pept. Prot. Res. 37:487- 
493, Houghton, et al. (1991) Nature, 354:84-88), peptoids (PCT Publication No WO 
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT 
10 Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as 
hydantoins, benzodiazepines and dipeptides (Hobbs, et al. (1993) Proc. Nat. Acad. Sci. USA 
90:6909-6913), vinylogous polypeptides (Hagihara, et al. (1992) J. Amer. Chem. Soc. 
1 14:6568), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding (Hirschmann, et 
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id. (1992) T Ainer. Chem. Soc. 114:9217-9218), analogous organic syntheses of small 

compound ubraries (Chen, et al. (1994) I Amer/Chem. Soc. 1 16:2661), oUgocarbamates 

(Cho, et al. (1993) Science 261:1303), and/or peptidyl phosphonates (Campbell, et al. (1994) 

J. Org. Chem. 59:658). See, generally, Gordon, et al. (1994) J. Med. Chem. 37:1385, nucleic 

5 acid Ubraries (see, e.g., Stratagene, Corp.), peptide nucleic acid libraries (see, e.g., U.S. 

Patent 5,539,083), antibody Ubraries (see, e.g., Vaughn, et al. (1 996) Nature Biotechnology 

14(3):309-314, and PCT/US96/10287), carbohydrate Ubraries (see, e.g., Liang, et al. (1996) 

Science 274:1520-1522, and U.S. Patent No. 5,593,853), and small organic molecule Ubraries 

(see, e.g., benzodiazepines, Baum (1993) C&EN, Jan 18, page 33; isoprenoids, U.S. Patent 

10 No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No. 5,549,974; pyrrolidines, 
U.S. Patent Nos. 5,525,735 and 5,519,134; morphoUno compounds, U.S. Patent No. 
5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the like). . 

Devices for the preparation of combinatorial Ubraries are commercially available (see, 
e.g., 357 MPS, 390 MPS, Advanced Chem Tech, LouisviUe KY, Symphony, Rainin, 

15 Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, MiUipore, Bedford, 
MA). 

A number of weU known robotic systems have also been developed for solution phase 
chemistries. These systems include automated workstations like the automated synthesis 
apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic 

20 systems utilizing robotic arms (Zymate JJ, Zymark Corporation, Hopkinton, Mass.; Orca, 
Hewlett-Packard, Palo Alto, Catif.), which mimic the manual synthetic operations performed 
by a chemist. The above devices, with appropriate modification, are suitable for use with the 
present invention. In addition, numerous combinatorial libraries are themselves 
commercially available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, Tripos, 

25 Inc., St. Louis, MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek 
Biosciences, Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect modulation of lung cancer gene transcription, polypeptide 
expression, and polypeptide activity. 

30 High throughput assays for evaluating the presence, absence, quantification, or other 

properties of particular nucleic acids or protein products are weU known to those of skiU in 
the art. Similarly, binding assays and reporter gene assays are similarly weU known. Thus, 
e.g., U.S . Patent No. 5,559,410 discloses high throughput screening methods for proteins, 
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U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid 

binding (i.e:, in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high. 

throughput methods of screening for ligand/antibody binding. 

. In addition, high throughput screening systems are commercially available (see, e.g., 

5 Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 

Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 

typically automate procedures, including sample and reagent pipetting, liquid dispensing, 

timed incubations, and final readings of the microplate in detector(s) appropriate for the 

assay. These configurable systems provide high throughput and rapid start up as well as a 

10 high degree of flexibility and customization. The manufacturers of such systems provide 
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring proteins or 

1 5 fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing proteins, or 
random or directed digests of proteinaceous cellular extracts, may be used. In this way 
libraries of proteins may be made for screening in the methods of the invention. Particularly 
preferred in this embodiment are libraries of bacterial, fungal, viral, and mammalian proteins, 
with the latter being preferred, and human proteins being especially preferred. Particularly 

20 useful test compound will be directed to the class of proteins to which the target belongs, e.g., 
substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 30 
amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 to 
about 15 being particularly preferred. The peptides may be digests of naturally occurring 

25 proteins, random peptides, or biased" random peptides. By "randomized" or grammatical 
equivalents herein is meant that the nucleic acid or peptide consists of essentially random 
sequences of nucleotides and amino acids, respectively. Since these random peptides (or 
nucleic acids, discussed below) are often chemically synthesized, they may incorporate a 
nucleotide or amino acid at any position. The synthetic process can be designed to generate 

30 . randomized proteins or nucleic acids, to allow the formation of all or most of the possible 

combinations over the length of the sequence, thus forming a library of randomized candidate 
bioactive proteinaceous agents. 
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In one embodiment, the library is folly randomized, with no sequence preferences or. 

constants at any position. In a preferred embodiment, the library is biased. Thatis,some 

positions within the sequence are either held constant, or are selected from a limited number 

of possibilities. In a preferred embodiment, the nucleotides or amino acid residues .are 

5 randomized within a defined class, e.g., of hydrophobic amino acids, hydrophilic residues, 

stericaUy biased (either small or large) residues, towards the creation of nucleic acid binding 

domains, the creation of cysteines, for cross^linking, prolines for SH-3 domains, serines, , 

threonines, tyrosines or bistidines for phosphorylation sites, etc. 

' Modulators of lung cancer can also be nucleic acids, as defined above. 

10 As described above generally for proteins, nucleic acid modulating agents may be 

naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
Digests of procaryotic or eucaryotic genomes may be used as is outlined above for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical moieties, a 
wide variety of which are available in the literature. 

15 After a candidate agent has been added and the cells allowed to incubate for some, 

period of time, the sample containing a target sequence is analyzed. If required, the target 
sequence is prepared using known techniques. For example, the sample may be treated to 
lyse the cells, using known lysis buffers, electroporation, etc., with purification and/or 
amplification such as PCR performed as appropriate. For example, an in vitro transcription 

20 with labels covalently attached to the nucleotides is performed. Generally, the nucleic acids 
are labeled with biotin-FITC or PE, or with cy3 or cy5 . 

In a preferred embodiment, the target sequence is labeled with, e.g., a fluorescent, a 
chemiluminescent, a chemical, or a radioactive signal, to provide a means of detecting the 
target sequence's specific binding to a probe. The label also can be an enzyme, such as, 

25 alkaline phosphatase or horseradish peroxidase, which when provided with an appropriate 
substrate produces a product that can be detected. Alternatively, the label can be a labeled 
compound or small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or 
altered by the enzyme. The label also can be a moiety or compound, such as, an epitope tag 
or biotin which specifically binds to streptavidin. For the example of biotin, the streptavidin 

30 is labeled as described above, thereby, providing a detectable signal for the bound target 
sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

Nucleic acid assays can be direct hybridization assays or can comprise "sandwich 
assays", which include the use of multiple probes, as is generally outlined in U.S. Patent Nos. 
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5,681;702, 5,597,909, 5,545,730, 5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 
5,591,584, 5,624,802, 5,635,352, 5,594,118, 5,359,100, 5,124,246 and 5,681,697, all of 
which are hereby incorporated by reference. In this embodiment, in general, the target nucleic 
acid is prepared as outlined above, and then added to the biochip comprising a plurality of 
5 nucleic acid probes, under conditions that allow the formation.of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, including 
high, moderate and low stringency conditions as outlined above. The assays are generally 
run under stringency conditions which allow formation of the label probe hybridization 
complex only in the presence of target. Stringency can be controlled by altering a step 
1 0 parameter that is a thermodynamic variable, including, but not limited to, temperature, 
formamide concentration, salt concentration, chaotropic salt concentration, pH, organic 

solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is generally 
outlined in U.S. Patent No. 5,681,697: Thus it may be desirable to perform certain steps at 

15 higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. Components 
of the reaction may be added simultaneously, or sequentially, in different orders, with 
preferred embodiments outlined below. In addition, the reaction may include a variety of 
other reagents. These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc. 

20 which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
used as appropriate, depending on the sample preparation methods and purity of the target. 
The assay data are analyzed to determine the expression levels, and changes in 

25 expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the lung cancer phenotype. In one 
embodiment, screening is performed to identify modulators that can induce or suppress a 
particular expression profile, thus preferably generating the associated phenotype. In another 
embodiment, e.g., for diagnostic applications, having identified differentially expressed genes 

30 important in a particular state, screens can be performed to identify modulators that alter 

expression of individual genes. In an another embodiment, screening is performed to identify 
modulators that alter a biological function of the expression product of a differentially 
expressed gene. Again, having identified the importance of a gene in a particular state, 



61 



WO 02/086443 PCT/US02/1?476 
screens are performed to, identify agents that bind and/or modulate the biological activity of 

the gene product, or evaluate genetic polymorphisms. 

Genes can be screened for those that are induced in response to a candidate agent. 

After identifying a modulator based upon its ability to suppress a lung cancer expression 

5 pattern leading to a normal expression pattern, or to modulate a single lung cancer gene 

expression profile so as to mimic the expression of the gene from normal tissue, a screen as 

described above can be performed to identify genes that are specifically modulated in 

response to the agent. Comparing expression profiles between normal tissue and agent 

treated lung cancer tissue reveals genes that are not expressed in normal tissue or lung cancer 

10 tissue, but are expressed in agent treated tissue. These agent-specific sequences can be 
identified and used by methods described herein for lung cancer genes or proteins. In 
particular these sequences and the proteins they encode find use in marking or identifying 
agent treated cells. In addition, antibodies can be raised against the agent induced proteins 
and used to target novel therapeutics to the treated lung cancer tissue sample. 

15 Thus, in one embodiment, a test compound is administered to a population of lung 

cancer cells, that have an associated lung cancer expression profile. By "administration" or 
"contacting" herein is meant that the candidate agent is added to the cells in such a manner as 
to allow the agent to act upon the cell, whether by uptake and intracellular action, or by 
action at the cell surface. In some embodiments, nucleic acid encoding a proteinaceous 

20 candidate agent (i.e., a peptide) may be put into a viral construct such as an adenoviral or 
retroviral construct, and added to the cell, such that expression of the peptide agent is 
accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems can also be used. 

Once a test compound has been administered to the cells, the cells can be washed if 
desired and are allowed to incubate under preferably physiological conditions for some 

25 period of time. The cells are then harvested and a new gene expression profile is generated, 
as outlined herein. 

Thus, e.g., lung cancer or non-malignant tissue may be screened for agents that 
modulate, e.g., induce or suppress a lung cancer phenotype. A change in at least one gene, 
preferably many, of the expression profile indicates that the agent has an effect on lung 
30 cancer activity. By defining such a signature for the lung cancer phenotype, screens for new 
drugs that alter the phenotype can be devised. With this approach, the drug target need not be 
known and need not be represented in the original expression screening platform, nor does 
the level of transcript for the target protein need to change. 
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Measure of lung cancer polypeptide activity, or of lung cancer or the lung cancer 
phenbtype can be performed using a variety, of assays. For example, the effects of the test 
compounds upon the function of the metastatic polypeptides can be measured by examining 
parameters described above. A suitable physiological change that affects activity can be used 

5 to assess the influence of a test compound on the polypeptides of this invention. When the 
functional consequences are determined using intact cells or animals, one can also, measure a 
variety of effects such as, in the case of lung cancer associated with tumors, tumor growth, 
tumor metastasis, neovascularization, hormone release, transcriptional changes to both known 
and uncharacterized genetic markers (e.g., northern blots), changes in cell metabolism such as 

10 cell growth or pH changes, and changes in intracellular second messengers such as cGMP. In 
the assays of the invention, mammalian lung cancer polypeptide is typically used, e.g., 
mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in vitro. 
For example, a lung cancer polypeptide is first contacted with a potential modulator and 

15 incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, the 
lung cancer polypeptide levels are determined in vitro by measuring the level of protein or 
mRNA The level of protein is typically measured using immunoassays such as western 
blotting, ELISA and the like with an antibody that selectively binds to the lung cancer 
polypeptide or a fragment thereof. For measurement of mRNA, amplification, e.g., using 

20 PCR, LCR, or hybridization assays, e.g., northern hybridization, RNAse protection, dot 
blotting, are preferred. The level of protein or mRNA is typically detected using directly or 
indirectly labeled detection agents, e.g., fluorescently or radioactively labeled nucleic acids, 
radioactively or enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using a lung cancer protein 

25 promoter operably linked to a reporter gene such as luciferase, green fluorescent protein, 
CAT, or p-gal. The reporter construct is typically transfected into a cell. After treatment 
with a potential modulator, the amount of reporter gene transcription, translation, or activity 
is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on individual 

30 genes and gene products (proteins). That is, having identified a particular differentially 

expressed gene as important in a particular state, screening of modulators of the expression of 
the gene or the gene product itself can be done. The gene products of differentially expressed 
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genes are sometimes referred to herein as "lung cancer proteins." The lung cancer protein 

may be a fragment, or alternatively, be the full length protein to a fragment shown herein. 

In one embodiment, screening for modulators of expression of specific genes is 
performed. Typically, the expression of only one or a few genes are evaluated. In another 
5 embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or isolated 
10 gene product is used; that is, the gene products of one or more differentially expressed 

nucleic acids are made. For example, antibodies are generated to the protein gene products, 
and standard immunoassays are run to determine the amount of protein present. Alternatively, 
cells comprising the lung cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a lung cancer 
15 protein and a candidate compound, and determining the binding of the compound to the lung 
cancer protein. Preferred embodiments utilize the human lung cancer protein, although other 
mammalian proteins may also be used, e.g., for the development of animal models of human 
disease. In some embodiments, as outlined herein, variant or derivative lung cancer proteins 
may be used. . 

20 Generally, in a preferred embodiment of the methods herein, the lung cancer protein 

or the candidate agent is non-diffusably bound to an insoluble support, preferably having 
isolated sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble 
supports may be made of a composition to which the compositions can be bound, is readily 
separated from soluble material, and is otherwise compatible with the overall method of 

25 screening. The surface of such supports may be solid or porous and of a convenient shape. 
Examples of suitable insoluble supports include microtiter plates, arrays, membranes and 
beads. These are typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon 
or nitrocellulose, teflon™, etc. Microtiter plates and arrays are especially convenient because 
a large number of assays can be carried out simultaneously, using small amounts of reagents 

30 and samples. The particular manner of binding.of the composition is typically not crucial so 
long as it is compatible with the reagents and overall methods of the invention, maintains the 
activity of the composition, and is nondiffusable. Preferred methods of binding include the 
use of antibodies (which do not sterically block either the ligand binding site or activation 
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sequence when the protein is bound to the support), direct binding to "sticky" or ionic 

supports, chemical crosslinking, the synthesis of the protein or agent on the surface, etc. 
Following binding of the protein or agent, excess unbound material is removed by washing. 
The sample receiving areas may then be blocked through incubation with bovine serum 
5 albumin (BSA), casein or other innocuous protein or other moiety. 

In a preferred embodiment, the lung cancer protein is bound to the support, and a test 
compound is added to the assay. Alternatively, the candidate agent is bound to the support 
and the lung cancer protein is added. Novel binding agents include specific antibodies, non- 
natural binding agents identified in screens of chemical libraries, peptide analogs, etc. Of 

10 particular interest are screening assays for agents that have a low toxicity for human cells. A 
wide variety of assays may be used for this purpose, including labeled in vitro protein-protein 
binding assays, electrophoretic mobility shift assays, immunoassays for protein binding, 
functional assays (phosphorylation assays, etc.) and the like. 

The determination of the binding of the test modulating compound to the lung cancer 

15 protein may be done in a number of ways. In a preferred embodiment, the compound is 

labeled, and binding determined directly, e.g., by attaching all or a portion of the lung cancer 
protein to a solid support, adding a labeled candidate agent (e.g., a fluorescent label), washing 
off excess reagent, and determining whether the label is present on the solid support. Various 
blocking and washing steps may be utilized as appropriate. 

20 In some embodiments, only one of the components is labeled, e.g., the proteins (or 

proteinaceous candidate compounds) can be labeled. Alternatively, more than one 
component can be labeled with different labels, e.g., ,25 I for the proteins and a fluorophor for 
the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

25 In one embodiment, the binding of the test compound is determined by competitive 

binding assay. The competitor may be a binding moiety known to bind to the target molecule 
(i.e., a lung cancer protein), such as an antibody, peptide, binding partner, ligand, etc. Under 
certain circumstances, there may be competitive binding between the compound and the 
binding moiety, with the binding moiety displacing the compound. In one embodiment, the 

30 test compound is labeled. Either the compound, or the competitor, or both, is added first to 
the protein for a time sufficient to allow binding, if present. Incubations may be performed at 
a temperature which facilitates optimal activity, typically between 4 and 40° C. Incubation 
periods are typically optimized, e.g., to facilitate rapid high throughput screening. Typically 
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between 0. 1 and 1 hour will be sufficient. Excess reagent is generally removed or washed 

away. The second component is then added, and the presence or absence of the labeled 

component is followed, to indicate binding. 

m a preferred embodiment, the competitor is added first, followed by a test 

5 compound. Displacement of the competitor is an indication that the test compound is binding 
to the lung cancer protein and thus is capable of binding to, and potentially modulating, the 
activity of the lung cancer protein. In this embodiment, either component can be labeled. 
Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates 
displacement by the agent. Alternatively, if the test compound is labeled, the presence of the . 

10 label on the support indicates displacement. 

In an alternative embodiment, the test compound is added first, with incubation and 
washing, followed by the competitor. The absence of binding by the competitor may indicate 
that the test compound is bound to the lung cancer protein with a higher affinity. Thus, if the 
test compound is labeled, the presence of the label on the support, coupled with a lack of 

1 5 competitor binding, may indicate that the test compound is capable of binding to the lung 
cancer protein. 

In a preferred embodiment, the methods comprise differential screening to identity 
agents that are capable of modulating the activity of the lung cancer proteins. In one 
embodiment, the methods comprise combining a lung cancer protein and a competitor in a 

20 first sample. A second sample comprises a test compound, a lung cancer protein, and a 

competitor. The binding of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the lung cancer protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 

25 agent is capable of binding to the lung cancer protein. 

Alternatively, differential screening is used to identify drug candidates that bind to the 
native lung cancer protein, but cannot bind to modified lung cancer proteins. The structure of 
the lung cancer protein may be modeled, and used in rational drug design to synthesize agents 
that interact with that site. Drug candidates that affect the activity of a lung cancer protein 

30 are also identified by screening drugs for the ability to either enhance or reduce the activity of 
the protein. 

Positive controls and negative controls may be used in the assays. Preferably control 
and test samples are performed in at least triplicate to obtain statistically significant results. 
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Incubation of all samples is for a time sufficient for the binding of the agent to the protein- 

Following incubation, samples are washed free of non-specifically bound material and the 

amount of bound, generally labeled agent determined. For example, where a radiolabel is 

employed, the samples may be counted in a scintillation counter to determine the amount of 

5 bound compound. 

A variety of other reagents may be included in the screening assays. These include 
reagents like salts, neutral proteins, e.g., albumin, detergents, etc. which may be used to 
facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
1 0 protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a lung cancer protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising lung cancer 
1 5 proteins. Preferred cell types include almost any cell. The cells contain a recombinant 
nucleic acid that encodes a lung cancer protein. In a preferred embodiment, a library of 
candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous or 
subsequent exposure of physiological signals, e.g., hormones, antibodies, peptides, antigens, 
20 cytokines, growth factors, action potentials, pharmacological agents including 

chemotherapeutics, radiation, carcinogenics, or other cells (e.g., cell-cell contacts). In another 
example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate lung cancer agents are identified. Compounds 
with pharmacological activity are able to enhance or interfere with the activity of the lung 
25 cancer protein. Once identified, similar structures are evaluated to identify critical structural 
feature of the compound. 

In one embodiment, a method of inhibiting lung cancer cell division is provided. The 
method comprises administration of a lung cancer inhibitor. In another embodiment, a 
method of inhibiting lung cancer is provided. The method may comprise administration of a 
30 lung cancer inhibitor. In a further embodiment, methods of treating cells or individuals with 
lung cancer are provided, e.g., comprising administration of a lung cancer inhibitor. 

In one embodiment, a lung cancer inhibitor is an antibody as discussed above. In 
another embodiment, the lung cancer inhibitor is an antisense molecule. 
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A variety of cell growth, proliferation, viability, and metastasis assays are known to 

those of skill in the art, as described below. 

Soft agar growth or colony formation in suspension 
5 Normal cells require a solid substrate to attach and grow. When the cells are 

transformed, they lose this phenotype and grow detached from the substrate. For example,, 
transformed cells can grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft agar. The transformed cells, when transfected with tumor 
suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 

10 grow. Soft agar growth or colony formation in suspension assays can be used to identify 
modulators of lung cancer sequences, which when expressed in host cells, inhibit abnormal 
cellular proliferation and transformation. A therapeutic compound would reduce or eliminate 
the host cells' ability to grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft. 

1 5 Techniques for soft agar growth or colony formation in suspension assays are 

described in Freshney (1994) Culture of Animal Cells a Manual of Basic Technique (3 rd ed.), 
herein incorporated by reference. See also, the methods section of Garkavtsev, et al. (1996), 
supra, herein incorporated by reference. 

20 Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until they 
touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a 

25 higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surrounding cells. Alternatively, labeling index with ( 3 H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a 

30 normal phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with ( 3 H)-thymidine at saturation density is a preferred 
method of measuring density limitation of growth. Transformed host cells are transfected 
with a lung cancer-associated sequence and are grown for 24 hours at saturation density in 
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non-limiting medium conditions. The percentage of cells labeling with ( 3 H)-thymidine is 

determined autoradiographically. See, Freshney (1994), supra. 

Growth factor or serum dependence 
5 Transformed cells typically have a lower serum dependence than their normal 

counterparts (see, e.g., Temin (1966) J. Natl. Cancer Insti. 37:167-175; Eagle, et al. (1970) I 
Exp. Med. 13 1 :836-879); Freshney, supra. This is in part due to release of various growth 
factors by the transformed cells. Growth factor or serum dependence of transfoimed host 
cells can be compared with that of control. 

10 

Tumor specific markers levels 

Tumor cells release an increased amount of certain factors (hereinafter "tumor 
specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells (see, e.g., 

1 5 Gullino, " Angiogenesis, tumor vascularization, and potential interference with tumor growth" 
in Mihich (ed. 1985) Biological Responses in Cancer, pp. 178-184). Similarly, Tumor 
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 
counterparts. See, e.g., Folkman (1992) "Angiogenesis and Cancer" in Sem Cancer BioU . 
Various techniques which measure the release of these factors are described in 

20 Freshney (1994), supra. Also, see, Unkeless, et al. (1974) J. Biol. Chem. 249:4295-4305; 
Strickland and Beers (1976) J. Biol. Chem . 251:5694-5702; Whur, et al. (1980) Br. J. Cancer 
42:305-312; Gullino, "Angiogenesis, tumor vascularization, and potential interference with 
tumor growth" in Mihich (ed. 1985) Biological Responses in Cancer, pp. 178-184; Freshney 
Anticancer Res. 5:11 1-130 (1985). 

25 

Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix 
constituent can be used as an assay to identify compounds that modulate lung cancer- 
associated sequences. Tumor cells exhibit a good correlation between malignancy and 
30 invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this 
assay, tumprigenic cells are typically used as host cells. Expression of a tumor suppressor 
gene in these host cells would decrease invasiveness of the host cells. 
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Techniques described in Freshney (1994), supra, can be used. Briefly, the level of 

invasion of host cells can be measured by using filters coated with Matrigel or some other • 

extracellular matrix constituent. Penetration into the gel, or through to the distal side of the 

filter, is rated as invasiveness, and rated histologically by number of cells and distance 

5 moved, or by prelabeling the cells with 125 I and counting the radioactivity on the distal side of 

the filter or bottom of the dish. See, e.g., Freshney (1984), supra. 

Tumor growth in vivo 

Effects of lung cancer-associated sequences on cell growth can be tested in transgenic 

1 0 or immune-suppressed mice. Knock-out transgenic mice can be made, in which the: lung 
cancer gene is disrupted or in which a lung cancer gene is inserted. Knock-out transgenic 
mice can be made by insertion of a marker gene or other heterologous gene into the 
endogenous lung cancer gene site in the mouse genome via homologous recombination. 
Such mice can also be made by substituting the endogenous lung cancer gene with a mutated 

1 5 version of the lung cancer gene, or by mutating the endogenous lung cancer gene, e.g., by 

exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 

containing the newly engineered genetic lesion are injected into a host mouse embryo, which 
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 

20 that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the 
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic 
lesion (see, e.g., Capecchi, et al. (1989) Science 244:1288). Chimeric targeted mice can be < 
derived according to Hogan, et al. (1988) Manipulating the Mouse Embryo: A Laboratory 
Manual Cold Spring Haibor Laboratory and Robertson (ed. 1987) Teratocarcinomas and 

25 Embryonic Stem Cells: A Pr actical Approach. , IRL Press, Washington, D.C. 

Alternatively, various immune-suppressed or immune-deficient host animals can be 
used. For example, genetically athymic "nude" mouse (see, e.g., Giovanella, et al. (1974) I 
Natl. Cancer Inst. 52:921), a SCID mouse, a thymectomized mouse, or an irradiated mouse 
(see, e.g., Bradley, et al. (1978) Br. J. Cancer 38:263; Selby, et al. (1980) Br. J. Cancer 41 :52) 

30 can be used as a host. Transplantable tumor cells (typically about 10 6 cells) injected into 

isogenic hosts will produce invasive tumors in a high proportions of cases, while normal cells 
of similar origin will not. In hosts which developed invasive tumors, cells expressing a lung 
cancer-associated sequences are injected subcutaneously. After a suitable length of time, 
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preferably 4-8 weeks, tumor growth is measured (e.g., by volume or by its two largest . - 
dimensions) and compared to the control. Tumors that have statistically significant reduction 
(using, e.g., Student's T test) are said to have inhibited growth. 

5 Polynucleotide modulators of lung cancer 

Antisense and RNAi Polynucleotides 

In certain embodiments, the activity of a lung cancer-associated protein is 
downregulated, or entirely inhibited, by the use of antisense or an inhibitory polynucleotide, 
i.e., a nucleic acid complementary to, and which can preferably hybridize specifically to, a 

10 coding mRNA nucleic acid sequence, e.g., a lung cancer protein mRNA, or a subsequence 
thereof. Binding of the antisense polynucleotide to the mRNA reduces the translation and/or 
stability of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise naturally- 
occurring nucleotides, or synthetic species formed from naturaUy-occurring subunits or their 

15 close homologs. Antisense polynucleotides may also have altered sugar moieties or inter- 
sugar linkages. Exemplary among these are the phosphorothioate and other sulfur containing 
species which are known for use in the art. Analogs are comprehended by this invention so 
long as they function effectively to hybridize with the lung cancer protein mRNA. See, e.g., 
Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

20 Such antisense polynucleotides can readily be synthesized using recombinant means, 

or can be synthesized in vitro. Equipment for such synthesis is sold by several vendors, 
including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 
Antisense molecules as used herein include antisense or sense oligonucleotides. 

25 Sense oligonucleotides can, e -g., be employed to block transcription by binding to the anti- 
sense strand. The antisense and sense oligonucleotide comprise a single-stranded nucleic 
acid sequence (either RNA or DNA) capable of binding to target mRNA (sense) or DNA 
(antisense) sequences for lung cancer molecules. A preferred antisense molecule is for a lung 
cancer sequence in the tables, or for a ligand or activator thereof. Antisense or sense 

30 oligonucleotides, according to the present invention, comprise a fragment generally at least 
about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive, an 
antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given protein 
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is described in, e.g., Stein and Cohen (1988) Cancer Res. 48:2659 and van der Krol, et al. 

(1988) BioTediniaues 6:958). 

: RNA interference is a mechanism to suppress gene expression in a sequence specific 
manner. See, e.g., Brumelkamp, et al. (2002) Sciencexpress (21March2002); Shaip (1999) 
5 Genes Dev. 13:139-141; and Cathew (2001) Cum Op. CeU Biol 13:244-248. In mammalian 
cells, short, e.g., 21 nt, double stranded small interfering RNAs (siRNA) have been shown to 
be effective at inducing an RNAi response. See, e.g., Elbashir, et al. (2001) Nature 41 1;494- 
498. The mechanism may be used to downregulate expression levels of identified genes, e.g., 
treatment of or validation of relevance to disease. 

10 

Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and inhibit 
transcription of lung cancer-associated nucleotide sequences. A ribozyme is an RNA 
molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes have 

1 5 been described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes, 
RNase P, and axhead ribozymes (see, e.g., Castanotto, et al. (1994) Adv. in Pharmacology 
25: 289-3 1 7 for a general review of the properties of different ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel, et al. (1990) 
Nucl. Acids Res. 18:299-304; European Patent Publication No. 0 360 257; U.S. Patent No. 

20 5,254,678. Methods of preparing are well known to those of skill in the art (see, e.g., WO 
94/26877; Ojwang, et al. (1993) Proc. Natl. Acad. Sci. USA 90:6340-6344; Yamada, et al. 
(1994) Human Gene Therapy 1:39-45; Leavitt, et al. (1995) Proc. Natl. Acad. Sci. USA 
92:699-703; Leavitt, et al. (19994) Human Gene Therapy 5:1 151-120; and Yamada, et al. 
(1994) Virology 205: 121-126). 

25 Polynucleotide modulators of lung cancer may be introduced into a cell containing the 

target nucleotide sequence by formation of a conjugate with a ligand binding molecule, as 
described in WO 91/04753. Suitable ligand binding molecules include, but are not limited to, 
cell surface receptors, growth factors, other cytokines, or other ligands that bind to cell 
surface receptors. Preferably, conjugation of the ligand binding molecule does not 

30 substantially interfere with the ability of the ligand binding molecule to bind to its 

corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of lung 
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by 
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fonnation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 

understood that the use of antisense molecules or knock out and knock in models may also be 

used in screening assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, metho ds of modulating lung cancer in cells or organisms 

5 are provided. In one embodiment, the methods comprise administering to a cell an anti-liing 

cancer antibody that reduces or eliminates the biological activity of an endogenous lung 

cancer protein. Alternatively, the methods comprise administering to a cell or organism a 

recombinant nucleic acid encoding a lung cancer protein. This may be accomplished in any 

number of ways. In a preferred embodiment, e.g., when the lung cancer sequence is down- 

1 0 regulated in lung cancer, such state may be reversed by increasing the amount of lung cancer 
gene product in the cell. This can be accomplished, e.g., by overexpressing the endogenous 
lung cancer gene or administering a gene encoding the lung cancer sequence, using known 
gene-therapy techniques. In a preferred embodiment/the gene therapy techniques include the 
incorporation of the exogenous gene using enhanced homologous recombination (EHR), e.g., 

15 as described in PCT/US93/03868, hereby incorporated by reference in its entirety. 

Alternatively, e.g., when the lung cancer sequence is up-regulated in lung cancer, the activity 
of the endogenous lung cancer gene is decreased, e.g., by the administration of a lung cancer 
antisense or RNAi nucleic acid. 

In one embodiment, the lung cancer proteins of the present invention may be used to 

20 generate polyclonal and monoclonal antibodies to lung cancer proteins. Similarly, the lung 
cancer proteins can be coupled, using standard technology, to affinity chromatography 
columns. These columns may then be used to purify lung cancer antibodies useful for 
production, diagnostic, or therapeutic purposes. In a preferred embodiment, the antibodies 
are generated to epitopes unique to a lung cancer protein; that is, the antibodies show little or 

25 no cross-reactivity to other proteins. The lung cancer antibodies may be coupled to standard 
affinity chromatography columns and used to purify lung cancer proteins. The antibodies 
may also be used as blocking polypeptides, as outlined above, since they will specifically 
bind to the lung cancer protein. 

30 Methods of identifying variant lung cancer-associated sequences 

Without being bound by theory, expression of various lung cancer sequences is 
correlated with lung cancer. Accordingly, disorders based on mutant or variant lung cancer 
genes may be determined. In one embodiment, the invention provides methods for 
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identifying cells containing variant lung cancer genes, e.g., determining all or part of the . 

sequence of at least one endogenous lung cancer genes in a cell. In a preferred embodiment, 

the invention provides methods of identifying the lung cancer genotype of an individual, e.g., 

determining all or part of the sequence of at least one lung cancer gene of the individual. - 

5 This is generally done in at least one tissue of the individual, and may include the evaluation 

of a number of tissues or different samples of the same tissue. The method may include 

comparing the sequence of the sequenced lung cancer gene to a known lung cancer gene, i.e., 

a wild-type gene. 

The sequence of all or part of the lung cancer gene can then be compared to the 
10 sequence of a known lung cancer gene to determine if any differences exist. This can be 
done using known homology programs, such as Bestfit, etc. In a preferred embodiment, the 
presence of a difference in the sequence between the lung cancer gene of the patient and the 
known lung cancer gene correlates with a disease state or a propensity for a disease state, as 
outlined herein. 

15 In a preferred embodiment, the lung cancer genes are used as probes to determine the 

number of copies of the lung cancer gene in the genome. 

In another preferred embodiment, the lung cancer genes are used as probes to 

determine the chromosomal localization of the lung cancer genes. Information such as 

chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
20 chromosomal abnormalities such as translocations, and the like are identified in the lung 

cancer gene locus. 

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a lung cancer protein or 
25 modulator thereof, is administered to a patient. By "therapeutically effective dose" herein is 
meant a dose that produces effects for which it is administered. The exact dose will depend 
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 
known techniques (e.g., Ansel, et al. (1992) Pharmaceutical Dosage Forms and Drug 
Delivery: Lieberman, Pharmaceutical Dosage Forms (vols. 1-3), Dekker, ISBN 0824770846, 
30 082476918X, 0824712692, 0824716981; Lloyd (1999) The Art. Science and Technology of 
Pharmaceutical Compounding : and Pickar (1999) Dosage Calculations^ . Adjustments for 
lung cancer degradation, systemic versus localized delivery, and rate of new protease 
synthesis, as well as the age, body weight, general health, sex, diet, time of administration, 
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drug interaction and the severity of the condition may be necessary, and will be ascertainable 

with routine experimentation by those skilled in the art. 

A "patient" for the purposes of the present invention includes both humans and other 

animals, particularly mammals. Thus the methods areapplicabie to both human therapy, and 

5 veterinary applications. In the preferred embodiment the patient is a mammal, preferably a 

primate, and in the most preferred embodiment the patient is human. 

The administration of the lung cancer proteins and modulators thereof of the present 

invention can be done in a variety of ways, including, but not limited to, orally, 

subcutaneously, intravenously, intranasally, transdermally, intraperitoneally, intramuscularly, 

1 0 intrapulmonary, vaginally, rectally, or intraocularly. In some instances, e.g., in the treatment 
of wounds and inflammation, the lung cancer proteins and modulators may be directly 
applied as a solution or spray. 

The pharmaceutical compositions of the present invention comprise a lung cancer 
protein in a form suitable for administration to a patient. In the preferred embodiment, the 

15 pharmaceutical compositions are in a water soluble form, such as being present as 

pharmaceutically acceptable salts, which is meant to include both acid and base addition 
salts. 'Tharmaceutically acceptable acid addition salt" refers to those salts that retain the 
biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 

20 sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
'Tharmaceutically acceptable base addition salts" include those derived from inorganic bases 

25 such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminum salts and the like. Particularly preferred are the ammonium, 
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 

30 ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 
tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the following: 
carrier proteins such as serum albumin; buffers; fillers such as microcrystalline cellulose, 
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lactose, corn and other starches; binding agents; sweeteners and other flavoring agents; - 

coloring agents; and polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit dosage 

forms depending upon the method of administration. For example, unit dosage forms 

5 suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 

and lozenges. It is recognized that lung cancer protein modulators (e.g., antibodies, antisense 

constructs, ribozymes, small organic molecules, etc.) when administered orally, should be 

protected from digestion. This is typically accomplished either by complexing the 

molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 

1 0 packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art. 

The compositions for administration will commonly comprise a lung cancer protein 
modulator dissolved in a pharmaceutical^ acceptable carrier, preferably an aqueous carrier. 
A variety of aqueous carriers can be used, e.g., buffered saline and the like. These solutions 

15 are sterile and generally free of undesirable matter. These compositions may be sterilized by 
conventional, well known sterilization techniques. The compositions may contain 
pharmaceutical^ acceptable auxiliary substances as required to approximate physiological 
conditions such as pH adjusting and buffering agents, toxicity adjusting agents and the like, 
e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, sodium lactate 

20 and the like. The concentration of active agent in these formulations can vary widely, and 
will be selected primarily based on fluid volumes, viscosities, body weight and the like in 
accordance with the particular mode of administration selected and the patient's needs (e.g.. 
Remington's Pharmaceutical Science (15th ed., 1980) and Hardman, et al. (eds. 1996) 
Goodman and Gilman: The Pharmacologial Basis of Therapeutics). 

25 Thus, a typical pharmaceutical composition for intravenous administration would be 

about 0. 1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per patient per 
day may be used, particularly when the drug is administered to a secluded site and not into 
the blood stream, such as into a body cavity or into a lumen of an organ. Substantially higher 
dosages are possible in topical administration. Actual methods for preparing parenterally 

30 administrable compositions will be known or apparent to those skilled in the art, e.g., 

Remington's Pharmaceutical Science and Goodman and Gilman. The Pharmacologial Basis 
of Therapeutics, supra. 
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The compositions containing modulators of lung cancer proteins can be administered 

for therapeutic or prophylactic treatments. In therapeutic applications, compositions are 

administered to a patient suffering from a disease (e.g., a cancer) in an amount sufficient to 

cure or at least partially arrest the disease and its complications. An amount adequate to 

5 accomplish this is defined as a "therapeutically effective dose." Amounts effective for this 

use will depend upon the severity of the disease and the general state of the patient's health. 

Single or multiple administrations of the compositions may be administered depending on the 

dosage and frequency as required and tolerated by the patient. In any event, the composition 

should provide a sufficient quantity of the agents of this invention to effectively treat the 

10 patient. An amount of modulator that is capable of preventing or slowing the development of 
cancer in a mammal is referred to as a "prophylactically effective dose." The particular dose 
required for a prophylactic treatment will depend upon the medical condition and history of ; 
the mammal, the particular cancer being prevented, as well as other factors such as age, 
weight, gender, administration route, efficiency, etc. Such prophylactic treatments may be 

15 used, e.g.i in a mammal who has previously had cancer to prevent a recurrence of the cancer, 
or in a mammal who is suspected of having a significant likelihood of developing cancer 
based, at least in part, upon gene expression profiles. Vaccine strategies may be used, in 
either a DNA vaccine form, or protein vaccine. 

It will be appreciated that the present lung cancer protein-modulating compounds can 

20 be administered alone or in combination with additional lung cancer modulating compounds 
or with other therapeutic agent, e.g., other anti-cancer agents or treatments. 

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
comprising nucleic acid sequences set forth in the tables, such as antisense or RNAi 
polynucleotides or ribozymes, will be introduced into cells, in vitro or in vivo. The present 

25 invention provides methods, reagents, vectors, and cells useful for expression of lung cancer- 
associated polypeptides and nucleic acids using in vitro (cell-free), ex vivo, or in vivo (cell or 
organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 

30 introducing foreign nucleotide sequences into host cells may be used. These include the use 
of calcium phosphate transfection,spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors and other well known methods for introducing cloned genomic 
DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, e.g., 



77 



WO 02/086443 PCT/US02/12476 
Berger and Kimmeh Guide to Molecular Cloning Techniques. M ethods in Enzvmologv 

volume 152 (Berger), Ausubel, et al. (eds. 1999) Current Protocols (supplemented through 

. 1999), and Sambrook, et al. (1989) Molecular Cloning - A Laboratory Manual (2nd ed., Vol. 

1-3). 

5 In a preferred embodiment, lung cancer proteins and modulators are administered as 

therapeutic agents, and can be formulated as outlined above. Similarly, lung cancer genes 
(including both the full-length sequence, partial sequences, or regulatory sequences of the 
lung cancer coding regions) can be administered in a gene therapy application. These lung 
cancer genes can include antisense or inhibitory applications, e.g., as inhibitory RNA or gene 

1 0 therapy (e.g., for incorporation into the genome) or as antisense compositions. 

Lung cancer polypeptides and polynucleotides can also be administered as vaccine 
compositions to stimulate HTL, CTL, and antibody responses.. Such vaccine compositions 
can include, e.g., lipidated peptides (see, e.g.,Vitiello, et al (1995) J. Clin. Invest. 95:341), 
peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") microspheres 

15 (see, e.g., Eldridge, et al. (1991) Molec. Immunol. 28:287-294; Alonso, et al. (1 994V Vaccine 
12:299-306; Jones, et al. (1995) Vaccine 13:675-681), peptide compositions contained in 
immune stimulating complexes (ISCOMS) (see, e.g., Takahashi, et al. (1990) Nature 
344:873-875; Hu, et al. (1998) Clin Exp Immunol 1 13:235-243), multiple antigen peptide 
systems (MAPs) (see, e.g., Tarn (1 988V Proc. Natl. Acad. Sci. U.S.A. 85:5409-5413; Tarn 

20 (1996) J. Immunol, Methods 196:17-32), peptides formulated as multivalent peptides; 

peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery 
vectors (Perkus, et al., p. 379 In: Kaufinann (ed. 1996) Concepts in v accine development: 
Chakrabarti, et al. (1986) Nature 320:535; Hu, et al. (1986) Nature 320:537; Kieny, et al. 
(1986) AIDS Bio/Technoloev 4:790; Top, et al. (1971) J. Infect. Pis. 124:148; Chanda, et al. 

25 (1990) Virology 175:535), particles of viral or synthetic origin (see, e.g., Kofler, et al. (1996) 
J. Immunol. Methods 192:25; Eldridge, et al. (1993) Sem. Hematol. 30:16; Falo, et al. (1995) 
Nature Med. 7:649), adjuvants (Warren, et al. (1986) Annu. Rev. Immunol. 4:369; Gupta, et 
aL (1993) Vaccine 1 1 :293), liposomes (Reddy, et al. (1992) J. Immunol. 148:1585; Rock 
(1996) Immunol. Today 17:131), or, naked or particle absorbed cDNA (Ulmer, et al. (1993) 

30 Science 259:1745; Robinson, et al. (1993) Vaccine 1 1 :957; Shiver, et al., p. 423 In: 

Kaufinann (ed. 1996) Concepts in vaccine development : Cease and Berzofsky (1994) Annu. 
Rev. Immunol. 12:923 and Eldridge, et al. (1993) Sem. Hematol. 30:16). Toxin-targeted 
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delivery technologies, also known as receptor mediated targeting, such as those of Avant 
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. . , .. 

i Vaccine compositions often include adjuvants. Many adjuvants contain a substance 
designed to protect the antigen from rapid catabolism, such as aluminum hydroxide or 

5 mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially, available 
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 
MT); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline ; 
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 

1 0 aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 

1 5 Vaccines can be administered as nucleic acid compositions wherein DNA or RNA 

encoding one or more of the polypeptides, or a fragment thereof, is administered to a patient. 
This approach is described, for instance, in Wolff, et. al. (1990) Science 247:1465 as well as 
U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; WO 
98/04720; and in more detail below. Examples of DNA-based delivery technologies include 

20 "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, cationic lipid 
complexes, and particle-mediated ("gene gun") or pressure-mediated delivery (see, e.g., U.S. 
Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the invention 
can be expressed by viral or bacterial vectors. Examples of expression vectors include . 

25 attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
vaccinia virus, e.g., as a vector to express nucleotide sequences that encode lung cancer 
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant 
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. . 

30 Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover, et al. (1991) Nature 351:456-460. A wide variety of other vectors useful 
for therapeutic administration or immunization e.g., adeno and adeno-associated virus 
vectors, retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the 
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like, will be apparent to those skilled in the art from the description herein (see, e.g., Shata, et 
al. (2000) Mol Med Today 6:66-71; Shedlock, et al. (2000) J. Leukoc. Biol. 68:793-806; 
Hipp, et al. (2000) biViyo 14:571-85). 

Methods for the use of genes as DNA vaccines are well known, and include placing a 
5 lung cancer gene or portion of a lung cancer gene under the control of a regulatable promoter 
or a tissue-specific promoter for expression in a lung cancer patient. The lung cancer gene 
used for DNA vaccines can encode full-length lung cancer proteins, but more preferably 
encodes portions of the lung cancer proteins including peptides derived from the lung cancer 
protein. In one embodiment, a patient is immunized with a DNA vaccine comprising a 

10 plurality of nucleotide sequences derived from a lung cancer gene. For example, lung cancer- 
associated genes or sequence encoding subfragments of a lung cancer protein are introduced 
into expression vectors and tested for their immunogenicity in the context of Class I MHC 
and an ability to generate cytotoxic T cell responses. This procedure provides for production 
of cytotoxic T cell responses against cells which present antigen, including intracellular 

15 epitopes. 

In a preferred embodiment, DNA vaccines include a gene encoding an adjuvant 
molecule with the DNA vaccine. Such adjuvant molecules include cytokines that increase 
the immunogenic response to the lung cancer polypeptide encoded by the DNA vaccine. 
Additional or alternative adjuvants are available. 

20 In another preferred embodiment lung cancer genes find use m generating animal 

models of lung cancer. When the lung cancer gene identified is repressed or diminished in 
metastatic tissue, gene therapy technology, e.g., wherein antisense or inhibitory RNA directed 
to the lung cancer gene will also diminish or repress expression of the gene. Animal models 
of lung cancer find use in screening for modulators of a lung cancer-associated sequence or 

25 modulators of lung cancer. Similarly, transgenic animal technology including gene knockout 
technology, e.g., as a result of homologous recombination with an appropriate gene targeting 
vector, will result in the absence or increased expression of the lung cancer protein. When 
desired, tissue-specific expression or knockout of the lung cancer protein may be necessary. 
Ifcis also possible that the lung cancer protein is overexpressed in lung cancer. As 

30 such, transgenic animals can be generated that overexpress the lung cancer protein. 

Depending on the desired expression level, promoters of various strengths can be employed 
to express the transgene. Also, the number of copies of the integrated transgene can be 
determined and compared for a determination of the expression level of the transgene.. 
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Animals generated by such methods will find use as animal models of lung cancer and are 
additionally useful in screening for modulators to treat lung cancer. 

Kits for Use in Diagnostic and/or Prognostic Applications 

5 1 For use in diagnostic, research, and therapeutic applications suggested above, kits are 
also provided by the invention. In diagnostic and research applications such kits may include 
at least one of the following: assay reagents, buffers, lung cancer-specific nucleic acids or 
antibodies, hybridization probes and/or primers, antisense polynucleotides, ribozymes, RNAi, 
dominant negative lung cancer polypeptides or polynucleotides, small molecule inhibitors of 

10 lung cancer-associated sequences, etc. A therapeutic product may include sterile saline or 
another pharmaceutically acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing instructions (e.g., 
protocols) for the practice of the methods of this invention. While the instructional materials 
typically comprise written or printed materials they are not limited to such. A medium 

15 capable of storing such instructions and communicating them to an end user is contemplated 
by this invention. Such media include, but are not limited to electronic storage media (e.g., 
magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. Such 
media may include addresses to internet sites that provide such instructional materials. 
The present invention also provides for kits for screening for modulators of lung 

20 cancer-associated sequences. Such kits can be prepared from readily available materials and 
reagents. For example, such kits can comprise one or more of the following materials: a lung 
cancer-associated polypeptide or polynucleotide, reaction tubes, and instructions for testing 
lung cancer-associated activity. Optionally, the kit contains biologically active lung cancer 
protein. A wide variety of kits and components can be prepared according to the present 

25 invention, depending upon the intended user of the kit and the particular needs of the user. 
Diagnosis would typically involve evaluation of a plurality of genes or products. The genes 
typically will be selected based on correlations with important parameters in disease which 
may be identified in historical or outcome data. 
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Example 1: Gene Chip Analysis 

Molecular profiles of various normal and cancerous tissues were determined and 
analyzed using gene chips. RNA was isolated and gene chip analysis was performed as 
described (Glynne, et al. (2000) Nature 403:672-676; Zhao, et al. (2000) GgnesDev, 14:! 
993). 
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Tables 1A and 1B were previously filed on April 18, 2001 in USSN KM284|770 (IBSOI-OOISOOUS) and on November 29, 2001 in 
(18501-001520US) 
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oDaonucteoMes were designed. Geneclusteis were compiled using sequences denved from Genbank ESTs and mRNAs. 
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Tables 2A-8C were previously filed on November 9, 2001 in USSN 60/339,245 (18501-004100US) 

Table 2A shows 504 genes down-regulated in lung tumors retaSve to normal hmg and chronically diseased lung. Chronically leased lung ^^^^ 
ntfwBnl lung diseases such as fibrosis, emphysema, end bronchitis. These genes were selected from 59580 probesets on ^ Jc^^etix . Gene 
expression data for each probeset obtained from this analysis was expressed as average- intensity (AT), a normalized vahie reflecting the relaQve level of mRNA expression. 

Ptey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UntgenelD: Unigene number 

Unlgene Title: Unigene gene CBe M , 

R1: 90th percentrTe of Al for normal lung samples divided by the 80th percenHe of Al for adenaarcinoma and squamous cell carcinoma lung tumor 

R2 Sanof Al for normal hmg samples divided by 90th percenHe of Al for adenocarcinoma and squamous cell carcinoma lung tumor samples. 

Rl median of Al for norma) lung samples minus the 1 5th percentile of Al lor all normal lung, chronically diseased lung and tumor samples divided by 

the 90th percenHe of Al for adenocarcinoma and squamous cell carcinoma lung tumor samples minus the 15th percentile of Al for all normal 

lung, chronically diseased lung and tumor samples. ■ 

R4: average of Al for normal lung samples divided by average Al for squamous cell carcinoma and adenocarcinoma lung tumors. 

R5: median of Al for normal lung samples divided fay Ow 90lh peicenlile of far adanocannnomas. - 

R5: median of Al for normal lung samples minus the 1 5th percentSe of Al for all normal lung, chronically diseased lung and tumor samples divided by the SDtfl 

percentile of Al for adenocarcinomas minus the 15th percentile of Al for all normal lung, chronically diseased lung and tumor samples. 
R7: average ofAl for normal lung samples divided by the 90th percentile of Al for so^an^ , AJJ ^u „.-o™. 

R8: median of Al for normal lung samples minus the 1 5lh percentile of Al for all normal rung, chronically diseased lung and tumor samples divided by the 90th 

percentile of Al for squamous cell carcinomas minus ttie 15th percentSe of Al for all normal lung, chronically diseased lung and tumor samples. 
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AJ131245 

AF017987 

AL134O30 

U41518 

BE149455 

NMJJQ1B72 

T58485 

AF035718 

L34657 

AW175787 

A137258B 

AA285135 

AI873257 



Hs.202949 

Hs.237868 

Hs.109843 

Hs.330780 

Hs.279772 

Hs.1 10903 

Hs.198726 

Hs.1154 

Hs.172847 

Hs.1 04105 

Hs.1369 

Hs.1 378 

Hs.274463 

Hs.267695 

Hs.153614 

Hs.153863 

Hs.15430 

Hs.155597 

Hs.214507 

Hs.12696 

Hs.20912 

Hs.22039 

Hs.202949 

HS.28B650 

Hs.268744 

Hs.22588 

Hs.246218 

Hs.22824 

Hs.23735 

Hs,1 84482 

Hs.23912 

Hs.2420 

Hs.23767 

Hs.25227 

Hs,4 

Hs.25274 

Hs.25956 

Hs.26270 

Hs.27263 

Hs.27373 

Hs.253395 

Hs.28564 

Hs.323092 

Hs.30089 

Hs.30509 

Hs.30514 

Hs.30941 

Hs.3107 

Hs.31595 

Hs.107767 

Hs.164577 

Hs.33010 

Hs.184245 

Hs.3731 

Hs.38489 

Hs.42721 

Hs.45033 

Hs.49476 

Hs.4980 

Hs.172510 

Hs.530 

Hs.53447 

Hs.61260 

Hs.55608 

Hs.300842 

Hs.64313 

Hs.65424 

Hs.65551 

Hs.66 

Hs.94896 

Hs.6786 



Hs.7239 

Hs.7306 

Hs.284180 

Hs.74602 

Hs.75415 

Hs.75572 

Hs.222566 

Hs.78061 

Hs.78146 

Hs.334841 

Hs.8022 

Hs.301914 

Hs.7994 



WAA1 102 protein 
intsffeuJdn 7 receptor 
ESTs 

Human cytochrome P450-UB (W1B3) mRNA; 

ESTs; HigWy similar to CGJ-38 protein [ 

daudin 5 (transmembrane protein deleted 

vasoactive intesana! papBri e receptor 1 

owJuctal glycoprotein 1; 120kD 

ESTs; Weakly similar to collagen alpha 1 

EST .900 

decay acceterafing factor for complement 

annexinA3 

defensin; alpha 1; myeloio^related seque 

UDP-GatrbetaGScNAc beta 1;3^a!actosy1tr 

retinitis pigmentosa GTPase regulator 

MAD {molhers against decapentaplegic; Dr 

DKFZP586G121 9 protein 

D component of compfemanl (adipsin) 

ESTs 

ESTs 6-00 
adenomatous polyposis coil Eke 
WAA0758 protein 

K3AA1 102 protein 2&00 
aquaporin4 41.20 
ESTs; Moderately similar to WAA0273 [H. 
ESTs 

ESTs; Weakly similar to zinc finger prot 
ESTs; WeaWy similar to p160 myb-binding 
potassium voltage-gated channel; subfami 
OKFZP5B6D0S24 protein 
ESTs 

superoxide cfismutase 3; exCraceflufar 
ESTs 
ESTs 

alcohol dehydrogenase 3 {class 0; gamma 

ESTs; Moderately similar to putative sev 

DKFZP564D206 protein 8.00 

ESTs 10.00 

butyrate response factor 2 (EGF-response 28.80 

Homo sapiens mRNA; cDNA OKFZp56401763 (f 

ESTs; Highly similar to semaphorin via [ 39.00 

ESTs 

ESTs; Weakly simBar to dual specificity 
ESTs 
ESTs 

ESTs 45.80 
calcium channel; voltage-dependent; beta 
C097 antigen 
ESTs 

ESTs; toderatety similar to CaM-KU inhi 
ESTs 

KIAA0533 protein 

K1AA0929 protein Msx2 Interacting nuclea 54.00 
ESTs 
ESTs 
ESTs 

lacrimal proline rich protein 

Homo sapiens clone TUAB Cri-du-chat regi 34.20 
UM binding domain 2 
ESTs 13.00 
collagen; type IV; alpha 3 (Goodpasture 
ESTs; Moderately similar to kinesln Bgh 
ESTs 

ESTs; WeaWy similar to cDNA EST yk484g1 
ESTs 
ESTs 

tetranectin (plasminogen-binding protein 
ESTs ■ 

intorfeukin 1 recepMke 1 
ESTs 
ESTs 
ESTs 

SEC24 (S. cerevisiae) related gene f amB 41 .20 

secreted frizzled-related protein 1 50.20 
protocadherin 2 (cadherin-like 2) 
aquaporin 1 (channel-forming integral pr 
Accession not listed to Gen bank 
carboxypeptidase B2 (plasma) 
ESTs 

banscripSon factor 21 
plateiet/endotheiial cell adhesion motec 
selenium binding protein 1 
TU3A protein 

Homo sapiens mRNA; cDMA DKFZp586K1 220 (f 
ESTs; Weakfy simflar to CGJ-69 protein ( 



PCT/US02/12476 

4.09 



2.29 



293 



272 
254 



16a 80 

87.80 
44.60 

4220 

51.60 



10.00 



130 



3.40 



4.05 
4.09 



3.16 
4.77 
3.54 



31.40 
29.60 



9.00 



298 
115 



3220 



3.14 
180 



6.40 



411 
3.50 

186 
167 

144 



4.03 



11.00 
10.00 



247 
228 

287 



268 
4.66 



266 



237 
227 
263 

6.20 



105 



40.60 



49.20 
34.80 



30.60 



178 
167 



3.48 



11.43 



31.40 



9.00 



265 
292 
4.49 



90.80 



40.80 



4.02 
118 



169 



172 



3.05 



5.49 



135 



145 
4.05 



127 



100 



WO 02/086443 

134641 AI092634 Hs.155114 

134677 AA251363 Hs. 1777 11 

134745 NM.000685 Hs.89472 

134749 T28499 Hs.89485 

134786 T29618 

134825 U33749 

134978 AT82900B 

135010 N50465 

135053 AW798190 Hs.93678 

135081 AF069517 Hs.173993 

135091 AA493650 

135135 M775910 

135203 C15737 

135236 AI636208 

135266 R41179 

135346 NM.000928 Hs.992 

135378 AW961818 Hs.24379 

1353B7 NM.001972 Hs.99853 

135388 W27965 Hs.99865 

135402 L12398 Hs.99922 



Hs.89540 
Hs.197764 
Hs.333383 
Hs.92927 



Hs.94367 

Hs.95011 

Hs.269386 

Hs.96901 

Hs.97393 



protein tyrosine phosphatase; non-recept 
ESTs 

angiotensin receptor 1B 

carbonic anhydrase IV 

angiopoisOn 1 receptor; TEK tyrosine Id 

thyroid transcription factor 1 

ficoGn (coflagen/Sbrinogen domanwont 

ESTs 

ESTs 

RNA binding motif protein 6 
ESTs 

syntropHn; beta 1 (dystraphin-assocfeta 

ESTs 

ESTs 

Human mRNA for K1AA0328 gene; partial cd 
phosphoCpase A2; group IB (pancreas) 
potassium voltage-gated channel; shaker- 
etastase 2; neutrophil 
EST 

dopamine receptor 04 



15.00 



105 



2.52 



3120 



57.80 



31.60 



2B.80 



43.00 



37.20 
38.80 



8.00 



182 
4.15 



PCT/US02/12476 

176 



173 



121 



431 



4.21 



4.24. 



6.42 , 



TABLE 2B shows the accession numbers for those primekeys lacking unigendtys for Table 2A. For each probeset we have listed the gene duster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Toots (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the . 
"Accession* column. j 

Pkey. Unique Eos probeset identifier 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 

108447 

108550 

108655 

102397 

126303 

125810 

103627 

121366 

114609 

115272 

10833B 

108434 

123802 

102310 

102636 1 

104776 

120504 

113502 

108499 

101308 

108629 

103098 

103241 

103508 

103575 

119514 

121082 

128634 

105817 

121516 

114449 

114648 

121950 

107723 



CAT number Accessions 

43452.-7 AA079126 

120073.1 AA084867AA084996 

12752^1 AA099960AA1 13013 

44371.-1 U41898 

1525933J D78841 D78BB0 

1554054.1 H0Q083 R81062 

2615_2 Z48513 248512 

280401 1 AI743515AA405517AW2767C5 

116777 1 AA079505AA079537 

172113 1 AWD15947 AA21 1890 AA279425 

112186 1 AA070773AA07Q774 

114012.1 AA078899AA078782 M075788 

genbarMA620448 AA620448 

NOT FOUND entre*JJ33839 U33839 

entrez_U67092 U67092 

genbanJu\A026349 AA026349 

genbanK_AA256837 AA256837 

ganbanK.T89130T89130 

genbankuAA083103 AA083103 

entre*_L41390 L41390 

genbanK_AA102425 AA102425 

221 215 M86361 Z26593 X02850 D13070 AE000659 M17649 M878S9 M87871 X61077 M16285 AF018169 X61079 S59351 X60142 AF043169 
entreOC76223 X76223 
entrez_Y10U1 Y10141 
entrez.226256 Z26256 
NOT.FOUND_entrer.W37937 W37937 
genbank_M398722 AA398722 
AA464918 at AA464918 



genbanlLAA397825 
genbank J AA412155 
genbank L ^A020736 
genbanK_AA101056 
genbank.AA4295l5 
genbankJ^A015957 



AA397825 
M412155 
AA020736 
AA101056 
M429515 
AA015967 



101 
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Table 3A shows 452 genes up-regulated in chronically diseased tung relative to normal lung. Cfacnicaffy cflseased lung samples represent chronic norwriaB^t (ung tffee^es 
such as fibrosis, emphysema, and branchife. These genes were selected from 59680 probesets on the Ecs/Anymetrix Hu03 Genechip array, Gene expression data for each 
probeset obtained from (his analysis was expressed as average intensity (AI), a ncrmalized value reflecting the relative level of mRNA expression. 



Pkey: 

ExAccn: 

UnigenelD: 

Unigene Title: 

R1: 

R2: 

R3: 



, a Bos probeset idenfifier number 
Exemplar Accession number, Genbank accession number 
Unigene number 
Unigene gene title 

80th percentile of AJ for chronically diseased lung samples divided by the 90th percenBe of AJ for norma! lung samples. 
80th percentileof AJ for chronically diseased lung sanples divided by the 9M 

TOth f SSS AI for ethically diseased hmg samples minus the 15th percenb-le of AI for all normal lung! chronically diseased lung and tumor samples 
divided by the 90!h percentile of normal lung samples, squamous cell carcinomas and adenocarcinomas minus the 15th percentile of AI for all normal lung, 
chronically diseased lung and tumor samples 



Pkey 


ExAccn 


unigeneiu 


135423 


U50531 


rts.1 Jo/01 


135378 


AW951B18 


HS-z4o/y 


135346 


Mil nnnmo 


Ue> OQO 


135235 


AW298244 




135057 


U90268 


Hs.93810 


134951 


BE305O81 


Hs.ioyjoo 


134799 


M3S821 


11 _ on con 


13478S 


T29618 


Hs.89b40 


134772 


NM_0Q0829 


Hs. 153597 


134752 


BE246762 


Hs.89499 


134749 


T28499 


Hs.89485 


134696 


BE326276 


Hs.6361 


134636 


NMJJ05582 


Hs.87205 


134627 


A1018758 


Hs. 12482 


134622 


A t A /m n A eft 

AW975159 




134570 


U66615 


iu 47000/) 


134561 


U76421 


HS.OOOUi 


134468 


NM.001772 


Hs.o3r31 


134417 


Lift J rm£A4£: 

NM_006416 


U«. DOOOI 

Hs.tizyZl 


134343 


D50583 




134323 


BE 170651 


Hs.870Q 


134300 


kit i Milan 

NM_001430 


Hs.8136 


134299 


AW580939 


Hs.sj/ iyy 


134253 


X52075 


HS.tSUrJo 


134182 


D52059 


Uf> 7070 


133985 


134657 


nS.rol*KJ 


133978 


ArOJorio 


ns.r ouoi 


133335 


AIC77QQ7 


He 7fifidfl 
MS./ DOW 


133651 


AI3UU4U 


He 170JJfl1 


133633 




He 7<v-lT7 
nS.f ooor 


133565 


AwyoD//D 


He 11 T^nn 




AWQZfiORA 


Ue 178112 




AAOOCOQC 


Hs.74120 


49QA7Q 
1004f B 


AOO/Ui} 


Hs.31432 






He 793676 


IJOlUU 


A on? 77 15 


Hs 183639 


1 JO 10 J 




Hs 66170 


10010.0 


AH9RRflfi 

AI HODUO 


Hs.6557 




NM nM77fl 


Hs.65424 






Hs.1 69449 


132336 


AB023177 


Hs.29900 


132799 


W73311 


Hs.169407 


132742 


AA025480 


Hs.292812 


132548 


X12830 


Hs.193400 


132476 


AL1 19844 


Hs.49476 


132439 


AK001942 


Hs.4863 


132240 


AB018324 


Hs.42676 


132210 


NM.007203 


Hs.42322 


132199 


AL041299 


Hs.165084 


131751 


T96555 


Hs.31562 


131745 


A1828559 


Hs^1447 


131694 


NM_000246 


Hs.3076 


131686 


N.VL012295 


Hs.30687 


131676 


AI126821 


Hs.30514 


131629 


Z45794 


Ks.238809 


131589 


C18825 


Hs^9191 


131536 


AA019201 


Hs.269210 


131517 


AB037789 


Hs.263395 


131355 


R52804 


Hs.25956 


131253 


R71602 


Hs^4B53 


131207 


AF104266 


Hs.24212 


131155 


A1472209 


Hs.323117 


131066 


AW169287 


Hs.22588 


131061 


N64328 


Hs^68744 


131053 


AA348541 


Hs.296261 


130895 


AA641767 


Hs.21015 


130762 


D84371 


Hs.1898 



Human BRCA2 region, mRNA sequence CG030 
MUM2 prolan 

phosphoKpase A2, group IB (pancreas) 
ESTs 

cerebral cavernous malfarm aliens 1 
hypothetical protein 
GR03 oncogene 

TEK tyrosine kinase, endothelial {venous 
glutamate receptor, lonotrophic, AMPA 4 
arachid onate 5-Jipoxygenase 
carbonic anhydrase IV 
ESTs 

lymphocyte anBgen 64 (mouse) homoJog, r 
pjyceronephosphate Oacy I transferase 
ESTs, Weakly similar to A55380 factogenl 
SWi/SNF related, matrix associated, acfi 
adenosine deaminase, RNA-specific, B1 (h 
C033 antigen (gp67) 
solute carrier family 35 (CMP-siaGc aci 
{rans forming growth factor, beta recepto 
deleted in fiver cancer 1 
endothelial PAS domain protein 1 
complement component C1 q receptor 
siatophorfn (gpL1 15, teukosi'afln, CD43) 
K1AA0871 protein 

platelet/endothelial cell adhesion motec 

transcription factor 21 

RGC32 protein 

aThydrop^riiTiidinase-rike 2 

nucleolar and coBed-body phosphprotein 

ESTs, Moderately similar to ALU7.HUMAN A 

DNA segment, single copy probe LNS-CAI/L 

adipose specific 2 

cardiac ankyrin repeat protein 

ESTs 

hypothetical protein FU 10210 

HSKM-B protein 

zinc finger protein 161 

tetranectin (ptesminogen-Wndlng protein 

protein kinase C, alpha 

KIAA0960 protein 

SAC2 (suppressor of acBn mutations 2, 
ESTs, Weakly similar to T33468 hypothetj 
interteukin 6 receptor 

Homo sapiens clone TUA8 Crf-du-chat regi 

hypothetical protein DKFZp566Al524 

K1AA0781 protein 

A kinase (PRKA) anchor protein 2 

ESTs 

ESTs 

ESTs, Moderately similar to A46010 X-H 
MHC class II transactrvator 
GRB2-associaied binding protein 2 
ESTs 
ESTs 

epithelial membrane protein 2 
ESTs 

sema domain, transmembrane domain (TM), 

DKFZP564D206 protein 

ESTs 

latropMin 

ESTs 

ESTs 

K1AA1796 protein 

guanine nudeofide binding protein (Gpr 
hyoothefxal protein DKFZp564L0864 slma 
;1 



R1 

1Z40 



12.40 
11.67 



25.80 

13.60 
1320 



20.60 
12.20 



15.20 



30.60 
22.60 

13.80 

41.60 
40.40 



2U0 
15.20 
27.80 

21.40 
15.00 



16.60 
12.00 



R2 



aoo 

8.20 



R3 
2.13 



6.20 



1.93 
2.07 



1.92 
1.92 

1.78 



9.60 



1.77 
2.08 
1.77 



7.20 
4.76 



4.00 
620 



a4o 

3.59 
4.48 



3.54 



1.88 
1.99 
1.76 



1.75 
1.84 



1.93 



102 



.5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85. 



WO 02/086443 

130657 AW337575 . Hs^01591 ESTs 

130555 ■ AIS31952 Hs. 17409 cysteimwich protein 1 (intestinal) 

130589 All 10226 Hs.1 6441 " DKFZP434H2W protein 

130562 D50402 Hs.1 82611 solute earner fam3y 11 (proton-coupled 

130555 R59743 Hs.1 16774 integrm, alpha 1 

130365 W56119 Hs.1 55103 eukaryotic translation toiflation factor 

130273 AW972422 Hs.1 53853 MAD (mothers against decapentapleglc, Dr 

130259 NM_000328 Hs.1 53514 reSnfos pigmentosa GTPase regulator 

130090 H9787B Hs.132390. zinc finger protein 36 (KOX 18) 

129958 R27496 Hs.1378 ." annexinA3 

129898 A1672731 Hs.13256 ESTs 

129875 AA181018 Hs.13055 hypothetical protein RJ13920 

129699 'AB007899 Hs.12017 homoiog of yeast ubiquifin-crotein tigas 

129626 F13272 HsJl 11334 ferritin, fight porypepiida 

129598 N30436 Hs.11556 Homo sapiens cDNAFU 125666s, clone NT 

129593 AI338247 Hs.98314 Homo sapiens mRNA; cDNA DKFZp586L0120 (f 

129565 X77777 Hs.198726 vasoactive intestinal peptide receptor 1 

129527 AA769221 Hs.270847 delta-tubuBn 

129402 W72082 . Hs.11112 . ESTs 

129385 AA172106 • Hs.110950 Rag C protein 

129315 NM_014553 Hs.174038 spondyloepiphysea! dysplasia, late 

129312 T97579 Hs.110334 ESTs, Weakly similar to 178885 serine/lh 



PCT/US02/12476 



129240 AA35125B 

129210 AL039940 

129122 AW958473 

129057 N90866. 

128946 Y13153 

128793 AF015525 

128789 AW3S8576 . 

128778 AA504776 

128766 AW160432 

128631 R44238' 

128624 BE154765 

128609 NM.003616 

128603 NM_004915 

128598 AA305407 

128458 H55864 

128061 AF150882 

127968 AA830201 

127959 A1302471 

127944 AI557081 . 

127925 AA805151 

127896 AI669586 

127859 AA761802 

127817 AA836641 

127742 AW293496 

127628 AI240102 

127609 X80031 

127582 AA908954 . 

127543 AK000787 

127535 AA568424 

127404 AJ379920 

127396 L31968 

127374 AA442797 

127346 AA203616 

127340 BE047653 

127307 AW962712 

127242 AW390395 

127167 AA625690 

127046 AA321948 

126928 AA480902 

126900 AF137386 

126852 AA399961 



Hs.237868 
Hs.202949 
Hs.301957 
Hs.276770 
Hs.107318 
Hs.302043 
Hs.139851 
Hs.186709 
Hs.296460 
Hs.155546 
Hs.102647 
Hs.102456 
Hs.10237 
Hs.102308 
Hs.56340 
Hs.186877 
Hs.124347 
Hs.124292 
Hs.262476 
Hs.3628 
Hs.222194 
Hs.291559 
Hs.1 63085 
Hs.180138 
Hs.322430 
Hs.530 
Hs.130844 
Hs.1 57392 
Hs.164450 
Hs.270224 
Hs.187991 
Hs312110 
Hs.44896 
Hs.1 19183 
Hs.126712 
H&181301 
Hs.1 90272 
HsJ293968 
Hs.137401 
Hs.12701 



interleukin 7 receptor 
K1AA1 102 protein 
nudix (nucleoside diphosphate linked moi 
COW52 antigen (CAMPATH-1 anfigen) 
kynurenlne 3-monooxygenase (kynurenine 3 
chemoWne (C-C motif) receptor-like 2 
caveofin 2 

ESTs, Weakly similar to 138022 hypothet 
cranio facia} development protein 1 
KIAA1080 protein; Golgi-associaterJ, gamm 
ESTs, Weakly similar to TRHY.HUMAM TRiCH 
survival of motor neuron protein interac 
ATP-Wnding cassette, sub-family G (WHIT 
potassium inwardly-rectifying channel, s 
ESTs 



plasmolipm 

gb2u68c0ij1 Soares Jestis.NHT Homo sap 



11.50 
21.20 
18.60 
22.63 

39.20 

15.20 
12.40 
20.63 



1Z20 
26.40 



16.00 
1i80 



sodium channel, voltage-gated, type XII, 


1770 


ESTs 


21.30 


Homo sapiens cDNA: FU23123 fJs, done L 


10.60 


S-adenosytmelhionine decarboxylase 1 


rra'togen-acuvaied protein kinase kinase 


13.40 


ESTs 




ESTs 


14.00 


ESTs 


14.00 


ESTs 


11.00 


NDRGfamfly, member 4 


11.10 


collagen, type IV, alpha 3 (Goodpasture 




ESTs 


19.60 


Homo sapiens cONA FU2Q780 ffs, done CO 


15.40 


ESTs 


17.50 


ESTs 


14.60 


DKFZP564A122 protein 


15.40 


ESTs, Weakly similar to 138022 hypothet 


14.60 


OnaJ (Hsp40) homoiog, subfamily B, membe 


21.00 


ESTs, Weakly similar to ZN91.HU WAN ZINC 


15.80 


ESTs, Weakly similar to AF 1 91 020 1 E2IG5 




cathepshS 


22.60 


ESTs 


21.40 


ESTs 


41.20 


ESTs 


11.00 



126816 


AA248234 




gb:csg2228.seq.F Human fetal heart, Lamb 


12.20 


126812 


AB037860 


Hs.173933 


• nudear factor l/A 


17.19 


126666 


AA648886 


Hs.151999 


ESTs 


13.57 


125645 


AA316181 


Hs.61635 


sbc transmembrane epithelial antigen of 


15.40 


126592 


AI611153 


Hs.6093 


Homo sapiens cDNA FU22783 6s, clone K 




126556 


AF255303 


Hs.1 12227 


membrane-associated nucleic add binding 


18.00 


126433 


AA325606 




gb:EST28707 Cerebellum II Homo sapiens c 


16.77 


126299 


AW979155 


Hs.298275 


amino acid transporter 2 


14.60 


126218 


AL049301 


Hs.13649 


Novel human gene mapping to chomosome 13 




126182 


M721331 


Hs.293771 


ESTs 


13.40 


126177 


AW752782 


Hs.129750 


hypothetical protein FU10546 


18.20 


126142 


H86261 


Hs.40568 


ESTs 


' 14.00 


126077 


M78772 


Hs^10836 


ESTs 


16.59 


125994 


AI990529 


Hs^70799 


ESTs 


17.40 


125934 


AA193325 


Hs.32646 


hypothetical protein FU21901 


13.00 


125847 


AW161885 


Hs.249034 


ESTs 


49.57 
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104776 


AA026349 




gb:zj99f01.s1 SoaresjJregnanLuterus^NbH 




104691 


U29690 


Hs.37744 


Homo sapiens beta-1 adrenergic receptor 




104667 


A1239923 


Hs.30098 


ESTs 




104404 


H58762 




gb:EST00057 HE6W Homo sapiens cDNA clone 




104392 


AA076049 


Hs.274415 


Homo sapiens cONA FU 10229 fis, clone HE 


27.20 


104212 


AB002298 


Hs.173035 


K1AA0300 protein 




104074 


AL162039 


Hs.31422 


Homo sapiens mRNA; cDNA DKFZp434M229 (fr 


11.20 


103749 


AL135301 


Hs.8768 


hypothetical protein FU10849 


10.86 


103645 


AW246253 


Hs.7043 


sucdnaie-CoA Ggase, GDP-rorming, alpha 


12.00 


103554' 


AJB7S826 


Hs.323469 


caveoHn l.caveoiae protein, 22kD 




103541 


AI815601 


Hs.79197 


C083 antigen (activated B tymphocytes, I 




103496 


Y09257 


Hs.132821 


flavin containing monooxygenase 2 




103428 


BE383507 


Hs.78921 


A kinase (PRKA) anchor protein 1 


11.20 


103353 


X39399 


Hs.119274 


RAS p21 protein acfjvatar (GTPase acfiva 


19.80 



17.20 
9.60 



6.00 



191 



1.89 



1.83 



3.60 
11.40 
4.76 



7.13 
7.00 



29.80 
3.70 



8.30 
8,09 

5.40 
7.60 



10.20 
5.69 
3.82 
4.20 



1.76 



2.05 



240 
1.78 
1.76 
Z19 



1.94 
1.75 
2.47 



1.92 



1.87 
1.93 



1.91 



1.80 
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103295 X81479 Hs.2375 

103280 U84722 Hs.76206 

103100 MMJXJ5574 Hs.184585 

103025 NMJJ02837 Hs.1 2364 1 

102698 M18657 Hs.1867 

102659 BE245169 H&211610 

102580 U 50303 Hs.152981 

102417 AA034127 Hs.153487 

102353 NM.003734 Hs.198241 

102302 AA306342 Hs.69171 

102283 AW161552 Hs.83381 

102188 U20350 Hs.78913 

102151 T27013 Hs.3132 

101957 L28824 Hs.74101 

101842 M93221 Hs.75182 

101771 NM_002432 Hs.153837 

101764 A1198550 Hs,81256 

101716 AF050558 Hs.2563 

101678 M62505 Hs^161 

101447 M21305 

101383 NM.000132 Hs.79345 

101346 AI738616 Hs.77348 

101345 NNL005795 Hs.152175 

101336 NMJJ06732 Hs.75678 

101330 L43821 Hs.80261 

101277 BE297626 Hs.296049 

101262 135854 

101168 NMJJ05308 Hs.211569 

101102 NM_003243 Hs.79059 

101088 X70697 Hs.553 

101066 AW970254 Hs.889 

100971 BE379727 Hs.83213 

100893 BE245294 Hs.1 80789 

100770 W25797.comp Hs.1 77488 

100716 X89887 Hs.172350 

100555 M69181 

100425 NMJM4747 -Hs.78748 

100408 D86640 Ks.56045 

100382 D83407 Hs.156007 

100351 D64158 

100299 D49493 Hs.2171 

100134 AA305746 Hs.49 

100108 U09577 Hs.76873 

100095 Z97171 Hs.7B454 
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egf-Gke module containing, muctn-fika, 
cadherin 5, type 2, VE-cadherin (vascula 
UM domain only 2 (rhomboSn-Hce 1) 
protein tyrosine phosphatase, receptor t 
prbgasrncsin (pepsinogen C) 
OJG triplet repeat, RNA-binding protein 
COP-diacyigiyceroi synthase (phosphatida 



amine oxidase, copper containing 3 (vase 
protein kinase Office 2 
guanine nucleotide binding protein 11 
chemoldne (C-X3-C) receptor 1 
steroidogenic acute regulatory protein 
spleen tyrosine kinase 
mannose receptor, C type 1 
myeloid eel nudear differentiation ant 
S100caldum-binding protein A4 (calcium 
tachykinin, precursor 1 (substance K. su 
complement component 5 receptor 1 (C5a 1 
gbifrman alpha satellite and satellite 3 
coagulation factor VIII, procoagulant co 



calcitonin receptor-flke 
FBJ murine osteosarcoma vira) oncogene h 
enhancer of fi lamentation 1 (cas-nke do 
microfibriUar-associatHd protein 4 
gbiHuman dystrophin (dp140) mRNA, 5* end 
G protein-coupled receptor kinase 5 
transforming growth factor, beta recepto 
solute carrier family 6 (neurotransmitte 
Charot-Leyden crystal protein 
fatty acid binding protein 4, adipocyte 
S164 protein. 

amyloid beta (A4) precursor protein (pro 
HIR (histone cell cycle regulation defec 
gb:Human nonmuscte myosin heavy chain-B 
K1AA0237 gene product 
sre homology three (SH3) and cysteine ri 
Down syndrome critical region gene 1-Bk 

growth differentiation factor 10 
macrophage scavenger receptor 1 
hyaluronoglucosaminidase 2 
myocilin, trabecular meshwork inducible 



1.76 
2.15 



11.00 
25.40 
14.00 

10.86 



16.40 
15.40 



18.80 
504.80 



19.00 



19.38 

15.40 
11.20 
14.80 
33.00 
16.20 



7.40 



31.00 



7.52 



1.78 
2.22 

1.75 
224 

2.01 

1.91 



11.29 



4.00 
4.24 
6.20 
21.20 



&40 



1.79 



TABLE 3B shows the accession numbers for those primekeys lacking unjgenelD's for Table 3A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were ctatered based on sequence 
srraTaray using Catering and Alignment Tools (DoubleTwist Oakland CaBfomia). The Genbank accession numbers for sequences composing each duster are listed In the 
"Accession' column. 



Pkey: Unique Eos probeset identifier number 
CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 



123619 3716B1J 

126433 127143J 

125831 1522905J 

126816 122973J 

126852 136135J 

121059 273450J 

120637 2Q0885J 

122011 7617^-2 

120934 177521J 

123802 genbanRJVA620448 

116814 genbanlLH50834 

118329 genbanleN63520 

104404 H58762_at H58762 

104776 genban»LAA025349 

113502 genbank_T89130TB9130 

101262 entrez_L35854 U5854 

108573 genbankuAATJ86005 

101447 entre*_M21305 M21305 

124357 genbank_N22401 

108781 genbank_AA128654 

112794 genbank_R97018 

100351 entrezj>64158 D64158 

100555 6gr_HT2245 M691B1 M81105U51039 



AA602964AA609200 
AA325606 AA099517 N89423 
H04043 D60988 D60337 
AA248234AA090985 
AA399961 AA128347 
AA393283AA39B528 

AA81 1 804 AA809404 AA286907 AW977624 
AA431082 

AA226198AA226513AA383773 
AA620448 
H50834 
KS3520 



AA026349 



AA086005 

N22401 

AA128654 

R97018 
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Table 4Asho»s 202 geiusup^ula^tosarapte^ 

(he ralaSvB level of mRNA expression. 

Ptey: Unique Eos probeset identifier number 
ExAcac . Exemplar Accession nunber.Genbank accession number 
UifigenelD: Unigene number 
Umgene TifieUnigene gene BSe 
R1: average of A) for samples from 



pafients treated with chemolherapy or radlofherapy diviited by the average ofAl for nonnal lung samjies. 



Ptey 


ExAccn 


UmgenelD 


100113 


NM.D01269 


Hs.84746 


100187 


D17793 


Hs.78183 


100210 


026361 


Hs.3104 


100225 


D28539 


Hs.167185 


100269 


NM-001949 


Hs.1189 


100438 


AA013051 


Hs.91417 


100877 


X80821 


Hs.27973 


100893 


BE245294 


Hs.1 80789 


101273 


211933 


Hs.182505 


101447 


M21305 




101649 


AW959908 


Hs.1 690 


101724 


L11690 


Hs.620 


101748 


NMJM1944 


Hs.1925 


101809 


W86849 


Hs.323733 


101B79 


M176374 


Hs.243886 ' 


101915 


AF207881 


Hs.1 551 85 


101973 


U41514 


Hs.80120 


102025 


LKM045 


Hs.78934 


102031 


U04898 


Hs.2156 


102052 


NM.002202 


Hs.505 


102391 


AA296874 


Hs.77494 


102420 


U44060 


Hs.14427 


102610 


U65011 


Hs.30743 


102829 


NIOQ5183 


Hs.80962 


103000 


NMJJ01975 


Hs.146580 


103036 


M13509 


Hs.83169 


103507 


AJ000512 


Hs.296323 


103587 


BE270266 


Hs.82128 


104660 


BE298665 


Hs.14846 


104896 


AW015318 


Hs.23165 


105038 


AW503733 


Hs.9414 


105298 


BE387790 


Hs.26369 


105510 


Z42047 


Hs.283978 


105667 


AA767526 


Hs.22030 


105073 


AL157441 


Hs.17834 


106205 


AW965058 


Hs.111583 


106516 


AL137311 


Hs.234074 


106533 


AL1 34708 


Hs.145998 


106575 


AW970602 


Hs.105421 


106654 


AW075485 


Hs.286049 


106851 


AM58623 




106995 


AB023139 


Hs.37892 


107332 


TB7750 


Hs.1 83297 


107532 


AA443473 


Hs.1 73684 


107922 


BE153855 


Hs.61460 


108609 


BE409857 


1 1_ rn Ann 

Hs.69499 


108760 


AU076442 


Hs.1 17938 


109166 


AA219691 


Hs.73625 


109260 


AW978515 


Hs.1 31 9 io 


109280 


AK001355 


HS.ZfSOlU 


109292 


AW975745 


11. «Oftfifi9 

Hs.iooooZ 


109384 


AA219172 


nS.ooo4y 


imwio 


1 IRr>7^R 
UOU# 00 


Hs 110826 

PIC). 1 lUUtU 


109445 


AA232103 


Hs.189915 


109502 


AW967069 


Hs.211556 


109633 


AW003785 


Hs.170267 


109786 


A1989482 


Hs.146286 


109958 


AA001266 


Hs.133521 


110920 


N47224 


Hs.20521 


110924 


AW058463 


Hs.12940 


111084 


H44186 


Hs.15456 


111132 


AB037807 


Hs.83293 


111229 


AW389845 


Hs.110855 


111337 


AAB37396 


Hs^63925 


111987 


NMJH5310 


Hs.6763 


112046 


AA383343 


Hs.22116 


112268 


W39609 


Hs.22003 


112685 


RB7650 


Hs.33439 


112871 


AL1 10216 


Hs.12265 


112897 


AW205453 


Hs.3782 


112973 


AB033023 


Ks.318127 


112992 


AL157425 


Hs.133315 


113073 


N39342 


Hs.103042 



chromosome condensation 1 
eJdo-keto reductase family 1, member C3 
K1AA0042 gene product 
ghitamate receptor, metabotroptc 5 
E2F transcription factor 3 
topoisomerase (DNA) II binding protein 
WAA0874 protein 
S1 64 protein 

POU domain, class 3, transcription facto 
gbiHuman alpha satellite and satellite 3 
heparin-cmding growth factor binding pr 
bullous pemphigoid antigen 1 (230/240kD) 
desmoglein 3 (pemphigus vulgaris anlrgen 
gap junction protein, beta 2, 26XD (conn 
nuclear autoantigenic sperm protein (his 
cytosoOc ovarian carcinoma antigen 1 
UDP^^ryl-alpha^alactosanMne:p<)ryp 
mutS (E coli) homotog 2 (coton cancer, 
RAR-related orphan receptor A 
ISL1 transcription factor, UM/homeodama 
deoxyguanosine kinase 
Homo sapiens cDNA: FU21800 lis, clone H 
preferentiaily expressed antigen In mela 
neurotensin 

enoiase 2, (gamma, neuronal) 

matrix metalloproteinase 1 (interstitial 

serum/glucocorticoa regulated kinase 

5T4 oncofetal trophoblast glycoprotein 

Homo sapiens mRNA; cDNA DKFZp564D016(fr 

ESTs 

KIM1488 protein 

hypothetical protein FU20237 

Homo sapiens PR02751 mRNA, complete cds 

paired box gene 5 (B-cefl lineage specif 

downstream neighbor of SON 

ESTs, Weakly similar to 138022 hypothefi 

Homo sapiens mRNA; cDNA DKFZp761G02121 ( 

ESTs 

ESTs 

phosphoserine aminotransferase 
gb:tk04g09jc1 NCI_CGAP_Uj24 Homo sapiens 
KIAA0922 protein 
DKFZP566F21 24 protein 
Homo sapiens mRNA; cDNA DKFZp762G207 (fr 
Ig superfemily receptor LN1R 
hypothetical protein 
coflagen, type XVII, alpha 1 
RAB6 interacting, kinesWfte (rabkines 
K1AAQ8S3 protein 
hypothetical protein FU10493 
KIAA1702 protein 
ESTs 

trinucleotide repeat containing 9 
ESTs 

hypothetical protein MGC5487 
ESTs 

kinesin family member 13A 
ESTs 

HMT1 (hnRNP nte ^transferase, S. cerevi 
zinc-fingers and homeoboxes 1 
PD2 domain containing 1 
hypothetical protein 
ESTs 

USI-interacflng protein NUDE1, rat homo 
WAA0942 protein 

CDC1 4 (cefl division cycle 14, S. cerevi 
solute carrier family 6 (neumtransmitte 
ESTs, Weakly similar to ALU1JWMAN ALU 
ESTs, Weakly similar to 155214 salivary 
ESTs 

hypotheBcaf protein FU10201 

Homo sapiens mRNA; cONA OKFZp761J1324 (f 

microtiibula-assoctated protein 1B 



R1 

27.20 

20.60 

20.40 

20.60 

29.40 

23.50 

35.56 

43.40 

21.80 

193.60 

38.40 

198.80 

78.60 

162.20 

50.00 

26.00 

37.20 

32.00 

51.20 

13.90 

28.80 

110.60 

116.80 

2.30 

181.40 

49.20 

86.60 

42.60 

29.40 

21.50 

32.80 

20.20 

28.40 

25.40 

32.00 

40.60 

59.80 

43.40 

50.80 

53.40 

20.88 

23.60 

57.20 

49.00 

19.67 

48.17 

59.20 

26.60 

22.60 

21.00 
31.60 
24.20 
21.40 
20.40 
19.60 
24.00 
28.40 
36.00 
61.20 
24.60 
2720 
48.00 
37.80 
26.80 
63.80 
26.40 
47.64 
22.00 
65.00 
42.00 
55.40 
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113434 


T91451 


Hs.86538 


* 

ESTs 


2260 


113560 


T91015 


Hs268626 


ESTs 


22.80 


113849 


AM57211 


Hs.8858 


brorraxJomain adjxent to zinc finger doma 


• 51.80 


113950 


A1267652 


Hs.30504 


Homo sapiens mRNA; cDNA DKFZp434E082 {fr 


. 2820 


114339 


M782845 


Hs.22790 


ESTs 


2020 


114365 


H42169 


Hs. 18653 


hypothetical protein RJ 14627 


21.00 


114455 


H37908 


Hs.271616 


ESTs, Weakly similar to ALU8_HUMAN ALU S 


25.80 


114518 


AW1 63267 


Hs. 105469 


suppressor of vail (Sxerevisiae) 3-fite 


23.60 


114824 


AA960951 


Hs.305953 


zinc finger protein 83 (HPF1 ) 


2720 


114837 


BE244930 


Hs.166895 


ESTs 


3020 


114974 


AW966931 


Hs.179662 


nucteosome assembly protein 1-Cke 1 


20.80 


115075 


AA814043 


Hs£8045 


ESTs 


30.60 


115084 


BE383668 


Hs.42484 


hypothetical protein FU10618 


28.86 


115291 


BE545072 


Hs.1 22579 


hypothetical protein RJ10461 


38.00 


115313 


AA808001 


Hs.1 84411 


albumin 


22.60 


115697 


D31382 


Hs.63325 


transmembrane protease, serine 4 


173.60 


115909 


AW872527 


Hs59761 


ESTs, WeaHy similar to DAP1_HUMAN DEATH 


27.77 


116090 


AI591147 


Hs.61232 


ESTs • 


2030 


116107 


AL133916 


Hs, 172572 


hypothetical protein RJ20093 


164.20 


116399 


AA889120 


Hs.1 10637 


homeoboxA10 


38.00 


117099 


H93599 




gb:yv16a11.s1 Soares fetal liver spleen 


21.60 


117881 


AF1 61470 


Hs.260622 


buryrate-induced transcript 1 


49.40 


118091 


AW005054 


Hs.47883 


ESTs, Weakly similar to KCC1 HUMAN CALO 


22.40 


118138 


AA374756 


Hs.93560 


Homo sapiens mRNA for KIAA1771 protein. 


22.00 


118720 


N73515 




gb:za49d07.5l Soares fetal liver spleen 


20.00 


118873 


AIB24009 


Hs 44577 


ESTs 


19.40 


119126 


R45175 


Hs.117183 

n»»i if l w 


ESTs 


111.20 


119717 


AA918317 


Hs 57987 


B-cef( CLUymphoma 11B(z/nc finger pro 


33.00 


119940 


AL050097 


Hs.272531 


DKFZP586B0319 protein 


31.00 


120266 


A) 807264 


Hs 205442 


ESTs, Weakly similar to T34036 hypotheti 


20.20 


120515 


AA258356 




gb:zr59c10.s1 Soares_.NhHMPu_S1 Homosapi 


25.00 


120859 


AA826434 


Hs.1619 


achaete-scute complex (Drosophila) homo) 


95.40 


120983 


AA398209 


Hs 975B7 


EST 


105.20 


121054 


AW976570 

All Ji UJf \J 


HsJJ7387 


ESTs 


38.80 


121369 


AW450737 


Hs.128791 


CGJ-09 protein 


41.60 


122335 


AA443258 


Hs 241551 


chloride channel, calcium activated, fam 


30.80 


122612 


AA974S32 


Hs.1 28708 

1 U» I £.1)1 Vtf 


ESTs 


19.60 


123130 


AA487200 




gb:ab19f02s1 Stratagene lung (937210) H 


3320 


123440 


A1733692 


Hs 112488 


ESTs 


23.17 


123596 


AA421130 


Hs.1 12640 


EST 


23.00 


123619 






gbmo97cOZs1 NCLCGAPJ>r2 Homo sapiens 


28.80 


124006 


AI147155 


Hs.270016 


ESTs. 


77.60 


124159 


BE079334 


Hs.271630 


ESTs 


2220 


124281 


Al 333756 


Hs 111801 


arsenate resistance protein ARS2 


4220 


124472 


N52517 


Hs 102670 


EST 


32.60 


124617 


AW628168 


Hs 152684 


ESTs 


21 JO 


124631 


MM 014053 


Hs 270594 


FLVCR protein 


30.40 


124839 


R55784 


Hs 140942 


ESTs 


21.20 


125186 


AA610B20 


Hs 181244 


major hfetocompatibility complex, class 


42.80 


125321 


T85652 


Hs 178294 


ESTs 


27.00 


125535 


NM_013243 


Hs22215 


secrelogranin III 


23.80 


125646 


AA628962 


Hs 75209 


protein kinase (cAMP-dependent, cataryS 


2320 


125684 


AW589427 


Hs.1 58849 


Homo sapiens cDNA* FU21663 fis, clone C 


2120 


125724 


AL3601S0 


Hs 295978 


Homo sapiens mRNA full length Insert cDN 


48.80 


125847 


AW161B8S 


Hs 249034 1 


ESTs 


31.00 


125934 


AA1 93325 


Hs.32646 


hypothetical protein FU21901 


21.20 


126077 


M78772 


Hs 210836 


ESTs 


49.80 


126299 


AW979155 


Hs.298275 


amino acid transporter 2 


21.80 


126395 


A1468004 


Hs 278956 


hypothefical protein FU12929 


71.00 


126433 


AA325606 




gb:EST28707 Cerebellum li Homo sapiens c 


23.20 


126509 


R47400 


Hs, 23850 


ESTs 


23.80 


126538 


AB030656 


Hs.17377 


coronin, actin-binding protein. 1C 


23.10 


126666 


AA648886 


Hs.151999 


ESTs 


35.00 


126812 


AB037860 


Hs.173933 


nuclear factor l/A 


20.B0 


126872 


AW450979 




gb:UI-H-Bi3«fa-a-12-04/!.s1 NCLCGAP_Su 


46.29 


127046 


AA321948 


Hs.293968 


ESTs 


22.80 


127431 


AW771958 


Hs.1 75437 


ESTs, Moderately similar to PC4259 ferri 


30.00 


127489 


AA650250 


Hs.272076 


ESTs 


20.80 


127521 


AW297206 


Hs.164018 


ESTs 


25.20 


127742 


AW293496 


Hs.1 60138 


ESTs 


26.00 


127925 


AA805151 


Hs.3628 


mitogen-acBvated protein kinase kinase 


2120 


127930 


AA809672 


Hs.123304 


ESTs 


20.54 


127968 


AA83O201 


Hs.1 24347 


ESTs 


28.20 


127987 


AI022103 


Hs.1 24511 


ESTs 


19.60 


128116 


H07103 


Hs.286014 


Homo sapiens, clone IMAG&3867243, mRNA 


20.40 


128609 


NM_003616 


Hs.102455 


survival of motor neuron protein interac 


34.40 


128777 


AJ878918 


Hs.10526 


cysteine and giyrine-nch protein 2 


53.80 


128949 


AA009647 


Hs.8850 


a disintegrin and metallop rotein ase do ma 


23.00 


129168 


AI132988 


Hs.109052 


chromosome 14 open reading frame 2 


37.60 


129404 


AI267700 


Hs.317584 


ESTs 


28.60 


129527 


AA769221 


Hs.270847 


delte-tubuBn 


40.80 


129574 


AA026815 


Hs.11463 


UMP-CMP kinase 


3120 


129598 


N30436 


Hs.1 1556 


Homo sapiens cONA FU12566 fis, done NT 


29.60 


129785 


H19006 


Hs.184780 


ESTs 


7220 


129970 


AV65580S 


Hs296193 • 


chromosome 12 open reading frame 4 


2220 
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130149 


AW0878D5 


Hs.172665 


13019? 


248579 


riS.i/ZlXdO 


130441 


U63530 


Hs. 155637 


130466 


W19744 


Hs. loUuoy 


130482 


AYV4U37U1 


U<? 1 C7Q 


130617 


M90516 


nS.lo/4 


idu/Uo 


R7T776 


ns.la iuo 


11A7T3 




ns*D03o*t 
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AI499220 
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Hs. 170290 


134032 


Mil nnenis 


Up 7QC0Q 

ns./oooy 
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AA339449 
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melhylenetetrahycfpofolatB dehydrogenase 29.60 

a cDsintegrin and rrietaScproteinase doma 27.60 

protein kinase, DNA-acfivated, catalytic 28.36 

Homo sapiens cDNA FU20653 6s, dona KA 20.20 

bacu!(wraIlAPrepeakoni3in3ng5(sur 22.40 

ghrtamin&-6^ctose-6-phosphate transamin 19.60 

ESTs • 19-40 

cadherin 13, H-cadhertn (heart) 21 .40 

UDP glyrayi transferase 1 family, poJype 1 1 0.00 

CCAAT/ennancer binding protein (OEBP). 25.20 

chromogranin B (secretogranin 1) 40.60 

E1A binding protein p300 24.60 

K1AA0648 protein 21.00 

Rho guanine nucleotide exchange factor { 33.40 

replication factor C (activator 1)4 (37 60.80 

Homo sapiens cDNA: FU22373 fe. clone H 20.40 

karyopherin alpha 3 {importin alpha 4) 29.40 

ESTs 32.40 

cyctmLania-Sa 27.40 

bone rnorphogenefic protein 7 (osteogenic 75.60 

carbonic anhydrase XII 31.36 

guanine mo n phosphate synthetase 32.40 

Cdc42 guanine exchange factor (GEF) 9 23.40 

ESTs, Weakly similar to T33468 hypoiheti 61.20 

phosphoserine phosphatase 22.33 

a disintegrin and metanoproteinase doma 23.50 

HSKM-B protein 30.00 

twist (Drosophila) homotog (acrocephatos 23.80 

SRB7 (suppressor of RNA polymerase B, ye 51 .60 

hypothetical protein FU10074 33.00 

general trans cripfion factor TI1A 82.00 
secretogranin II (chromogranin C) 

(Bscs, targe (Drosophila) homotog 5 69.33 

serine (or cysteine) proteinase inhibito 33-20 

WAA0203 gene product 31.60 

BCL2/adenovirus E1B 19kWnterac5ng pro 30.60 

ESTs, Moderately similar to A46010 X-fin 23.40 

phospbonTjosylglydnamide fa 49.20 

SW1/SNF related, matrix associated, act) 20.20 

Ajal-spedfidtytyrosine^phosphoryl 20.80 

G antigen 7B 37.60 

hydroxysteroid (17-beta) dehydrogenase 53.40 

eyefin-dependent kinase 5, regulatory su 31 .60 

rKurotrophin 3 28.80 



TABLE 4B shows the accession numbers for those prirnekeys lacking unlgenelffs for Table 4A For each probeset we have listed the gene duster number from which the 
digenudeofides were designed. Gene clusters were compHed using sequences derived from Genbank ESTs and mRNAs. These sequences were dustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland Cafifomia). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession' column. 



Pkey: Unique Eos probeset identifier number 
CAT number. Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 

123619 371681.1 AA602964 AA609200 

126433 127143 1 AA325606 AA099517 N89423 

126872 142696J AW450979 AA136553 AA136656 AW419361 AA984358 AA492073 BE168945 AA809054 AW238038 BE011212 BE011359 

BE011367 BE011368 BE011362 BE011215 BE011365 BE011363 

106851 322947 1 A1458623AA639708AA485409 R22065AA485570 

118720 genbankjtf3515 N73515 

120515 genbanleAA258356 AA258356 

117099 321871 1 H93699 H97976 H80036 

101447 entrez_M21305 M21305 

123130 genbank_AA487200 AA487200 
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WO 02/086443 PCT/US02/12476 * 

labia 5A shows 680 genes unregulated In squamous ceil carcinoma or adenocarcinoma lung tumors relative to normal lung and chronically diseased rung. Tnese genes were 
selected from 59580 probesets on the Eos/Affymetrix Hu03 Genechfp array. Gene expression data tor each probeset obtained from this anarysis was expressed as average 
intensity (Al), a normalized value reflecting the reiaSve tevd of mfttJA expression 

Pkey: Unique Eos probeset idenfifer number 

ExAccrc Exemplar Accession number, Genbank accession number 

UntgenelD: Unigene number 

Unigene Title Unigene gene Gild . . ... 

R1: 70th percentile of AJ for squamous cell carcinoma and adenocarcinoma lung tumor samples divided by the 90th percentile of Al for normal and chronically 

diseased lung samples. tL 

R2 eoth percentile of A! adenocarcinoma lung turner sanies dwidedtyta 

R3: 80th percenfBe of Al squamous cell carcinoma lung tumor samples divided by the 90th percenKe of Al for normal and chronically diseased lung samples. 

R4- 80th percentile of Al adenocarcinoma lung turnor samples divided by the 80ft pe^^ . 

R& • 70th percentile of A) for squamous cell carcinoma and adenocarcinoma rung tumor samples minus the 1 5th percentile of Al for all normal rung, chronically 

diseased rung and tumor samples Divided by 90th percentile of Al for normal and chronically diseased lung samples minus the 1 5th percentile of Al for aD 

normal rung, chronically diseased rung and tumor samples 
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HsJoloo 
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nS.571Ul 


100202 


BE294407 


u» 0001 n 


100216 


AA4B9908 


Hs.1390 


100269 


NAU)01949 


Hs.1189 


100287 


AU076657 


Hs.1600 


100297 


AU077258 


Hs. 182429 


100330 


AW410976 


Hs.77152 


100335 


AW247529 


Hs.6793 


100360 


W70171 


Hs.7KJ39 


100372 


NMJJ14791 


Hs.184339 


100474 


NMJ300699 


HS.3Q02BG 


100486 


T19006 


Hs.10842 


100491 


056165 


IJ_ AfMM 


100516 


D90278 


HS.11 


100522 


X51501 


M_ QQftJQ 

ns.yyiwy 


100559 


NM_0000S4 


Hs.1540 


100576 


X00356 


II. 

Hs^/Ubo 
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AA01569J 
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no^tao l it 


mnfttn 

1UUOJU 


a rnn 4 770 


Hs.4756 


innnfi7 

lUUOOf 


1MAR90 




mnono ■ 




u«! 907070 




AUUf 05MO 


Hs.5398 




J0Q124 


Hs 117729 

l mJ* III* *v 


iu iy*+3 


J05614 




101061 


NML000175 


Hs.180532 


101071 


L02840 


Hs.84244 


101124 


L10343 


Hs.112341 


101175 


U82671 


Hs.36980 


1011B1 


BE262621 


Hs.73798 


101204 


L24203 


Hs.82237 


101210 


L29301 


Hs.2353 


101216 


AA284166 


Hs.84113 


101228 


AA333387 


Hs.82916 


101233 


AL135173 


Hs.878 


101273 


Z11933 


Hs.182505 


101342 


U52112 


Hs.182018 


101346 


AI738616 


Hs.77348 


101369 


NKL000892 


Hs.1901 


101396 


BE267931 


Hs.78996 


101431 


BE1852B9 


Hs.1076 


101448 


NM.000424 


Hs.195850 


101462 


AL035658 


Hs.73853 


101466 


BE262660 


Hs.170197 


101484 


AA053486 


Hs.20315 


101502 


M26958 




101505 


AA307680 


Hs.75692 


101526 


NMJJ02197 


Hs.154721 


101535 


X57152 


Hs.99853 


101577 


M34353 


Hs.1041 


101649 


AW959908 


Hs.1690 


101663 


NfVL003528 


Hs.2178 


101664 


AA436989 


Hs.121017 


101659 


L24498 


Hs.80409 



Unigene Title 

AFFX controt GAPDH 

AFFX control: GAPDH 

AFFX control: GAPDH 

Human GABAa receptor alpha-3 subunlt 

thymidylate synthetase 

10AA01 01 gene product 

aldo-keto reductase family 1 , member C3 

mhlchronrcsome maintenance deficient (S. 

phosphofructoklnase, platelet 

proteasome (proscme, macropain) subunit, 

E2F transcription factor 3 

chaperonin containing TCPf, subunit 5 (e 



R1 R2 R3 



R4 



mWchrornosome maintenance deficient (S. 



uridine monophosphate kinase 
KIAA0175 gene product 
amylase, alpha 2A; pancreatic 
RAN, member RAS oncogene family 
non-metastafic cells 2, protein (NM23B) 
cardnoembryonic antigen-related cell ad 
prdactin-induced protein 
collagen, type VII, alpha 1 (epidermofys 
caWtonin/calcitonin-relaterJ porypeptid 
mhogen-activated protein kinase kinase 
Homo sapiens ribosomal protein L39 mRNA, 
zinc ribbon domain containing, 1 
general transcription factor IIA, 1 (37k 
myeldd/lyrnphoid or mixed-lineage leukem 
K1AA0618 gene product 
flap structure-specific endonuclease 1 
gbiHuman transketolase-iike protein gene 
ret proto-oncogene (multiple endocrine n 
guanine monphosphate synthetase 
keratin 14 (epidermolysis bullosa simple 
gb:Human proliferating cell nuclear antl 
glucose phosphate kom erase 
potassium voltage-gated channel, Shab-re 
protease Inhibitor 3, skin-derived (SKAL 
melanoma antigen, family A, 2 
macrophage migration Inhibitory factor ( 
ataxia-telangiectasia group D-associated 
opioid receptor, mu 1 
cycTirwfependent kinase inhibitor 3 (CDK 
chaperonin containing TCP1, subunit 6A ( 
sorbitol dehydrogenase 
POU domain, class 3, transcription facto 
interieukin-1 receptor-associated kinase 



kaffikrein B, plasma (Fletcher factor) 1 
proliferating cell nuclear antigen 
small proline-rich protein 1B (comifin) 
keratin 5 (epidermolysis bullosa simplex 
bone morphogenetic protein 2 
glutamic-oxatoacetjc transaminase 2, mil 
interferarHnduced protein with teiratri 
gb:Human parathyroid hormone-related pro 



aconitase 1, soluble 
(Marin 

v-ros avian UR2 sarcoma virus oncogene h 
heparin-binding growth factor binding pr 
H2B histone family, member Q 
H2A histone family, member A 
growth arrest and D^arnage-irtducfole, 



8.00 



3.84 
3.33 



2.55 



5.07 



7.20 



aio 

3.85 



24.80 



Z57 

312 

350 

4.08 
2.53 

8.50 

3.24 
&31 

10.50 
4.02 



54.00 

5.59 

7.00 



7.60 
10.20 

aoo 



12-91 



6.40 



15.65 



14.20 

9.30 
20.60 



10.00 



R5 

6.76 
5.77 
5.75 

5.71 



4.52 
5,49 
5.67 

&66 
3.81 
4.50 

4.82 
3.79 

5.49 
4.17 



21.89 
12.80 



38.80 
12.00 

9.09 



5.16 

4.69 
4.19 



5.69 



7.90 
4.45 

4.17 



7.90 

4.01 

4.46 
4.65 



7.60 
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101695 


M69136. 


Hs.135626 


101724 


L11690 


Hs.620 


101748 


NM.001944 


Hs.1925 


101759 


M80244 


Hs.184601 


101771 


•NMJ302432 


Hs.1 53837 


101804 


M56599 


Hs.1 69840 


101809 


MB6B49 


Hs.323733 


101833 


AU076442 


Hs.1 17938 


101842 


M93221 . 


Hs.75182 


101851 


BE260964 


Hs.82045 


102002 


NM_002484 


Hs.81469 


102039 


AL1 34223 


Hs.306098 


102072 


U09410 


Hs.78743 


102083 


T35901 


Hs.75117 


102111 


L361S6 


Hs.81834 


102123 


NMJJ01809 


Hs.1594 


102154 


U17760 


Hs.75517 


102193 


AL036335 


Hs.313 


102217 


AAB29978 


Hs.301613 


102224 


NKUJ02810 


Hs.148495 


102234 


AW163390 


Hs.278554 


102251 


NMJM439B 


Hs.41706 


102305 


AL043202 


Hs.90073 


102330 


BE298063 


Hs.77254 


102340 


U37055 


Hs.278557 


102348 


U37519 


Hs.87539 


102358 


U39817 


Hs.36820 


102394 


NMJ10381& 


Hs.2442 


102404 


NM_005429 


Hs.79141 


102537 


U57094 


Hs.50477 


102581 


AU077228 


Hs.77256 


102605 


AI435128 


Hs.181369 


102610 


U65011 


Hs.30743 


102623 


AW249285 


Hs.37110 


102642 


AA205847 


Hs.23016 


102654 
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Hs.24385 
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BE245169 


Hs.211610 


102669 


U71207 


Hs.29279 


102672 


U72066 


Hs.29287 
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NM.007019 


Hs.93002 


1026S6 


BE540274 


Hs.239 
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U82321 
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BE258778 


Hs.108809 


1027B4 


U85658 


Hs.61796 


102824 


U90916 


Hs.82845 
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NMJJ06183 
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103060 


NMJJ05940 
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103099 


A1693251 


Hs.8248 


103119 


X63629 


Hs.2877 


103168 


X53463 


Hs.2704 


103185 
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Hs.74368 


103192 


M22440 


Hs.170009 


103223 


BE275607 


Hs.1708 . 
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XB3301 
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BE270266 
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Hs.816 
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NM.0Q6235 


Hs.2407 


103768 


AF086009 




03841 


AA314821 


Hs.38178 


03847 


AF219946 


Hs.102237 


03913 


AW967500 


Hs.133543 


04094 


AA418187 


Hs.330515 
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BE563452 
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132581 
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132617 


AF037335 
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132638 
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Hs.54277 


132653 


Z15008 


Hs.54451 


132669 


W38586 
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132710 


W74001 


Hs.55279 


132771 


Y10275 


Hs.55407 


132799 


W73311 
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132833 


U78525 
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132892 


AW834050 


Hs.9973 


132906 


BE613337 


Hs.234896 


132959 


AW014195 


Hs.61472 


132962 
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132994 
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Hs.65648 


133155 


M58583 


Hs.662 


133181 


X91662 


Hs.66744 
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Hs.73818 
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TABLE 5B shows the accession numbers for those primekeys lacking unigeneltys for Table 5A. For eachprobeset we have fisted the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tods (DoubfeTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are fisted in the 
'Accession' column. 

Pkey: • Unique Eos profaesef identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 

117079 

124305 

101502 

109792 

126034 

102768 

126345 

127066 

127099 

119243 

125875 

112054 

126979 

126992 

122318 

114699 

114793 

108305 ■ 

108393 

100867 

123731 

109700 

120715 

113702 

115113 

101045 

108554 

108573 

119052 

126522 

126605 

103768 
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Accessions 



1621717J 
242183 1 
18202.-$ 
754958.1 
1598157 1 
44641.1 
1653B33J 
1703458.1 
244301.1 
1774795.1 
1566433.1 
1538292J 
171411.1 
880655 1 
292419.1 
135322.1 
150742.1 
111550 1 
113411.1 
tigr_HT4586 
genbank_AA609839 
genban)LF09609 F09609 
genbanJLAA292700 
genbank.T97307 T97307 
genbank_AA256460 
entrezJ05614 J05614 
genbank_M084948 
genbankJW3fi5005 
149538.1 



H92325 T97125 

AW983221 AA344870 AA344871 H93331 
M26958 
R49625F10674 
HS0340N91637 
U82321 H66077 
N49713 N49819W03810 
R25066 R20144 R20145 Z43845 
AA34766B AW956810 Z44271 F07065 F07064 R13506 
T12603T12604 
H14480N98295 
R43590F10439 
AA21O954AA211007 
AI809521 H12174Z42556 
AA429743AA442754 
AA127386R15644AA1 27404 
AA158245AA158235 
AA071391 AA069892 AA069891 
AA075211 AA075245 AA075126 AA074946 
U14622 

AA609B39 



AA292700 
AA256460 



AA084948 
AA086005 
R10889R10888 
416020.1 W31912AI167491 
439280 1 AA676910AA778853AA778865WB6800 

46922.1 W42667 AI580740 AI690440 AI561350 AW467906 AW151450 A1S25927 AL041716 At8856G0 AI742213 AW248624 AI955498 AA033947 

AA845593 AI62371 1 N68583 C00064 AA193567 AW083868 AW163216 AA191595 AA522778 AI628008 AI915518 AA843508 AI926195 
M176265AW167963AA992115VW3647AW103572A)862994A!342W^ 
AI199673A)811768AI275832A1422233AI191852AI096682AI580124AI683512AA582453AA^ 

H44848 H20477 T91 695 W47039 AA070055 AA024795 AA32B855 AA379248 AA379330 AA385580 W25920 W03688 AA448359 AA093881 
AW362477 AA0B9997 AI350265 W93479 N99688 AA932257 AW351469 H68590 AA663402 M069771 AW087986 AI858420 AA600214 
AI970774 AI857712 AI683081 AI885584 AW131 1 50 AI567981 AW002714 AW189973 AW075495 AW168303 AA953714 AW51 6881 AI357375 
A1566563 AW512676 AI570580 AI023690 AA44821 6 AI079853 AI422707 AA77951 6 AW026972 AW130082 AW1 62307 AW438646 AA709332 
AW192394 All 67350 AI217879 All 291 52 AA719509 AI350480 AA66341 8 AI003634 AW118546AA180261 AA442833 AI268625 AA888B81 
A103B759 AA846723 AI248770 AA993694 AI280335 AI885107 AW518649 AA641563 AA995835 AA582521 AI276744 AA436478 A1017360 
A1620763 AIB59887 N73926 A1076327 AI74161 5 AJ160617 AW172819 AI492005 AA677429 AA996334 AI693771 A195C039 AI245629 A128851 5 
AI8661 86 T93293 AA173262 AA599779 AI6B0092 AW439316 AI084555 AI272672 AI583507 AW47321 9 AA738132 AW473283 AI367492 
AA9954 10 AI689624 AA206353 AI033095 AI04O382 AA873530 AI221074 A/934840 AJ41 8680 AA844306 R94503 AA773520 AA843 169 
AA219425 AA629658 AI811719 AW411275 AJ590981 W37907 AI591 178 AI684051 AA983238AA669347 AA976239 M704570 A1628339 
AIB84391 AI241 580 A10O3539 AW176687 M009650 N34566 AI333493 AI188070 AA070B27 AA41 1 683 A12808B4 AA872023 AA207255 
AA021576 N71 953 AI885888 AW076039 T15777 A1537673 AW248048 H09554 W93480 W47001 AW0791 1 4 AA0631 60 AA757453 R60788 
AI859431 H20478 AA21 8882 AA757465 AA100995 A1864135 AI934209 AA070503 H47008 AA219646 W61039 W93907 AW385050 W37967 
W78028 M189007 AA479136 R93650 AA442312 T30287 AA847628 AA180262 AA009549 C03892 AW149464 AA310963 AA219693 
AA069747 R29207 AA094784 AA293615 AA447848 AI984167 N90393 C05097 N56499 AW292351 AW149681 AW473258 AA629322 AI004409 
AW105577 AI954937 AI81 1 070 AA902422 AW514437 AA535460 AA916877 AW517122AA974657AA975649AW517130AW517129 F31737 
W07588 AA193645 AA378994 AA489273 F32267 W39303 AA021 181 N86B10AA406524AA062553AA436801 H08985 H15979 N40310 
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112382 genbankj*59904 R59904 

105264 genbankJVA227934 AA227934 

100071 entrezj\28102 A28102 

123315 714071.1 AA49B369AA496646 



Table 6A shows 99 genes up-reguiaied nonsmokers wiih tung cancer relative to smokers win lung cancer. These genes were selected from 59S80 probesets on the 
Eos/Affymstrix Hu03 Genechip array. Gene expression dala for each probeset obtained from this analysis was expressed as average Intensity (AI), a normalized value retlecfing . 
the relative level of mRNA expression. 

Pkey: Unique Eos probeset identifier number 

ExAccrv Exemplar Accession number, Genbank accession number 

Unigeneia Unigene number 

Unigene Title: Unigene gene title iir 

R1- average of Al for samples from non-smokers with adenocarcinoma divided by the 90th percentile of AI for samples from smokers with adenocarcinoma 

R2 average of AI for samples from non-smokers with squamous ceB carcinoma divided by the 90th percentile of AI for samples from smokers with squamous cell 
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ESTs 
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ESTs 
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ESTs 
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protein tyrosine phosphatase, receptor t 
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EST 
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ESTs 
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chromosome 12 open reading frame 2 
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ESTs 
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ESTs 
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TABLE 6B show the accession numbers for those primekeys lacking unlgenelD-s for Table 6A. For each probeset we have fisted the gene duster number from 
oligonudeotldesweredesigned. Genechisters were complied using sequences derived from Genbank ESTs and mRNAs. ™ M8S ^f™^ 
simBarity using Clustering and Alignment Tools (DoubleTwist. Oakland CaTifomla). The Genbank accession numbers for sequences composing each duster are listed in the 
"Accession* column. 
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CAT number Gene duster number 
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Table 7A shows 98 genes dowwegutated in non-smokers with Jung cancer relative to smokers with lung cancer. These genes were selected from 59680 probesete on the 
Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AJ), a normalized value reflecting 
the relative level of mRNA expression- ' 

Pkey: Unique Eos probeset identifier number 

ExAccru Exemplar Accession number, Genbank accession number 

UnigeneJD: Vntgene number 

UnigeneTBte.* Unigene gene title 

R1: 90th percentile of Al for samples from smokers with adenocarcinoma divided by the average of Al for samples from non-smokers with adenocarcinoma. 

R2; 90fn percentile of Al for samples from smokers with squamous cell carcinoma divided by the average of Al for samples from non-smokers with squamous ceO 
carcinoma. 
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TABLE 7B shows the accession numbers for those primekeys lacking unigenelD's for Table 7A. For each probeset we have listed the gene cluster number from which the 
oV^^deTwele Sd Sn^S«3cS^sequences derived from Genbank ESTs and mRNAs, These sequences were clustered basedon sequence 

'Accession* column. 

Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 
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poSonUie Eos/^eto Hu02 Genech^ array. Gene expression data for each probeset obtained from Ous analysts *as expressed as average intensity (Al), a 
normalized value reflecting the relative level of mRNA expression. 

Pkey: Unique Eos probeset identifier number - 
• ExAccn: Exemplar Accession number, Genbank accession number ~ 
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Unigene Title: Unigene gene title ' ^_ tkic 
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1.30 
4.55 
0.82 
0.47 
1.36 
0.69 
1.88 
4.72 
0.79 
0.45 
1.00 
3.93 
1,50 
3.59 
5.94 
8.14 
1.55 
5.16 
0.59 
3.70 

ai3 

0.32 
1.00 
4.01 
1.04 
0.29 
1.50 
5.88 
0.55 
4.31 
0.38 
1.24 
0.46 
1.61 
0.73 
0.20 
2.18 
0.50 
3.64 
Z93 
0.95 
1.71 
1.15 
3.22 
3.42 
5.50 
Z99 
3.80 
0.28 
0.46 



0.84 

0.64 

1.11 

1.01 

15.75 

12.61 

8.01 

198 

1.53 

1.17 

1.58 

0.98 

1.36 

1.90 

2.10 

2.35 

5.67 

2.60 

0.96 

1.65 

2.13 

2.53 

1.42 

4.50 

0.94 

1.00 

0.89 

0.91 

1.13 

0.98 

2.15 

1.09 

14.30 

1.38 

12.30 

12.03 

8.63 

1.59 

0.76 

3.70 

1.04 

1.70 

3.14 

0.62 

1.10 

3.56 

5.76 

1.31 

4.75 

15.40 

19.53 

1.56 

8.07 

213 

7.15 

10.60 

1.14 

3.84 

7.43 

2.96 

1.14 

1.39 

18.65 

1.75 

8.01 

1.97 

1.98 

1.47 

1.42 

0.97 

0.26 

1.58 

0.71 

14.94 

3.98 

0.91 

1.92 

140 

10.54 

4.89 

1175 

3.07 

a 29 

0.57 
1.17 
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OJOOOO 


2.55 


181 


OoO/DO 


2.24 


1.07 




0.20 


0 07 


•3330 IH 


t;i3 


1,48 


335815 


2.45 


351 




1.00 


4.16 




0.49 


1.70 




1.66 


1.39 


-¥3 COCO 
JuJOOO 


2.98 


6.43 


335896 


0.98 


0.99 


335936 


12.10 


21.93 


335943 


1.00 


1.64 


335983 


1.00 


4.21 


<nctme 


0.37 


1.17 


jjDUei 


104 


0.84 




11.40 


23.54 


336038 


1.19 


1.21 


336066 




1.63 


336107 


0.95 




•JO CO AC 


3.13 


6.29 


336275 


3.20 


10.10 


336292 


9 14 


*> AO 


33W1 


1.00 


1.00 




0.65 


0.79 




2.33 


2.16 


Infill ' ' 


2.55 


2.23 




2.19 


2*03 


330OJ0 


2.69 


2.48 


336636 


2.13 


1.83 


33003/ 


2.43 


2.24 


336638 


2.31 


2.03 


33BD59 


0.60 


1,31 


336675 


0.31 


1.18 


336684 


i 


114 


336694 




7 m 

/. IV 


33S716 


4.43 


6.37 


336721 


2.20 


0.74 


Q1C7Q0 


1.64 


2.14 


OJDsHJU 


6\14 


12.73 


300348 


1.00 


1.00 


33/070 


1.30 


2.09 


33/1*43 


4.01 


11.53 


337046 


1 G7 
1.0/ 


1 84 


337004 


Z78 


7.35 


33/110 


7,20 


16.14 


0*971 CO 

337162 


3 45 


5 34 


337183 




11.41 


337184 


3.72 


5.90 


337192 


1 77 


1.05 


337194 


1.88 


1.68 


337229 


0 99 




337268 


1 nn 

I.W 


3 31 

0-0 1 


337299 


O.cO 


^ 14 

o. i** 


337325 




3 79 
0./* 


337389 


5.60 


10.42 


33/493 


2.06 


6.30 


J J/ 49/ 


7.88 


20.29 




3.80 


4.48 


33/ 049 


1.66 


2.31 


33/0U3 


1.27 


8.54 


33/DU3 


5.76 


7.16 


33101 I 


0.73 


0.97 


TT77W 
331 IOD 


1*54 


(X92 


3377RP. 
301 I0Q 


5.07 


9.73 


«W« Ova 


6.18 


12.87 


30/00* 


3.78 


12.97 


30lol 1 


2.66 


8.16 


Mi 900 


0.26 


1.34 


338008 


1.48 


1.12 


338033 


. 238 


14.59 


338083 


0.65 


2.16 


338110 


1.00 


1.61 


338112 


' 5.86 


8.25 


338145 


1.70 


1.97 


338148 


8.07 


18.19 


338158 


1.30 


4.55 


338161 


2.58 


3.57 


338179 


1.00 


1.00 


338162 


3.32 


4.63 


338189 


1.00 


3.34 


338197 


0.99 


1.69 


338199 


4.58 


7.62 


338215 


6,01 


15.85 


338279 


0.53 


0.95 


338316 


20.58 


38.66 
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338322 

338357 

338359 

338366 

338374 

338414 

338418 

338469 

338501 

338506 

33B523 

338549 

338561 

338562 

338671 

338576 

338726 

338779 

338804 

338836 

338871 

338872' 

338879 

338937 

338966 

338993 

339047 

339100 

339114 

339121 

339170 

339229 

339264 

339293 



123 
4.10 
10.12 
0.69 
0.40 
0.47 

ai2 

3.09 

6.28 

6.97 

3.10 

1.70 

0.79 

1.72 

0.17 

2.10 * 

1.20 

0.12 

0.99 

1.00 

4.30 

5.02 

0.23 

6.55 

1.76 

1.00 

5.26 

5.10 

1.00 

1.00 

10.36 

4.08 

2.64 

1.73 



7.39 

11.39 

21.59 

1.02 

1.18 

1.06 

1186 

5.11 

10.32 

1141 

5.84 

2.70 

0.81 

1.46 

0.91 

15.86 

1.09 

0.57 

1.67 

1.00 

9.81 

1181 

1.12 

12.25 

5.42 

140 

10.81 

6.88 

1.70 

3.75 

19.67 

13.48 

3.83 

1.94 



TABLE 8B shows the accession numbers for those Pkeys In Table 8A lacking uregeneHys. For each probeset we have fisted the gene cluster number from which the 
oligonucleofides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each duster are listed in the 
'Accession' column. 

Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 


CATnumbe 


r Accessions 


322044 


187363J 


AW340926AA249063 N86075 


322060 


44320J 


AI341937 AW003063 U34725 AA904742 


321430 


42705,1 


X57414 X57415 


321467 


43034.1 


X13075X13076 


322125 


46779.1 


R93901 AF075073R93902 


322166 


46861J 


H69434AF08595BH69846 


322173 


46873 1 


H52557 H52557 AF085970 H52164 


322178 


46882J 


H56535AF085980 H56712 


322179 


46885 1 


H92891AF085982 H92777 


321577 


1615102J 


H84849 H84252 H84260 H86664 H85320 


321567 


1615333J 


H9553f H95521 H84529 


313723 


111953.1 


AAO7O412AA102346AAO818B5 


320997 


627492.1 


H22544 H46842AI204929 


322278 


47271J 


W69304AF086283W69200 


321687 


218439.1 


AA625149 AA313030 AA313052 H97463 


313883 


129439.1 


AA665089 AA135130 AA484059 AA102419 AW877765 


322320 


47422.1 


W79150AF086419 


322339 


8145B4J 


AI658646AI734214W17348 


314648 


293660.1 


AW979268 AA8784 19 AA431342 AA431628 


300201 


682222J 


AI308300AI308296 


306897 


25196.-2 


AI093967 


323155 


979809.1 


AL120701AL135041AL121524 


322527 


38927.1 


AF147359 T58511T58560 


322585 


473768 2 


W88919W39125 


300362 


1574395.1 


24230BH23514 


322635 


82296 1 ' 


AA005129AA679084 M694399 


322664 


85042J 


AA011522AA702841 AA011691 AA330797 


315454 


380580.1 


AI239454 AI239473 AA625812 AI208703 


322587 


37373.1 


AF074666AI1 10759 AF090902 


314852 


327472J 


AI903735 AA491283 AI694953 AW976903 AA761362 


307783 


©7809 1 


AI347274AW844024 


324072 


269032.1 


AA381 722 AA381829 AW963906 AW963902 AA381 242 


300627 


221345L1 


AA488472 W27363 AA317053 BE082689 AW967036 BE079872 


323505 


196389 1 


AW970512AA280251 A1552287 BE466438 A1650725AA551B54AA281574AW571481 


315791 


403558 1 


AA678177AA677034 


324303 


233842.1 


AL118754AA333202 H38001 


316519 


442885.1 


AA847835AA768376 


300926 


333127.1 


AA504860AA504911 



144 



WO 02/086443 



PCT/TJS02/12476 



324560 
301882 
324804 
324889 
302697 
302711 
302742 
318499 
310624 
302847 
304122 
303598 
311409 
312094 
319312 
319407 
319425 
320007 
320018 
319484 
318865 
312220 
319546 
312389 
319611 
312437 



311896 
319B34 
321102 
321158 
321199 

305528 
321270 
314126 
320714 
305442 
306446 
306458 
306510 
306557 
306572 
306582 
306656 
306686 
306751 
308011 
306892 
308106 
308154 
306956 
306958 
308213 
308216 
308219 



30B643 
30B673 
308697 
308778 
308808 
308875 
308886 
308898 
308966 
308979 
303011 
303077 
305016 
305034 
305072 
305148 
305190 
303978 
303990 
303998 
303999 
305235 
305312 
305413 
305447 
321244 



328264J 

275087.1 

39B093J 

1515978J 

43219.1 

45419.1 

458.39 

364430.1 

34624.4 

458.105 

77271.-5 . 

270283.1 

837264.1 

797889.1 

1540116.1 

1688823.1 

1689571 1 

229683.1 

1815987J 

1691553.1 

1535937.1 

1671607.1 

243305.1 

902067.1 

1566863.1 

291472.1 



579192,1 

112523.1 

80531.1 

410938.1 

212379.1 

28832.-3 
1662057.1 
177666.1. 
743644.1 
AA976899 
AA977348 
AAS7Bm 
AA98B546 
AA994530 
AA995686 
AA996248 
AI004024 
AI015615 
AI032589 
A1439473 
AI092465 
AI476803 
AI500600 
AI125111 
A1125152 
AI557041 
AI557135 
AI557246 
Af718299 
' AI719893 
AI745040 
AI760854 
A1767143 
A1811109 
AI818289 
A1832332 
A1833240 
AI858845 
A1870704 
A1873111 
41689 1 
44060.1 
AA626876 
AA630128 
AA641012 
AA654070 
AA665955 
AW513315 
AW515465 
AW516449 
AW516611 
M670480 
AA700201 
AA724659 
M737856 
29327.1 



M4925B8AA492493AA492571 
T78054 T79888AA398185 

A1692552AJ393343AI800510AI377711 F24263AA661876 

D31010D30B91 031168 D31 166 031465 

AJ001409AJ001410 

L08442 D51348 

L12051 

T25451 AA58529BAA585305 

U88896 U88898 AA91 6056 T03285 AJ341594 A1359534 AI634031 U88897 
X98941 X98942 X98943 X98953 X98949 



AA382814 AA402411 AA412355 
AI698839AI909260AI909259 
278390 T97427 
Z45481 F12393T74437 
R05329 R01555R08276 
T82930 R02424 T85145 

AA335314 T82938 AA327744 AW967388 AA639967 T10753 

T83263T85731T85730 

T91772 R07257 R07098 

H10818F07831 243072 

N74613T98756T98589 

R09692R09414AA346353 

At863140WB0703 R43474 

H14957 R56522 R11908 

BE080180 AW827313 AW231970 AA995028 AA428584 AW872716 AW892508 AW854593 AA578441 AW975234 AA664937 AA984131 
AA528743 AA552B74 AA564758 AW063245 A1267534 AW070190 AW893483 AA770330 AA906928 AA906582 AA758746 AA55171 7 
AW063311AA429538 

AW205447 AI248530 AI084433 A1400976 R16553 
AA071267T65940T64515 M071334 
AA018306 H38925AA001221 
H79570H47798AA700289 

N34524 AA305071 AW954803 AA502335 AI433430 A1203597 AW026670 AW265323 AW850787 AA317554 AW993643 AW835572 

AW385512A1334966 W32951 H62656 H53902 R88904 AW835732 

AA769156 

N59537 N7B278 R83560 
AA226431 AA226569 AA488748 
R91883A1445591 



AFO9O4O5AF090407AFO904O6 
AF163305AF163307AF163303 



AF068654 AF068656 AF068655 



145 



WO 02/086443 PCT/US02/12476 

305614 AA782B65 



305637 


AA80S124 


305639 


AAB06138 


305650 


AA807709 


305690 


AA813477 


305728 


AA828209 


305759 


AAS35353 


305792 


AA845256 


307041 


A1144243 


307091 


AJ167439 


307181 


AI189251 


305901 


AA872958 


305910 


AAB759B1 


307415 


A1242118 


307426 


AI243364 


307517 


AJ275055 


307551 


AI281556 


307561 


A1282207 


307608 


AI290295 


307691 


A1318285 


307730 


AJ335092 


307760 


AI342387 


307764 


AJ 342731 


307796 


AJ 350556 


• 309045 


AI910902 


309051 


A191 1975 


307807 


A1351799 


307808 


A1351826 


307820 


AI355761 


307852 


At 365541 


309122 


AI928178 


309164 


A1937761 


309177 


AI951118 


307902 


AJ380462 


309299 


AW001478 


309303 


AWQ04823 


309476 


nil KMUU - 


309532 




309747 


AW264889 


309769 


AW27234G 


309799 




309866 


AW299916 


302679 


31 1 853.1 H65022 AA1 86889 


309923 


AW340684 


309928 


AW341416 


309931 


Ayi/341 683 


309933 


AW341936 


302705 


31765 1 U09Q6DU09Qfi1 


302789 


34161 1 AJ245067 AJ245070 


304006 


AW517947 


304024 


T03036 


304026 


T03160 


304028 


703266 


304046 


T54803 


304061 


T61521 


304063 


762536 


302802 


34487J Y0B250Y08245 


304114 


R78946 


304155 


H68696 


304203 


N56929 


304234 


W81608 


304348 


AA179868 


304430 


AA347682 


304456 


AA411240 


304521 


AA464716 


304526 


AA476427 


304607 


AA513322 


304735 


AA576453 


304760 


AA580401 


305015 


AA897116 


306063 


AA906316 


306065 


AA906725 


306104 


AA910956 


306109 


AA911861 


306242 


AA932805 


306288 


AA936900 


306396 


AA970223 


330568 


N0TJ=OUND..entrez U56244 


330599 


15323.-12 U90437 


331131 


genbanleR54797 R54797 


331203 


N0T_F0UND entrez T82310 


331531 


genbanleN51343 N51343 


331547 


457396J AA828597 N54811 


332074 


genbankJWCTOU AA599012 
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TABLE 8C shows the genomic postfion far those Pfaeys in Table 8A lacking unjgene Iff send accession numbefs. For each predicted exon, we have feted the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon ere ateo listed. 

Pkey. Unique number corresponding to an Eos probeset 

Ret Sequence source. The 7 digit numbers in this column are Gen bank Identifier (Gl) numbers, "Dunham L et aL' refers to the pubBcafion entitled The DNA 

sequence of human chromosome 22/ Dunham L et a!., Nature (1999) 402:489-495. 
Strand: Indicates QUA strand from which exons were predicted. 

NLposifion: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref Strand 


NLpxifon 


332792 


Dunham 1 at al 
LHiiUlciin, U 01.31. 


Plus 


73381-73768 


332816 


Dunham 1 oM al 
Uuiukun, u 61.31. 


Plus 


359844-360030 


33205 


Dunham 1 of fal 


Plus 


1923101-1923205 


332911 


DitnhfSm 1 a4 al 


Plus 


1951767-1961858 


332912 


Dunham 1 □! 

uuiuiam, 1. BU3i. 


Plus 


1962120-1962246 




Dunham 1 al al 


Plus 


2009620-2009738 


OJ £900 


uunnam, l ei.aj. 


Plus 


2510528-2510658 


oo^yoa 


uunnam, i. ei.ai. 


Plus 


2516145-2518213 


0001O0 


rVuf\Kim 1 a| 

uunnam, i. bl.23. 


Plus 


3369205-3363323 


oooioa 


Dunham, 1. eLal 


Plus 


3369495-3369571 




Dunham, 1. etal. 


" Plus 


397807O-39781 87 


OOOOOU 


Dunham, L eta). 


Phis 


49047754904846 


000007 


Dunham, t. etal 


Plus 


4910935-4910997 




Dunham, t etaL 


Plus 


5560510-5560564 


333524 


uunnam, L etal. 


Plus 


5612620-5612780 


oooood 


Dunham, t etal. 


Plus 


6234778-6234894 


000010 


uunnam, r. etaJ. 


Plus 


6562391-6562566 


JJoW/ 


uunnam, l. elaL 


Plus 


65205 84-6620903 


333628 


uunnam, l. et.at 


Plus 


6629004-6629233 


333550 


Dun nam, i. etaf. 


Plus 


6796852-6797128 


333578 


Dunham, 1. etal. 


Phis 


7Q68223-7068288 


333750 


uunnam, I. etal. 


Plus 


7608165-7608234 


000/00 


uunnam, I. etal 


Plus 


7692491-7692630 


333767 


Dunham, (. etal 


Plus 


7694407-7694623 


333768 


uunnam, I. etal. 


Plus 


7695440-7695597 


333769 


Dunham, 1. etal. 


Plus 


7696625-7696707 




n,,-|tAm 1 ftt ml 

uunnam, i. etaL 


Plus 


7706773-7706902 


333777 


uunnam, I. eLal 


Plus 


7746805-7746916 


00O040 


uunnam, i. eiai. 


Plus 


8008623-8008757 


000004 


Dunham, 1. eta). 


Plus 


8153960-8154161 


00000/ 


Dunham, I. etat 


Plus 


8154882-8155025 


OOOWI 


Dunham, I. etaL 


Plus 


8156437-8156709 


•3*2100. 0 

OOOOM 


uunnam, I. eiai. 


Plus 


8156825-8157001 


oojyflo 


HiinViim 1 e ( al 

uunnam, i. eiai. 


Plus 


8583497-B5B3627 


oojyM 


nunli il.il t n» nl 

uunnam, i. eiaL 


Plus 


5563186-6553335 


oo jy do 


Dunham, 1. etai 


Plus 


8655643-8655826 


OOJ3C30 


Dunham, 1. etal. 


Plus 


86B1004-86B1241 


OOAUOl 


Dunham, 1. etaL 


Plus 


9686941-9687077 


WW!W 


Dunham, t etat 


Plus 


9889953-9890105 


oot i to 


Dunham 1 ol a? 

uunnam, i- dlsl 


Plus 


10282459-10282597 


OOh ID 1 


Dunham 1 at at 


Plus 


10599033-10599180 


334219 


UUIUItUII, I. CLcU. 


Plus 


12716160-12716384 


334239 


Dunham 1 ni al 


Plus 


13056569-13056693 


334333 


Dunham 1 of at 
UUIUIdlll, 1. eucu. 


Plus 


13503544-13603657 


334378 


Dunham 1 Pi al 
uutinain, i. cuoj. 


Plus 


13907239-13907370 


334382 


Dunham 1 at al 


Plus 


13915866-13916036 


334562 


Dunham 1 at al 

UUIUlaJTl, la ClaOJ. 


Plus 


14987847-14987940 


334568 


Dunham 1 at al 


Plus 


15032740-15032817 


334616 


Dunham 1 ffi a) 


Pius 


15176123-15176470 


334633 


Dimham 1 at at 


Plus 


15333206-15333305 


334866 


Dunham 1 etat 


Plus 


18872214-18872317 


334891 


Dunham 1 si si 


Phis 


19299770-19299944 


334S34 


Dunham, t eta). 


Plus 


20103970-20104058 


335015 


Dunham. 1. eta). 


.Plus 


20682792-20682945 


335120 


Dunham, 1. etal 


Plus 


21436286-21436384 


335125 


Dunham, t etaL 


Phis 


21441390-21441471 


335179 


Dunham, 1. etal 


Plus 


21634405-21634526 


335188 


Dunham, 1. etal. 


Plus 


21669118^21669328 


335211 


Dunham, 1. etat. 


Plus 
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TABLE 9A: Potential Therapeutic Diagnostic and Prognostic targets for Therapy of Lung Cancer 

Table 9A shows about 1312 genes up^eguiated in lung tumors (inducing squamous ceD carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid 
tumors) relative to norma) body tissues. These genes were selected from about 59580 probesets on (he Ecis/Anymeuix Hu03 Genechip array. 

Table 9B show the accession numbers far those Pke/s lacking UrogenalD's for table 9A. For each probeset we have listed the gene cluster number from which the 
ofigonudeolides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist. Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
'Accession" column. 

Table 9C show the genomic positioning for those Pke/s lacking Unigene ID'S and accession numbers in table 9A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Average of lung tumors (including squamous ceD carcinomas, adenocarcinomas, small ceO carcinomas, granulomatous and carcinoid tumors) divided by the . 
average of normal lung samples 

R2: Average of non-malignant lung disease samples deluding bronchifis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples 
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Hs.129055 


ESTs 


1.00 


1.00 


409420 


Z15008 


Hs.54451 


lamtnirt, gamma 2 (ntcein (lOOkD), kalin) 


79.74 


96.00 


409430 


R21945 


Hs.346735 


splicing factor* arginine/serine-rich 5 


1.45 


2.10 


409446 


Ai561173 


Ks.67688 


ESTs 


1.00 


4.00 


409506 


NMJJ06153 


Hs.54589 


NCK adaptor protein 1 


- 197 


28.00 


409522 


AA075382 




gb:zm87b03.s1 Stratagene ovarian cancer 


15.98 


141.00 


409582 


AA401369 


Hs.190721 


ESTs 


1.00 


17.00 


409532 


W74001 


fte.55Z79 


serine (or cysteine} proteinase inhibito 


29Z12 


79.00 


409705 


M37762 


Hs.56023 


bran-derived neurotrophic factor 


1.00 


82.00 


409719 


AI769160 


Hs.10868t 


Homo sapiens brain tumor associated prat 


1.00 


1.00 


409731 


AA1 25985 


Hs.56145 


thyrnosin, beta, identified in neuroblast 


0.12 


18.12 


409744 


AW675258 


Hs.55265 


Homo sapiens mRNA; cDNA DKFZp586P2321 (f 


20.75 


51.00 


409757 


NM_001898 


Hs.123114 


cystatin SN 


22.46 


15.80 


409866 


AWH)2152 




gb:UW-BR0p^jr-f-11-04JlJl N1H_MGC_5 


1.00 


1.00 


409893 


AW247090 


Hs .57101 


minichromosome maintenance deficient (S. 


1.50 


1.09 


409902 


AI337658 


Hs.156351 


ESTs 


25.92 


50.00 


409935 


AW511413 


Hs.278025 


ESTs 


2.63 


Z11 


409956 


AW1 03364 


Hs.727 


inhibin, beta A (activin A, acfivin AB a 


2.17 


4.01 


409958 


NM 001523 


Hs.57697 


hyaturonan synthase 1 


0.91 


2.07 


410001 


ABO41036 


H5.57771 


ka/Mn11 


1.04 


228 


410032 


BE065985 




gb:RC3-BT031 9-1 20200-01 4-a09 BT0319 Homo 


1.00 


58.00 


410037 


AB020725 


Hs.58009 


WAA091 8 protein 


1.00 


34.00 


410044 


BE566742 


Hs.58169 


highly expressed in cancer, rich in teuc 


1.00 


1.00 


410048 


W76467 


Hs.58218 


proline oxidase homotog 


1.03 


1.44 


410076 


T05387 


Hs.7991 


ESTs 


1.12 


1.50 


410102 


AW248508 


Hs.279727 


Homo sapiens cDNA FU14035 fis, done HE 


9.89 


1.00 


410153 


BE311926 


Hs.1 5830 


hypothetical protein FU12691 


1.00 


1.00 


410166 


AK001376 


Hs.59346 


hypothetfcai protein FU10514 


1.00 


1.00 


410193 


AJ132592 


Hs.59757 


zinc finger protein 281 


42.01 


51.00 


410274 


AA381807 


Hs.61762 


hypoxia-inducible protein 2 


1.72 


1.32 


410309 


BE043077 


Hs.278153 


ESTs 


1.00 


2.00 


410340 


AW182833 


Hs.1 12168 


hypothetical protein FU13149 


32.08 


75.00 


410348 


AW1 82663 


Hs.95469 


ESTs 


1.00 


1.00 


410407 


X66839 


Hs.63287 


carbonic anhyoVase DC 


1.40 


1.11 


410418 


031382 


ffe.63325 


transmembrane protease, serine 4 


4.30 


Z03 


410438 


AB037756 


Hs.45207 


hypothetical protein KIAA1335 


1.00 


18.00 


410553 


AW015824 


Hs.255527 


hypothetical protein M6C14128 


1.34 


1.04 


410555 


W27235 


Hs.64311 


a disintegrin and metailoproteinase doma 


23.99 


1.41 


410561 


BE540255 


Hs.6994 


Homo sapiens cDNA: FU22044 fis, clone H 


10.04 


1.00 


410681 


AW246890 


Hs.65425 


calbindin1,(28kO) 


10.88 


18.92 


410781 


AI375672 


Hs.1 65028 


ESTs 


1.00 


57.00 


411027 


AF072099 


Hs.67846 


leukocyte (rnrnunogIobuErt*iike receptor, 


1.62 


3.78 


411074 


X60435 


Hs.68137 


adenylate cyclase activating polypeptide 


1.00 


1.15 


411089 


AA456454 




ceD division cycle 2-fike 1 (PITSLRE pr 


1.56 


1.58 


411152 


BE069199 




gb:Q\fl-BT0379-01 0300-105^03 BT0379 Homo 


1.00 


84.00 


411248 


AA551538 


Hs.334605 


Homo sapiens cDNA FU14408 fis, done HE 


1.82 


1.45 


411252 


AB018549 


Hs.69328 


MD-2 protein 


7.32 


1Z74 


411263 


BES7802 


Hs.69360 


Wnesin-iike 6 (mitotic centrcmere-assoc 


3.44 


2.55 


411365 


M75477 


Hs.289082 


GM2 ganglioside activator protein 


1.35 


2.02 


411402 


BE297855 


Hs.69855 


NRAS-related gene 


1.00 


46.00 


411573 


AB029000 


Hs.70823 


K1AA1077 protein 


11.40 


11.35 


411579 


AC00525B 


Hs.70830 


U6 snRNA-associated Sm-fike protein LSm7 


1.08 


1.90 


411617 


AA247994 


Hs.90063 


neurocalcin delta 


1.74 


2.57 


411732 


AA059325 


Hs.71642 


■guanine nucleotide binding protein (G pr 


1.02 


1.00 


411773 


NM.006799 


Hs.72026 


protease, serine, 21 (tesfisin) 


1.34 


2.19 


411789 


AF245505 


Hs.72157 


AdCcan 


. 2.19 


Z79 


411800 


N39342 


Hs.103042 


'micmtubufe^assoctated protein 1B 


23.34 


34.00 


411945 


AL033527 


Hs.92137 - 


v-myc avian rrryetocytomatosis viral oncog 


1.00 


8.00 


412115 


AK001763 


Hs.73239 


hypothetical protein FU10901 


2.07 


1.64 


412140' 


M219691 


Hs.73625 


RAB6 interacting, ktnesin-fike (rabkmes 


118.48 


9Z00 


412276 


BE262621 


Hs.73793 


macrophage migration Inhibitory factor { 


1.93 


1.49 


412464 


T78141 


Hs.22826 


ESTs. WeaWy similar to 155214 salivary 


1.16 


1.34 


412530 


AA766268 


Hs^66273 


hypcthefca! protein FU 13346 


41.52 


64.00 


412537 


AL03177B 




nuclear transcription factor Y, afpha 


17.90 


55.00 
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412859 


AW753865 


Hs.74376 ■ 


dfactomeolri related ER localized protel 


412719 


AVTO1651Q 


Hs.816 


ESTs 


412723 


AA648459 


Hs.335951 


hypdheSca! protein AF301222 


412811 


H06382 




ESTs 


412817 


AL037159 


Hs.74619 


proteasome (prosome, macropain) 26S subu 


412863 


AA121573 


Hs.59757 


zhc finger protein 281 


412924 


BE018422 


Hs.75258 


H2A histone family, member Y 


413004 


T35901 


Hs.75117 


interleuldn enhancer binding factor 2. 4 


413011 


AW068115 


Hs.821 


btgrycan 


413048 


M93221 


Hs.75182 


mannose receptor. C type 1 


413063 


AL035737 


Hs.75184 


chlfinase 3-fike 1 (carQage gJycoprotB 


413129 


AF292100 


Hs.1 04613 


RP42 homoJog 


413142 


M81740 


Hs,75212 


ornithine decarboxylase 1 


413223 


AI732182 


Hs.191866 


ESTs 


413248 


T64658 


Hs.21433 


hypothetical protein DKFZp547J035 


413273 


U75579 


Hs.75257 


stem-bop (histone) binding protein 


413278 


BE563085 


Ks.833 


interferon-slimuiated protein, 15 kDa 


413281 


AA861271 


Hs.222024 


transcription factor 8MAL2 


413364 


BE536218 


Hs.137516 


fidgefin-like 1 


413385 


M34455 


Hs.840 


ir^dearrune-pyrrole 2,3 dioxygenase 


413409 


AI638418 


Hs.1440 


DEAD/H (tep-Qu-AlanAsp/His) box potypep 


413453 


AA129640 


Hs.1 28065 


ESTs 


413527 


BE250788 


Hs.179882 


hypothebca) protein FU12443 


413554 


AA319146 


Hs.75426 


secretogranin 11 (chromogranin C) 


413573 


AI733859 


Hs.149089 


ESTs 


413582 


AW295647 


Hs.71331 


hypothetical protein MGC5350 


413597 


AW302885 


Hs.1 17183 


ESTs 


413690 


BE1 57489 




gb:RC1-HT0375-12020(W1 1-e06 HT0375 Homo 


413691 


AB023173 


Hs.75478 


ATPase, Class VI, type 11B 


413719 


BE439580 


Hs.75498 


small inducible cytokine subfamily A (Cy 


413753 


U17760 


Hs.75517 


taminln, beta 3 (nicein (125kD), kafirun 


413801 


M62246 


Hs^5406 


ESTs, Highly similar to unnamed protein 


413833 


Z15005 


Hs.75573 


centromere protein E (312kD) 


413882 


AA132973 


Hs.184492 


ESTs 


413926 


AA133338 


Hs.54310 


ESTs 


413943 


AW294416 


Hs.144687 


Homo sapiens cDKA FU12981 lis, done NT 


413995 


BE048146 


Hs.75671 


syntaxin 1A (brain) 


414035 


Y00630 


Hs.75716 


serine (or cysteine) proteinase inhibito 


414142 


AW368397 


Hs.334485 


Homo sapiens cDNA FU 14438 fis, cbne HE 


414180 


A! 863304 


Hs.120905 


Homo sapiens cDNA FU 11 448 6s, done HE 


414245 


BE148072 


Hs.75850 


WAS protein family, member 1 


414275 


AW970254 


Hs.689 


Charot-Leyden crystal protein 


414317 


BE263280 


HS.758B8 


phosphogluconate dehydrogenase 


414334 


AA824298 


Hs.21331 


hypothetical protein FU10036 


414341 


D80004 


Hs.75909 


KIAA01 82 protein 


414368 


W70171 


Hs.75939 


uridine monophosphate kinase 


414416 


AW409S35 


Hs.76084 


hypothetical protein MGC2721 


414430 


AI346201 


Hs.76118 


ubiquitin carboxyMerminal esterase L1 


414570 


Y0Q2B5 


Hs.76473 


insuGn-Jfte growth factor 2 receptor 


414618 


AI204600 


Hs.96978 


hypothetical protein MGC10764 


414675 


R79015 


Hs,296281 


Interieukin enhancer binding factor 1 


414683 


S78296 


Hs.76888 


hypothetical protein M6C12702 


414696 


AF002020 


Hs.76916 


Niemann- Pick disease, type CI 


414711 


AI310440 


Hs.288735 


Homo sapiens cDNA FU13522 fis, cbne PL 


414718 


H95348 


Hs.107987 


ESTs 


414732 


AW410976 


Hs.77152 


miruchromosome maintenance defictant (S. 


414747 


U30872 


Hs.77204 


centromere protein F (35Q/400kO, mitosin 


414761 


AU077228 


Hs.77256 


enhancer of zeste (DrosophOa) homotog 2 


414774 


X02419 


Hs.77274 


•plasminogen activator, urokinase 


414806 


D14694 


Hs.77329 


phosphatidyberine synthase t 


414809 


AI434699 


Hs.77356 


transferrin receptor (p90, CD71) 


414812 


X72755 


Hs.77367 


monokine induced by gamma Interferon 


414825 


X06370 


rfe.77432 


epidermal growth factor receptor (avian 


414839 


X63692 


Hs.77462 


DNA (cytosine-5-J-rnethyItransferase 1 


414883 


AA926960 




COC28 protein kinase 1 


414907 


X90725 


Hs.77597 


polo (Drosophia}-!ike kinase 


414914 


U49844 


Hs.77613 


ataxia telangiectasia and Rad3 related 


414945 


BE076358 


Hs.77667 


tymphocyte antigen 6 complex, locus E 


414972 


BE263782 


Hs.77695 


WAAOO03 gene product 


415014 


AW954064 


Hs^4951 


ESTs 


415091 


AL044872 


Hs.77910 


34iydraxy-3-m8thykjlutaryU^rizyrne A sy 


415138 


CI 8356 


Hs.295944 


tissue factor pathway inhibitor 2 


415227 


AW821113 


Hs.724Q2* 


ESTs 


415238 


R37780 


Hs.21422 


ESTs 


415263 


AA948033 


Hs.1 30853 


ESTs 


415295 


R41450 


Ks.6546 


ESTs 


415339 


NM.015156 


Ks.78398 


WAA0071 protein 


415669 


Nf4_005025 


Hs.7B589 


serine (or cysteine) proteinase inhibito 


415574 


BE394784 


Hs.78596 


proteasome (prosome, macropain) subunit, 


415709 


AA649B50 


Hs.278558 


ESTs 


415735 


AA704162 


Hs.120811 


ESTs, Weakly similar to 138022 hypolheB 


415799 


AA653718 


Hs.225841 


OKFZP434D193 protein 


415817 


U38967 


Hs.78867 


protein tyrosine phosphatase, receptor-t 


415857 


AA8S6115 


Hs.127797 


Homo sapiens cDNA FU 1 1381 fis. clone HE 


415989 


A1267700 




' ESTs 



14.65 


47.00 


382.46 


12800 


54.90 


1.00 


1.00 


11.00 


1.63 


1.42 


17.63 


56.00 


1.00 


22.00 


2.19 


2.05 


1.22 


1.88 


0.30 


6.23 


3.43 


8J1 


4.67 


4.77 


1.92 


2.59* 


5.73 


27.00 


0.99 


1.06 


1.00 


18.00 


1.10 


1.09 


95.94 


69.00 


1.00 


1.00 


0.95 


2.09 


1.00 


1,00 


1.00 


31.00 


- 1.08 


1.46 


79.15 


114.00 


1.00 


1.00 


8.60 


10.00 


1.00 


1.00 


1.00 


1.00 


3.16 


232 


288 


9.52 


144.10 


108.00 


1.00 


17.00 


1.00 


1.00 


64.24 


14800 


1.00 


67.00 


43.42 


4Z00 


1.23 


1.11 


2.02 


251 


1.00 


10200 


6.92 


77.00 


1.00- 


1.00 


1.00 


59.00 


1.52 


1.73 


1.78 


1,72 


33,90 


151.00 


171.60 


97.00 


2.32 


1.85 


226.15 


66.00 


1.64 


1.98 


1.87 


72.00 


1.51 


1.39 


43.61 


64.00 


28.63 


71.00 


14.66 


4200 


1.00 


5.00 


1.64 


1.44 


65.01 


74.00 


130.35 


121.00 


224 


219 


1.63 


1.53 


1.97 


260 


3.48 


10.60 


103.22 


143.00 


1.80 


1.69 


14.29 


10.06 


1.95 


220 


3.00 


290 


1.02 


1.21 


1.00 


1.00 


1.42 


284 


1.00 


30.00 


34.72 


107,00 


1.87 


49.00 


1.00 


1.00 


4 Art 
1.00 


4 rin 


1.00 


1.00 


51.18 


166.00 


30.84 


63.00 


1.48 


1.39 


1.00 


1.00 


1.00 


72.00 


£23 


31.00 


24.30 


1.00 


3251 


35.00 


78.89 


1.00 
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416018 


AW1 38239 


Hs.78977 


pro protein ccnvertase subfiHsui/kexin t 


1.00 


1.00 


416065 


BE267931 


Hs.78996 


proSJerafing ceB nuctear an^geji 


3.35 


232 


416111 


AA033813 


Ks.79018 


chromatin assembly factor 1, suburut A ( 


39.03 


100 


416177 


AA174Q69 


Hs.187607 


ESTs 


1.00 - 


9.00 


41617B 


AI808527 


Hs.192822 


serologically defined breast cancer anti 


183 


176 


416208 


AW291168 


Hs.41235 


ESTs, Weakly similar to MUC2_HUMAN MUCIN 


167 


1.00 


416209 


AA236776 


Hs.79078 


MAD2 (mitotic arrest deficient, yeast, h 


9.70 


1.00 


416239 


AL038450 


Hs.48948 


ESTs 


83.87 


129.00 


416250 


AA5813B6 


Hs.73452 


hypothetical protein MGC10791 


1.96 


212 


416322 


BE019494 


Hs.79217 


py^ne-5-carboxylate reductase 1 


203 


1.73 


416423 


H54375 


Hsi58921 


ESTs 


1.00 


-89.00 


41644B 


L13210 


Hs.79339 


. lectin, gaactosifle-Dinfling, soJUDie, o 


1.28 


1.54 


416498 


U33632 


Hs.79351 


potassium channel, subfamily K, member 1 


27.29 


67.00 


416658 


U03272 


Hs.79432 


fibrillin 2 (congenital contracture! ara 


53.29 


51.00 


416661 


AA634543 


Hs.79440 


IGF-H mRNA-binding protein 3 


9.96 


5.00 


416722 


AA354604 


Hs.122546 


- hypothetical protein FU2301 7 
pinv2 oncogene 


168 


3100 


416B19 


U77735 


Hs.80205 


1.59 


1.84 


416936 


N21352 


Hs.42987 . 


ESTs. WeaWy similar to S21 348 probable 


1.00 


1.00 


417034 


NM_005183 


Hs.60962 


neurotensin 


1.00 


1.00 


417061 


A1675944 


Hs.188691 


Homo sapiens cDNA FU 12033 fis, done HE 


32.95 


156.00 


417079 


U65590 


Hs.81134 


interieukin 1 receptor antagonist 


191 


4.93 


417218 


AA129547 


Hs.285754 


' met proto-cncogene (hepatocyte growth fa 


1.00 


51.00 


417233 


W25005 


Hs.24395 


.small inducible cytokine subfamily B (Cy 


3.38 


205 


417308 


H60720 


Hs.81892 


K1AA0101 gene product 


82.94 


25.36 


417315 


A1080042 


Hs.180450 


ribosomal protein S24 


106.61 


121.00 


417324 


AW265494 




ESTs 


1.20 


1.28 


417366 


BE 185289 


Hs.1076 


small protine-rich protein 1B (comifin) 


8.97 


127 


417389 


BE260964 


Hs.82045 


midkine (neurite growth-promoting factor 


Z59 


1.82 


417428 


N87579 


Hs.278871 


gb:LL2030F Human fetal heart lambda ZAP 


1.00 


5200 


417433 


BE270265 


Hs.82128 


5T4 oncofetal trophoblast glycoprotein 


304.75 


173.00 


417466 


A1681547 


Hs.59457 


hypothetical protein FU221 27 


1.24 


1.34 


417512 


AI979168 


Hs.344098 


glycoprotein (transmembrane) nmb 


2.14 


5.50 


417515 


L24203 


Hs.82237 


ataxia-telangiectasia group D-assodated 


2.66 


1.68 


417542 


J04129 


Hs.82269 


progestagan^associated endometrial prole 


1.28 


1.35 


417576 


AA339449 


Hs.82285 


chcsphcribosylglycrnarriide rorrnyl transfer 


42.76 


51.00 


417715 


AW969587 


Hs.86356 


ESTs 


6.35 


275 


417720 


AA205625 


Hs.208067 


ESTs 


11131 


56.00 


417791 


AW965339 


Hs.1 11471 


ESTs 


39.98 


16.00 


417830 


AWKW786 


Hs.122579 


hypothetical protein FU 10461 


261 


31.00 


417866 


AW067903 


Hs.82772 


coHagen, type XI, afcha 1 


235 


244 


417900 


BE250127 


Hs.82906 


CDC20 (cell division cycle 20, S. cerevi 


1.52 


1.11 


417933 


X02308 


Hs.82952 


thymidybte synthetase 


4.74 


255 


417944 


AU077196 


Hs.82985 


collagen, type V, alpha 2 


161 


5.21 


417975 


AA641836 


Hs.30085 


hypothetical protein FU23186 


12.49 


38.00 


417991 


AA731452 


Hs.190008 


ESTs 


1.00 


26.00 


418004 


U37519 


Hs.87539 


aldehyde dehydrogenase 3 family, member 


102 


212 


418007 


M13509 


Hs.83169 


. matrix metaOoproteinase 1 (interstitial 


187.59 


1.00 


418054 


NM.002318 


Hs.83354 


lysyloxidase-tike2 


2.85 


263 


418057 


NM.012151 


Hs.83363 


coagulation factor VJII-assodated (intr 


1.54 


1.69 


418113 


AJ272141 


Hs.83484 


SRY (sex determining region YJ-box 4 


6.82 


5.22 


418140 


BE613836 . 


Hs.83551 


microfibnllar-assocfaled protein 2 


1.26 


1.46 


418203 


X54942 


Hs.B375B 


CDC28 protein kinase 2 


134.19 


144.00 


418207 


C14685 


Hs.34772 


ESTs 


1.00 


1.00 


418216 


AA662240 


Hs.283099 


AF15ql4 protein 


64.66 


61.00 


418236 


AW994005 


Hs.337534 


ESTs 


18.53 


147.00 


418249 


H89226 


Hs.34892 


K1AA1323 protein 


30.53 


10100 


418281 


U09550 


Hs.1154 


oviductaJ glycoprotein 1, 120kO (mucin 9 


1.00 


100 


4182B3 


S79895 


Hs.83942 


cathepsin K (pycnodysostosis) 


. 3.96 


5.16 


418300 


AM33074 


Hs.86682 


Homo sapiens cDNA' FLJ21 578 fis, done C 


118 


291 


418322 


AA284166 


Hs.84113 


cycfin-dependent kinase inhibitor 3 (COK 


11.96 


6.68 


418327 


U70370 


Hs.64136 


paired-tike homeodomain transcription fa 


9.23 


222 


418345 


AJ001696 


Hs.241407 


serine (or cysteine) proteinase inhibito 


1.00 


1.00 


418379 


AA218940 


Hs.137516 


fidgetin-like 1 


21.68 


44.00 


418397 


NM.001269 


Hs.84746 


chromosome condensation 1 


1.00 


100 


418403 


D86978 


Hs.84790 


KIAAQ225 protein 


16.91 


18.98 


418462 


BE001598 


Hs.85265 


integral, beta 4 


1.56 


1.16 


418478 


U38945 


Hs.1 174 


cydin-dependent kinase inhibitor 2A (me 


3.22 


238 


41S506 


AA084248 


Hs.85339 . 


G protein-coupled receptor 39 


266 


222 


418526 


BEO1902O 


Hs.85838 


solute earner family 16 (monocarboxyiic 


204 


221 


418538 


BE244323 


Hs.85951 


exportin, (RNA (nudear export receptor 


1.33 


37.00 


418543 


NM.O05329 


Hs.85962 


hyaluronan synthase 3 


1.04 


1.23 


418574 


N28754 




M-phase phosphoprotein 9 


48.60 


85.00 


416592 


XS9225 


Hs.284153 


Fanconi anemia, complementation group A 


1124 


26.00 


418641 


BE243136 


Hs.86947 


a disintegrin and metaOoproteinase doma 


1.19 


1.41 


418661 


NM.001949 


Hs.1189 


E2F transcription factor 3 


29.05 


43.00 


418663 


AK001100 


Hs.41690 


desmoconin3 


11217 


19.00 


418678 


NM_001327 


Hs.87225 


cancer/tesfJs antigen 


1.18 


1.10 


418686 


Z36830 


Hs.87268 ' 


annexinAS 


1.54 


1.98 


418689 


AI360883 


Hs.274448 


hypothetical protein FU 1 1029 


1.19 


1.04 


418712 


Z42183 




gb:HSC0BF041 normalized infant bran cDN 


1.00 


1200 


418727 


AA227609 


Hs.94834 . 


ESTs 


1.00 


49.00 


418738 


AW388S33 


Hs.6682 


solute carrier farrfly 7, (caDonic amino 


49.65 


1.00 


418819 


AA228776 


Hs.191721 


ESTs 


1.00 


14100 


418830 


BE513731. 


Hs.88959 


hypothetical protein MGC4816 


20.97 


23.00 


418882 


NM004996 


Hs.89433 * 


ATP-biruDng cassette, sub-family C (CFTR 


57.09 


35.00 



156 



WO 02/086443 



PCT/US02/12476 



10 
.15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



418971 

416973 

419078 

419079 

419080 

419088 

419092 

419121 

419171 

419183 

419216 

419288 

419335 

419354 

419359 

419423 

419443 

419452 

419474 

419485 

419488 

419502 

419539 

419556 

419569 

419594 

419703 

419721 

419729 

419741 

419745 

419752 

419839 

419936 

419937 

419983 

420005 

420047 

420058 

420162 

420251 

420259 

420281 

420309 

420332 

420380 

420462 

420520 

420552 



AA350392 

AA23305S 

M93119 

AW014836 

AW150S35 

AI53B323' 

J05531 

AA374372 

NMJW2846 

U60669 

AU076718 

AA256106 

AW960146 



420610 

420689 

420721 

420759 

420783 

420900 

420931 

421002 

421027 

421037 ' 

421041 

421073 

421110 

421133 

421150 

421155 • 

421307 

421316 . 

421379 

421451 

421474 

421506 

421508 

421515 

421524 

421526 

421552 

421574 

421582 

421633 

421659 

421677 

421753 

421773 

421777 



AL043202 

D26488 

D62703 

U33535 

AW968619 

AA469023 

AA316241 

AU076704 

AF070590 

U29615 

AI971651 

AA013051 

AT793257 

NM.001650. 

AA586442 

NM.007019 

AF042001 

AA249573 

U24577 

AI792788 

AB040959 

W55956 

AW271106 

AI47B658 

AK001423 

BE378432 

AW374968 

AFW48S4 

AI623693 

AW043637 

NM.001756 

AA640891 

AF050147 

AK001978 

AK000492 

AW207748 

AI683183 

H79979 

AA927802 

T11832 



AL045633 

AF044197 

AF 11 6030 

AA761198 

A1684808 

N36914 

NM_004689 

AJ250717 

AA401369 

AI913562 

H87879 

BE539976 

AA287203 

Y15221 

AA291377 

U76362 

BE302796 

NM_004833 

Y11339 

AA312082 

AL080121 

AF026692 

AJ000152 

AI910275. • 

AF121860 

NMJ14459 

H64092 

BE314828 

W69233 



Hs.87113 
Hs.191518 
HS.895B4 
Hs.18844 
Hs.18878 
Hs.52620 
Hs.89603 
Hs.89626 
Hs.89655 
Hs.89663 
Hs.164021 
Hs.87507 
Hs.284137 
Hs.1252 
Hs.90073 
Hs.90315 

Hs.90572 
Hs.155849 
Ks.99807 
Hs.90691 

Hs.90869 

Hs.91093 

Hs.91143 

Hs.91417 

Hs.128151 

Hs.288650 

Hs.21411 

Hs.93002 

Hs.93005 

Ks.152618 

Hs.93304 

Ks.93836 

Hs.94030 

Hs.133294 

Hs.94631 

Hs.94694 

Hs.95577 

Hs.348112 

Hs.96253 

Hs.323494 

Hs.21766 

Hs.1305 

Hs.102405 

Hs.97932 

Hs.98510 

Hs.98806 

Hs.59115 

Hs.99348 

Hs.88678 

Hs.159471 

Hs.127797 

Hs.99923 

Hs.44269 

Hs.100431 

Hs. 100932 

Hs.55254 

Hs.197653 

Hs.14691 

Hs.101448 

Hs.1355 

Hs.190721 

Hs.189902 

Hs.102267 

Hs.103305 

Hs.324728 

Hs.103982 

Hs.50831 

Hs.104637 

Hs.105097 

Hs.105115 

Hs.105352 

Hs.105445 

Hs.105460 

Hs.105700 

Hs.105924 

Hs.106260 
Hs.106511 
Hs.38282 
Hs.107911 
Hs.112457 
Hs.108196 



ESTs 
ESTs 

insufinom 3-355 ocialed 1 
ESTs 

hypothetical protein FU21620 
integrin, beta 8 
mucin 1, transmembrane 
parathyroid hormone-fHo hormona 
protein tyrosine phosphatase, receptor t 
cytochrome P450, subfamily XXIV {vitamin 
smafl Inducible cytokine subfamily B (Cy 
ESTs 

hypothefica] protein FU12888 
apoDpoprotein H {beta-2-9lycoprotein I) 
chromosome segregation 1 (yeast homolog) 
KIAA0007 protein 

gb:HUM316G10B Ctontech human aorta pofyA 

PTK7 proteJn tyrosine kinase 7 

ESTs 

ESTs, Weakly similar to unnamed protein 
nudeophosminmudecpiasrnin 3 
fibrinogen, A alpha polypeptide 
Homo sapiens clones 24622 and 24623 mRNA 
chitinase 1 (chitotriosidase) 
jagged 1 (AiagMe syndrome) 
topoisomerase (DNA) D binding protein 
ESTs 

aquaporM 

gb.no53a03.s1 NCI_CGAP_SS1 Homo sapiens 
ubiquitin carrier protein E2-C 
slug (chicken homolog), zinc finger prot 
ESTs, Moderately similar to ZN91JMMAN Z 
phosphofipase A2, group VII (platelet-ac 
gb:o!91d05.y5 NCLCGAP_Wd5 Homo sapiens 
DKFZP434N014 protein 

Homo sapiens mRNA; cDNA OKF2p586E1624 (f 
ESTs 

brnfeklin A-inhlbited guanine nucleotide 

Homo sapiens cDNA FU 10561 lis, done NT 

cyclin-dependent kinase 4 

Human DNA sequence from done RP5-1 103G7 

calcium channel, voltage-dependent, P/Q 

ESTs 

ESTs, Weakly similar to ALU5.HUMAN ALU S 
serine (or cysteine) proteinase inhibito 
ESTs 

chondromoduiin I precursor 
similar to rabl 1-binding protein 
hypotnetfcaJ protein 
ESTs 

distal-lass homeo box 5 
ESTs 

ZAP3 protein 

Homo sapiens cDNA FLJ 11381 fis, clone HE 
lectin, galactoside-btnding, soluble, 7 
ESTs 

small Inducible cytokine B subfamily (Cy 

transcription factor 17 

ESTs 

ESTs 

ESTs, Moderately simflar to I38022 hypot 

metastasis associated 1 

cathepsinE 

ESTs 

ESTs 



Homo sapiens mRNA; cDNA DKFZp434B0425 (f 
SMA5 

small indudbfe cytokine subfamily 8 (Cy 
ESTs 

solute carrier family 1 (glutamate trans 
thymidine kinase 1, soluble 
absent in melanoma 2 
GalNAc alpha-2, 6-siaiyItransferase i, I 
GDNF family receptor alpha 1 
DKFZP564O0823 protein 
secreted frizzted-related protein 4 
defensin, beta 2 , 

trefofl factor 1 (breast cancer, estroge 
sorting nexin 10 
protocadherin 17 
ESTs 

ATP-WncTng cassette, sub-family B (MDW 
ESTs 

HSPC037 protein 



i nn 

l.UU 


li-UU 


A OQ 


9ft nn 


1 m 

l.UU 


mno 


1 no 

I.U3 




2.06 


1.68 


15.60 


51.00 


1.11 


1.83 


1.00 


too 


1.10 


1J4 


1.00 


1.00 


3.18 


2.43 


1.00 


34.00 


1.00 


8.00 


22.63 


54.00 


2.50 


1.98 


1.00 


7.00 


1.00 


12.00 


1 RA 
I.D9 


1 RA 


1 0.0 J 


Bonn 

Dl.UU 


A 77 


9 9R 


0.00 


3.63 


IJ.U3 


1 10. uu 


i A fin 

/9.0U 


117 nn 
111 .uu 


1 A7 

1.9/ 


A Qfl 


l.UU 


a nn 

«l.UU 


qa in 

S9.0U 


qa nn 

99.UU 


10.4.9 


50.00 


1 nn 

l.UU 


1Q1 nn 
la l.UU 


I.UU 


co nn 


o no 


1 nn 

l.UO 


i nn 

l.UU 


i nn 

l.UU 


£3.0/ 


77 nn 


cn oo 


oia nn 


i nn 

l.UU 


1 nn 

l.UU 


I.W 


O A7 


10. U 


qa nn 


1 1C 

Alt) 


1.90 


il AC 


io nn 


l.UU 


11 /.uu 


1 Al 

1.90 


1 91 

1.4 1 


£.03 


1 77. 


n 77 

U./r 


1 K 
1.10 


9o.u9 


cA nn 

09. UU 


AQ 77 


n nn 

OI.UU 


U.UD 


9 no 


n qq 


9 7 A 
t./9 


1 nn 

l.UU 


1 nn 

l.UU 


49.74 


lOu.UU 


OA fiiv 


aa nn 

OO. UU 


1 nn 


17 nn 

l/.UU 


1 nn 

l.UU 


11 nn 
10. uu 


cn no 
ou.ua 


oc nn 

sO.UU 


1 nn 


11 nn 

Ol.UU 


1 nn 

l.UU 


ah nn 

90.UU 


0.U9 


1 9R 


9 OA 
L.CH 


7 nn 
/.uu 


1 nn 

l.UU 


8.00 


1 nn 

l.UU 


97 nn 


9R7 
£.01 


ir nn 
oo.uu 


1 nn 

l.UU 


ar nn 

90. UU 


i on 

l.UU 


90.UU 


(.09 


1 Afi 


119.47 


427.00 


1.10 


17.00 


1.45 


1.63 


1.00 


15.00 


1.37 


1.10 


1.00 


21.00 


1.92 


394 


5.89 


14.00 


1.46 


1.76 


1.56 


1.08 


5.11 


5.23 


1.00 


3.00 


2.63 


10.58 


1.46 


1.88 


30.21 


50.32 


1.67 


1.74 


1.23 


1.00 


1.00 


116.00 


0,05 


6.33 


Ml 


1.42 


1.41 


1.20 


1.12 


1.14 


1.97 


1.29 



157 



WO 02/086443 



PCT/US02/12476 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



421800 


AA293151 


Hs.7??969 


421817 


AF146074 


Hs.108660 


421896 


K62293 


Hs.45107 


421928 


AF013758 


Hs.1 09543 


421931 


NMJMKJ814 


Ks.1440 


421948 


142583 


Hs.334309 


421975 


AW981017 


Hs.6459 


422026 


U30736 


Hs. 11 0826 


422094 


AF 129535 


Hs.272027 


422095 


AI868872 


Hs-282804 


422109 


S73265 


Hs.1473 


422128 


AW381145 




422129 


AU076635 


Hs.1478 


422134 


AW179019 


Hs.112110 


42215B 


L10343 


Hs.112341 


422168 


"AA58SS94 


Hs. 11 2408 


422278 


AF072873 


Hs.1 14218 


422282 


AF019225 


Hs.114309 


422283 


AW411307 


Hs.114311 


422310 


AA316622 


Hs.98370 


422311 


AF073515 


Hs. 114943 


422330 


D30783 


Hs.1 15263 


422364 


AF067800 


Hs.1 1551 5 


422406 


AF025441 


Hs.1 16205 


422424 


Al 185431 


H&296S38 


422440 


NM.004812 


Hs.1 16724 


422487- 


AJQ10901 


Hs.198267' 


422511 


AU076442 


Hs.1 17938 


422515 


AW50047O 


Hs.1 17950 


422658 


Al 870435 


Hs.1569 


422737 


M26939 


Hs.1 19571 


422756 


AA4417B7 


Hs.1 19689 


422765 


AW409701 


Hs.1 578 


422809 


AK001379 


Hs.121028 


422867 


L32137 


Hs.1584 


422938 


NM_001809 


Hs.1 594 


422956 


8E545072 


Hs.122579 


422S50 


AW890487 


Hs.63984 


422963 


AA401359 


Hs.1 90721 


422976 


AU076657 


Hs.1 600 


422981 


AF026445 


Hs.122752 


422986 


AA319777 


Hs.221974 


423034 


AL1 19930 




423049 


X59373 


Hs.188023 


423081 


AF262992 


Hs.123159 


423184 


NIYL004428 


Hs.1 624 


423217 


NM_000094 


Hs.1 640 


42324B 


AA380177 


Hs.125B45 


423309 


BE006775 


Hs.1 26782 


423361 


AW1 70055 


Hs.47628 


423453 


AW450737 


Hs.128791 


423511 


AF036329 


Hs.129715 


423516 


AB007933 


Hs.1 29729 


423551 


AA327598 


Hs.233785 


423554 


M90516 


Hs.1674 


423575 


C18663 


Hs.163443 


423624 


Al 807408 


Hs.166368 


423634 


AW959908 


Hs.1690 


423642 


AW452650 


Hs.157148 


423662 


AA642452 


Hs.1 30881 


423673 


BE0O3054 


Hs.1 695 


423898 


AA329796 


Hs.1098 • 


423725 


AJ403108 


Hs.132127 


423761 


NM.006194 


Hs.132576 


423787 


AJ295745 


Hs.236204 


423816 


AF151064 




423826 


U20325 


Hs.1707 


423649 


AL1 57425 


Hs.133315 


423887 


AL080207 


Hs.134585 


423934 


UB9995 


Hs.159234 


423954 


AW753164 


Hs^88604 


423961 


D13666 


Hs.136348 


424012 


AW368377 


Hs.137569 


424016 


AW163729 


Hs.6140 


424028 


AF055084 


Hs.153692 


424046 


AF027B66 


Hs.138202 


424086 


AI351010 


Hs.102267 


424098 


AF077374 


Hs.139322 


424120 


T80579 


Hs.290270 


424165 


AW582904 


Hs.142255 


424200 


AA337221 




424279 


L29306 


Hs.171814 


424306 


AW975531 


Hs.154443 


424326 


NM.014479 


Hs.145296 


424340 


AA339036 


Hs.7033 



ESTs 

ATP-trinding cassette. sub-tam3y C (CFTR 
ESTs 

. polyadenylala binding proteWnteractin 
gammaamlnobutyric acid (GABA) A recepto 
keratin 6A 

hypothetical protein FU11856 
trinucleotide repeat containing 9 
F-box only protein 5 
hypothetical protein FU22704 
gastrin-reieasing peptide 
gb:QV0^T0O3^1040{M82-aO7 OT0033 Homo 
serine (or cysteine) proteinase inhifcito 
mitochondria) rfoosomal protein L42 
protease Inhibitor 3, skin-derived (SKAL 
S100 calchim-bindinD protein A7 (psorias 
frizzled (Drosophiia) homolog 6 
apofipoprotein L 

COC45 (cell division cycle 45, S.cerevis 
cytochrome P450, subfamily IIS. pdypept 
cytokine receptor-Dice factor 1 
epiremiHn 

fj-type (calcium dependent carbohydrate- 

Opa-inleracting protein 5 

prostate differentiation factor 

aWo-kefo reductase family 1, member B10 

mucin 4, tracheobronchial 

coDagen, type XVII, alpha 1 

multifunctional polypeptide Mar to S 

UM homeobox protein 2 

collagen, type III, alpha 1 (Ehlers-Oanl 

glycoprotein hormones, alpha polypeptide 

baculoviral 1AP repeakjontaining 5 (sur 

hypothetical protein FU 10549 

cartilage oBgomeric matrix protein {pse 

centromere protein A (17kO) 

EOT protein (Epilhefial cell transformi 

cadherin 13, H-cadherin (heart) 

ESTs 

chaperonin containing TCP1, subunll 5 (e 
TATA box binding protein (TBP)-associate 
ESTs 

gb:DKFZp761A092j1 761 (synonym: hamy2) 
ESTs, Moderately similar to HXDA.HUMAN H 
sperm associated antigen 4 
ephrin-A1 

collagen, type VII, alpha 1 (epidermolys 
: nl)ulose-5-phosphate-3«plmerase 
sushi-repeat protein 
ESTs 

CGH)9 protein 

gonadotropin-releaslng hormone 2 
ligand of neuronal nitric oxide synthase 
ESTs 

gtutamine-fructose-6-phosphate transamin 
Homo sapiens cDNA FU11576 &, clone HE 
ESTs 

heparin-blndlng growth factor binding pr 
hypothetical protein MGC13204 
B-ceO CU/lymphcma 1 1A (zinc finger pro 
matrix mataltoproteinase 12 (macrophage 
DKFZp434J1813 protein 
hypothetical protein LOC57822 
paired box gene 9 
■ nuclear pore complex protein 
hypothetical protein 

cocaine- and amphetamine-regulated trans 
Homo sapiens mRNA; cONA DKFZp761J1324 (f 
DKFZP434G232 protein 
fbrkhead box E1 (thyroid transcription f 
KIAA1632 protein 

osteoblast specific factor 2 (fasctdln 
tumor protein 63 kDa wrltti strong homolog 
hypothetical protein MGC15730 
Homo sapiens cDNA FU14354 fis, clone Y7 
serine (or cysteine) proteinase inhibito 
lysyl oxidase 

small profine-rich protein 3 
ESTs 

islet amyloid polypeptide 

gb:EST41944 Endometrial tumor Homo sapie 

tryptophan hydroxylase (tryptophan 5-mon 

mWcfiromosome maintenance deficient (S. 

distntegrin protease 

ESTs 



1.03 


1.30 


1.88 


1.59 


11.84 


22,80 


45.89 


90.00 


1.13 


1.49 


51.83 


2025 


1.17 


1.15 


1.00 


52.00 


67.61 


6100 


4.37 


2.34 


418 


95.50 


40.89 


71.00 


1.13 


1.38 


41.59 


96.00 


2.37 


1.10 


129 


1.68 


4.93 


5.73 


1.49 


1.71 


25.99 


10.91 


1.54 


1.41 


1.15 


1.78 


1.00 


112.00 


9.39 


60.00 


18.33 


53.00 


1.71 


121 


47.53 


32.00 


73.68 


35.54 


173.97 


26.00 


4.68 


2.92 


1.00 


1.00 


189 


4.55 


1.05 


1.46 


3.88 


1.53 


99.56 


53.00 


1.69 


117 


70.46 


61.00 


77.74 


100 


6.88 


8.55 


171.41 


17.00 


Z)2 


1.62 


10.49 


35.00 


12.40 


32.47 


16.41 


60.00 


1.00 


1.00 


1.82 


2.96 


1.14 


1.53 


2.14 


1.69 


7.18 


14.00 


21.90 


64.00 


1.00 


1.00 


55.52 


66.00 


0.88 


1.17 


1.76 


5.40 


3.54 


4.33 


1.00 


50.00 


38.88 


70.00 


1.00 


67.00 


76.02 


1.00 


19.14 


58.00 


3.61 


1157 


240.73 


40.00 


1.00 


59.00 


4.20 


1.00 


1.00 


1.00 


7.18 


6.64 


1.00 


44.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


31.33 


31.00 


5.81 


10.67 


3.55 


130 


233.42 


68.00 


0,93 


1.01 


21.30 


52.00 


1.00 


1.00 


21.91 


70.00 


137.82 


54.00 


1.00 


1.00 


1.00 


34.00 


13.08 


48.00 


1.00 


1.00 


164.58 


87.00 


5172 


302.00 


0.88 


1.15 



158 



WO 02/086443 



PCT/US02/12476 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



424351 


BE622117 


Hs.1 45567 


424354 


AW383226 


Hs.201169 


424381 


AA285249 


Hs.146329 


424411 


NM.Q05209 


Hs.146549 


424420 


BE614743 


Hs.1 45583 


424441 


X14850 


Hs.1 47097 


424502 


AF242388 


Hs.1 49585 


424503 


X06256 


Hs. 149509 


424513 


BE385854 


Hs. 149894 


424539 


L02911 


It- i pA J ft 1 

Hs. 150402 


424568 


AF005418 


Hs.1 50595 


424502 


AK002055 


Hs.1 51 045 


424629 


"M90556 


Hs.1 51393 


424645 


NM_014682 


Hs.1 51449 


424587 


J05O70 


Hs.1 51738 


424717 


AW992292 


Hs.1 52213 


424834 


AK001432 


Hs.1 53408 




D79987 


Hs.153479 


424867 


AJ 024860 


Hs.1 53591 


424505 


NM_002497 


Hs.1 53704 


424979 


D87989 


Hs.1 54073 


424999 


AW953120 




425048 


H05468 


Hs, 164502 


425057 


AA826434 


Hs.1 619 


425081 


X74794 


Hs.1 54443 


425118 


AU076611 


Hs.1 54672 


425159 


NR.004341 


Hs.154868 


425202 


AW962282 


Hs.1 52049 


425234 


AW1 52225 


Hs.1 65909 


425236 


AW067800 


Hs.1 55223 


425245 


AI751768 


Hs.1 55314 


425247 


NMJJ05940 


Hs.155324 


425266 


J00077 


Hs.155421 


425274 


BE281191 


Hs. 155462 


425322 


U63630 


Hs.155637 


425349 


AA425234 


Hs.79886 


425371 


D49441 


Hs.1 55981 


425397 


J04088 


Hs. 156346 


425420 


BE536911 


Hs.234545 


425424 


NM-004954 


Hs.157199 


425483 


AF231022 


Hs.1 581 59 


425566 


AW1 62943 


Hs.250618 


425580 


L11144 


Hs.1 907 


425650 


NR.001944 


Hs.1 925 


425692 


D90041 


Hs.1 55956 


425695 


NMJJ05401 


Hs.159238 


425734 


AFO56209 


Hs.159396 


425776 


U25128 


Hs.1 59499 


425810 


AJ923627 


Hs.31903 


425811 


AL039104 


Hs.159557 


425B49 


AI077288 


Hs^96323 


425B52 


AK001504 


Hs.1 59651 


426067 


AA4Q1359 


Hs.1 90721 


42608B 


AF038007 


Hs.1 661 95 


426215 


AW067800 


Hs.155223 


426227 


U67058 


Hs.1 54299 


426269 


H15302 


Hs.1 68950 


426283 


NM-003937 


Hs.1 691 39 


426329 


AL389951 


Hs.271623 


426427 


M86699 


Hs.169840 


426432 


AF001601 


Hs.1 69857 


426440 


BE382756 


Hs.1 69902 


426459 


AF151B12 


Hs.1 69992 


426471 


M22440 


Hs.170009 


426496 


D31765 


Hs.1 701 14 


426501 


AA401369 


Hs.1 90721 


426514 


BE616633 


Hs.1 701 95 


426536 


A1949749 


Hs.44441 


426572 


AB037783 


Hs.170523 


426682 


AV660038 


Hs.2056 


426691 


kill ArtnnAi 

NMJJQ62Q1 


HS.171834 


426745 


J03626 


Hs.2057 


426752 


X63490 


Hs.172004 


426784 


U03749 


Hs.172216 


426807 


AA385315 


Hs.l5borJ2 




AriUoOoo 


Ljc 179R19 

ns. w coio 


426814 


AF036943 


Hs.172619 


426831 


BE296216 


Hs.172673 


426897 


AA401369 


Hs.190721 


426925 


NMJJ01196 


Hs.315689 


426935 


NM.000088 


Hs.172928 


426964 


AA393739 


Hs.287416 


426966 


A1493134 




426991 


AK00153S 




427099 


AB032953 


Hs.173560 



hypothetical protein 

ESTs, WeaMy similar to G01763 atrophb- 

protein kinase Chk2 

crystaffin, betaA2 

prostaglandin E synthase 

H2A histone family, member X 

tengsln 

tntegrin, alpha 5 (fibroneciin receptor, 
mitochondrial translatbna) initiation f 
Actwin A receptor, type I (ACVR1)(ALX 
cytochrome P450, subfamily XXVIA. polype 
hypothetical protein RJ11 193 
glutamate-cysteine ligase, catalytic sub 
KlAA0535gene product 
matrix metafloproleinase 9 {getatmase B 
wingless-type MMTV integration sits fami 
Homo sapiens cDNA FU10570 fis, ctons NT 
extra spindle poles, S. cerevisiae, homo 
Not56 (D. melanogasterHike protein 
N!MA (never in mitosis gene a)-related k 
UDP-galactose transporter related 
gb:EST365 1 90 MAGE resequences, MAGB Homo 
ESTs 

achaete-scuta complex (DrosophBa) homo! 
rninichromosome maintenance deficient (S, 
methylene tetrahydrafolate dehydrogenase 
carbarnoyl-phosphate synthetase 2, as part 
ESTs, Weakly similar to 138022 hypothefi 
ESTs, Weakly similar to 138022 hypothefi . 
stanniocalrin 2 
WAA0095 gene product 
matrix metalloproteinase 11 (stromelysin 
alpha-fetoprotein 

rninichromosome maintenance deficient (mi 
protein kinase, DNA-acth/ated, catalytic 
ribose 5-phosphate isomerase A (ribose 5 



tapoisomerase (DNA) II alpha (170kD) 
hypothetical protein NUF2R 
ELKL motif kinase 

FAT tumor suppressor (DrosophQa) homolo 
UL16 binding protein 2 



desmoglein 3 (pemphigus vulgaris antigen 
^acetyl transferase 1 (arylamine N-acety 
protein tyrosine phosphatase, non-recept 
pepOdylglycine alpha-amidatlng monooxyg 
parathyroid hormone receptor 2 
ESTs 

karyopherin alpha 2 (RAG cohort 1, tmpor 
serurn/grucocorticoid regulated kinase 
death receptor 6, TNF superfamHy member 
ESTs 

ATPase, Class I, type 8B, member 1 
stanniocalrin 2 

Human proteinase activated receptor-2 mR 
Homo sapiens mRNA; cONA DKFZp566A1046 (f 
kynureninase (Uyrairenlne hydrolase) 
nudeoportn 5QxD 
TTK protein kinase 
paraoxonase 2 

solute carrier family 2 (facilitated glu 
hypothetical 43.2 Kd protein 
transforming growth factor, alpha 
K1AA0061 protein 
ESTs 

bone morphogenebc protein 7 (osteogenic 
ESTs 

hypothetical protein FLJ11 183 

UDP gtycosyi transferase 1 family, polype 

PCTA1RE protein kinase 1 

uridine monophosphate synthetase (orotat 

fan 

chromogranln A (parathyroid secretory pr 
ESTs 

salute carrier farrafy 12 (potassium/chto 
myelin transcription factor 1-Gke 
S^rJenosylhorrocysteine hydrolase 
ESTs 

Homo sapiens cDNA: FU22373 lis, clone H 

collagen, type I, alpha 1 

Homo sapiens cONA RJ1 1439 Us, clone HE 

sclerostin 

Homo sapiens cDNA FU 10674 fis, clone NT 
odd Oz/terwn homolog 2 (Drosophila, mous 



U.90 


l.UO 


/.(it 


4 OA 


QC cc 


92.00 


4 C9 - 


1 oc 


1 GO 
1.00 


4 <91 
1.00 


1.82 


4 OO 

i^y 


l.UU 


4 nn 
l.UU 


1 no 

l.U* 


O OA 


1 rtn 

l.UU 


17 nn 
i/.uu 


V) AC 




■1 Aft 

0.4U 


£.00 


91 at 

01.0/ 


?c nn 


o eg 
J- Da 


0 77 


4 (V» 
l.UU 


i nn 

l.UU 


0 41 


o oo 

LIS 


4 nn 
1.00 


4 nn 
l.UU 


cc 4a 
00.13 


49 nn 
1AUU 


265 


4 on 
1.0U 


1.23 


4 AC 

1.05 


2135 


4 nn 
l.UU 


1.36 


4 QC 

1.00 


1 04 

1.24 


4 >H 

1.4 1 


1.00 


44 nn 
11. UU 


7.46 


07 nn 

O/.UU 


2.52 


^ no 


A OA 

4.84 . 


a no 
4.U0 


3.62 


O 79 
Ltd 


1.00 


zo nn 
oO.UU 


100.77 


a a nn 
44.UU 


3.30 


Z90 


1.91 


0 oo 
LSI 


1.41 


1.49 


1.00 


68.00 


1.97 


4 CI 

1.00 


141.49 


123.00 


1.00 


84.00 


0.87 


4 CO 

1.59 


14,90 


5.76 


1.00 


1.00 


10.58 


Q7i4 


1.74 


4 An 
1.40 


1.49 


4 4X 
1.14 


53.29 


ooo nn 
ZOO, UU 


33.45 


4 nn 
1.00 


1.00 


cc nn 
55.00 


1.00 


10.00 


1.00 


aa nn 
41 .UO 


1.00 


48.00 


27.39 


98.00 


1.99 


1.58 


71.16 


3.42 


1.35 


1.34 


1.01 


17.00 


26.26 


47.00 


1.91 


Z90 


22.40 


£O.UU 


l.UO 


4 nn 

l.UU 


04 00. 


790 nn 

£43.UU 


A 14 

4.34 


ji na 
4.UO 


7 (Yi 
I.UZ 


1 nn 

l.UU 


1.10 


1.00 


£.07 


1 71 


1.00 


1.00 


7f| en 
IU.OU 


?s nn 


y.ui 


22.00 


10 99 


17 nn 
If .uu 


!UJ.f4 


A1 on 

*» l.UU 


4.00 


99 nn 

IO.UU 


l.UU 


A9 nn 

40.UU 


IDU.UO 


Ann 


1 51 
1.01 


1 9«J 

I.JO 


9 11 


1 r>A 

1 .DO 




O. IH 


1 79 


1 71 
1./ 1 


l.OU 


1 fid 


1.47 


1.53 


too 


too 


1.51 


1.25 


141.55 


17.00 


32.61 


38.00 


2.65 


116 


1.97 


3.49 


1.00 


1.00 


3.39 


2.28 


4.24 


17.00 



159 



WO 02/086443 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



427239 


BE270447 


Hs.174070 


427260 


AA663848 




427281 


AA905147 


Hs.102869 


427335 


AA448542 


HsJ51577 


427354 


T57896 


Hs.191095 


4273S6 


AW023482 


Hs.97849 


427376 


AA401533 


Hs.19440 


427333 


N\L005411 


Hs.177582 


427427 


AF077345 


Hs.177935 


427441 


AA41 2605 


HsJ43879 


427445 


X8081B 


Hs.178078 


427505 


AA361562 


Hs.178761 


427510 


Z47542 


Hs.179312 


427528 


AU077143 


Hs. 179565 


427546 


AA1B8763 


Hs.36793 


427552 


R56424 


Hs.25534 


427535 


D31152 


Hs.179729 


427660 


AI741320 


Hs.114121 


427666 


AI791495 


Hs.180142 


427668 


AA298760 


Hs.180191 


427677 


NM 007045 


Hs.180298 


427701 


AA411101 


Hs.243886 


427711 


M31659 


Hs.180408 


427719 


A1393122 


Hs.134726 


427722 


AK000123 


Hs.180479 


427747 


AW411425 


Hs.160655 


427912 


AL022310 


Hs.181097 


427961 


AW293165 


Hs.143134 


428004 


AA449563 


Hs.151393 


428023 


AL038843 




428046 


AW812795 


Hs.337534 


428093 


AW594505 


Hs.104830 


428098 


AU077258 


Hs.182429 


428129 


AI244311 


Hs^6912 


428169 


AI928984 


Hs.182793 


428182 


BE386042 


Hs^93317 


428227 


AA321649 


Hs.2248 


428242 


H55709 


HsJ225Q 


428330 


L22524 


Hs.2256 


428434 


A1909935 


Hs.65551 


428450 


NMJJ14791 


Hs.184339 


428471 


X57348 


Hs.184510 


428479 


Y00272 


H5L334562 


428484 


AF104032 


Hs.184601 


428505 


AL035461 


Hs.2281 


428532 


AF157326 


Hs.1 84788 


428645 


AA431400 


Hs.98729 


428664 


AK001666 


Hs.183095 


428698 


AA852773 


Hs.334838 


428728 


NMJH6625 


Hs.191381 


428748 


AW593206 


Hs.98785 


428758 


AA433988 


Hs.98502 


428771 


AB028992 


Hs.193143 


428801 


AW277121 


Hs.254881 


428810 


AR368236 


Hs.193788 


428839 


AI767756 


Hs.62302 


428845 


AL157579 


Hs.153610 


428959 


AF100779 


Hs.194680 


428959 


AF120274 


Ns.194689 


429038 


AL023513 


Hs.194768 


429065 


AT753247 


Hs.29643 


429164 


AI688663 


Hs.116586 


429170 


NM.001394 


Hs.2359 


429183 


AB014604 


Hs.197955 


429201 


X0317B 


Ks.198246 


429211 


AF052693 


Hs.198249 


429220 


AW207205 




429228 


AI553633 


Hs.326447 


429259 


AA420450 


Hs.292911 


429263 


AA019004 


Hs.198396 


429276 


AF056085 


Hs.198612 


429359 


W00482 


Hs.2399 


429412 


NM.006235 


Hs.2407 


429413 


NMJJ14058 


Hs.201877 


429486 


AF155827 


Hs.203963 


429504 


X99133 


Hs.204238 


429533 


BE182592 


Hs.11261 


429547 


AA401359 


Hs.190721 


429551 


AW450624 


Hs.220931 


429563 


BE619413 


Ks.2437 


429597 


NMJJ03816 


Hs.2442 


429610 


AB024937 


Hs.211092 


429612 


AF062549 


Hs.252587 


429616 


A1982722 


Hs.120845 


429655 


X05608 


Hs.211584 



ubiqutin earner pmten 

gb:ae70b06^1 Stratagene schizo bran S1 

ESTs 

G antigen 7B 
ESTs 
ESTs 
ESTs 

surfactant, pulmonary-associated protein 
lectin, superfamiiy member 1 (carflage- 
SPANX family, member C 
glutamate receptor, metabofropic 4 
26S proteasome-assodated pad1 homolog 
small nuclear RNA acflvaSng complex, po 
mmichramosome maintenance deficient (S. 
hypothetical protein FU23188 
ESTs 

coflagen, type X alpha 1 (Schmfd metaph 
Homo sapiens cDNA: FU23228 fe, done C 
cairrxxJuBn-nke skin protein 
hypothefoa) protein FU 14904 
FGFR1 oncogene partner 
nuclear autoanttgenlc sperm protein (bis 
solute carrier family 25 (mitochondrial 
ESTs 

hypothetical protein FL120116 

serine/threonine kinase 12 

tumor necrosis factor (Bgand) superfami 

ESTs 

gtotamate-cysteine Dgase, calalytic sub 
Homo sapiens cDNA: RJ23502 (is, done I 
ESTs, Moderately similar to 138022 hypot 
ESTs 

protein disulfide isomerase-reJated prot 
ESTs 

goigi phosphoprotein 2 

ESTs, Weakly similar to GGC1_HUMAN G ANT 
small indudbte cytokine subfamily B {Cy 
leukemia inhibitory factor (cholinergic 
matrix metailoproteinase 7 (mabflysin. 
Homo sapiens, SimBar to DNA segment Ch 
KIAA0175 gene product 
stratifin 

cell division cycle 2, G1 to S and 62 to 
solute carrier family 7 {cationic amino 
chromogranin B (secretpgranln 1) 
TBP-interacting protein 
ESTs, Weakly similar to 201 7205A dihydro 
similar to SALL1 {sal (Drasoph3a)-iika 
WAA1866 protein 
hypothetical protein 
Ksp37 protein 

hypothetical protein FU14303 

WAA1069 protein 

ESTs 

nitric oxide synthase 2A (inducible, hep 

Homo sapiens cDNA FU14814 lis, done NT 

KIAA0751 gene product 

WNT1 inducible signaling pathway protein 

artemm 

seizure related gene 6 (mouseHke 
Homo sapiens cDNA FU13103 fe, done NT 
ESTs 

dual spedSdty phosphatase 4 
K1AA0704 protein 

group-specific component (vitamin D bind 
gap junction protein, beta 5 (connexin 3 
ESTs 
ESTs 

ESTs, Highly similar to S50712 band-6-pr 
ATP-binding cassette, sub-family A {ABC1 
G protein-coupled receptor 51 
matrix metailoproteinase 14 (membrane-in 
POU domain, dass 2, associating factor 
DESC1 protein 
hypothetical protein FU 10339 
Bpocafin 2 (oncogene 24p3) 
smaB proiine-rich protein 2A 
ESTs 
,ESTs 

eukaryob'c translation initiation factor 
a disintegrin and metailoproteinase doma 
LUNX protein; PLUNC (palate lung and nas 
pituitary turrtor-transfoiming 1 
ESTs 

neurofilament, light polypeptide (68kD) 



1.58 


1.05 


1.34 


1.60 


1.00 


66.00 


51.83 


4.00 


1.17 


1.95 


7.31 


41.00 


1.00 


57.00 


0.42 


1.32 


1.00 


20.00 


1.00 


1.00 


0.97 


1.03 


4.60 


4.04 


22.00 


45.00 


97.45 


92.00 


1.50 


3.24 


6.81 


40.00 


69.91 


62.00 


2.70 


49.00 


t.37 


1.88 


29.55 


67.00 


3.52 


2.63 


7.41 


34.00 


15.84 


70.00 


7.03 


4.52 


2.92 


1.74 


1.76 


1.26 


9.63 


59.00 


41.97 


naoo 


23.82 


1.00 


1.40 


1.33 


96.28 


167.00 


1.25 


1.29 


1.86 


1.60 


1.00 


42.00 


2.76 


2.11 


1.00 


1.00 


85.59 


181.00 


8.57 


21.64 


7.77 


15.90 


0.58 


1.43 


237.53 


204.00 


6.00 


4.60 


56.54 


16.00 


3.53 


2.15 


1.00 


1.00 


1.00 


58.00 


1.00 


16.00 


1.00 


1.00 


187.37 


255.00 


47.24 


80.00 


1.00 


87.00 


1.06 


1.13 


1.98 


92.00 


1.67 


6.15 


1.03 


1.27 


124.17 


43.00 


1.00 


1.00 


15.16 


27.00 


1.36 


1.24 


0.97 


3.31 


6.82 


16.47 


19.08 


67.00 


16.18 


105.00 


79.72 


104.00 


1.00 


1.00 


1.33 


1.09 


1.00 


7.00 


39.47 


29.25 


2.01 


1.18 


1.07 


1.00 


3.70 


142.00 


1.30 


1.34 


94.09 


86.00 


41.91 


10.00 


12.19 


1.00 


1.61 


1 ns 

l.UO 


4.43 


2.90 


1.06 


17.00 


Z89 


65.00 


1.49 


1.37 


61.66 


100.00 


1.59 


1.69 


2.78 


1.74 


1.00 


1.00 


1.00 


4.00 



PCT/US02/12476 



160 



WO 02/086443 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



429663 


M58874 


Hs^11587 


429736 


AF125304 


Hs.212580 


429782 


NMJJ05754 


Hs.220689 


429903 


AL134197 


Hs.93597 


429918 


AWB73986 


Hs.119383 


429978 


AA249027 




429986 


AF092047 


Hs.227277 


430044 


AA464510 


Hs.152812 


430114 


AA847744 


Hs.99640 


430134 


BE380149 


Hs.105223 


430147 


R60704 


Hs.234434 


430287 


AW182459 


Hs.1 25759 


430294 


AI538226 


Hs-32976 


430300 


U60805 


Hs^38648 


430315 


NMJKM293 


Hs.239147 


430337 


M36707 


Hs^39600 


430378 


Z29572 


Hs.2555 


430388 


AA356923 


Hs.240770 


430393 


8E185030 


Hs.241305 


430439 


AL133561 




430451 


AA836472 


Hs.297939 


430454 


AW469011 


Hs.1 05635 


430466 


AF052573 


HsJ24W7 


430481 


AA479678 


Hs.203269 


430486 


BE062109 


Hs.241551 


430508 


A1015435 


Hs.104637 


430533 


M480895 


Hs.57749 


430563 


AF146074 


Hs.108660 


430677 


Z26317 


Hs.94560 


430678 


AA401369 


Hs.190721. 


430666 


NMJJ01942 


Hs.2633 


430788 


AI742925 


Hs.7179 


430890 


X54232 


Hs.2699 


430935 


AW072916 




430985 


AA490232 


Hs^7323 


431009 


BE149762 


Hs.48956 


431089 


BE041395 




431092 


A1332764 


Hs.125757 


431124 


AF284221 


Hs.59506 


431164 


AA493650 


Hs.94367 


431211 


M86849 


Hs.323733 


431221 


AW207837 


Hs.286145 


431277 


AA501806 


Hs.345824 


431322 


AW970622 




431342 


AW971018 


Hs.21659 


431384 


BE158000 


Hs.285026 


431462 


AW583672 


Hs.256311 


431494 


AA991355 


Hs^98312 


431515 


NMJH2152 


Hs.258583 


431548 


AJ834273 


Hs.9711 


431630 


NM_W2204 


Hs.265829 


431745 


AW972448 


Hs.163425 


431770 


BE221880 


Hs.26B555 


431830 


Y16645 


Hs.271387 


431B46 


BE019924 


Ks.271580 


431890 


X17033 


Hs.271986 


431934 


AB031481 


Hs.272214 


431958 


X63629 


Hs.2877 


432006 


AL137382 


Hs.272320 


432023 


R43020 


Hs.236223 


432201 


AI538613 


Hs.298241 


432210 


A1567421 


Hs.273330 


432226 


AW182766 


Hs.273558 


432239 


XB1334 


Hs.2936 


432265 


BE382679 


Hs.285753 


432281 


AK001239 


Hs.274263 


432365 


AK001105 


Hs.274419 


432374 


W68815 


Hs.301885 


432375 


BE536069 


Hs.2962 


432407 


AA221036 




432441 


AW292425 


Hs.163484 


432489 


AI804855 


Hs.207530 


432543 


AA552690 


Hs.1 52423 


432552 


AI537170 


Hs.173725 


432583 


AW023624 


Ks.162282 


432606 


NMJX/2104 


Hs.3066 


432625 


A1243596 


Hs.94830 


432653 


N62096 


Hs.293185 


432677 


NMJTO4482 


Hs.278611 


432715 


AA247152 


Hs.200483 


432753 


NM.014075 


Hs.336938 


432788 


AA521091 


Hs.178499 


432842 


AW674093 


Hs.334822 


432867 


AW016936 


Hs.233364 


432917 


NWL014125 


Hs.241517 



phospholipase A2, group IVA (cytosolfc. 
tumor necrosis fsctor receptor superfami 
Ras-GTPase-acfivating protein SHMomain 
cyc&Mlependent kinase 5. regulatory su 
ESTs 

riccscmal protein S6 

sine ocute homeobox (Drosophila) homoto 

ESTs 

ESTs 

ESTs, Weakly similar to T33188 hypotheS 
hauy/enhancer-o^spJit related with YRP 
ESTs, Weakly similar to LEU5_HUMAN LEUKE 
guanine nucleotide binding protein 4 
oncostafin M receptor 
guanine deaminase 
calmoduBn-iike 3 

tumor necrosis fsctor receptor superfami 
nuclear cap Wntfing protein subunit 2, 2 
estrogerwesponsive B box protein 
DKFZP434B061 protein 
cathepsin B 
ESTs 

polymerase (DNA directed), fteia 
ESTs, Moderately similar to ALU8_HUMAN A 
chloride channel calcium activated, fam 
ESTs 

ESTs, Weakly similar to T17288 hypotheti 
ATP-birufing cassette, sub-family C {CFTR 
desmoglein 2 
ESTs 

desmoglein 1 

ESTs, Weakly similar to 2Q04399A chromos 
gJypicanl 

zinc finger protein 131 (done pHZ-10) 
ESTs, Weakly similar to 178885 serine/th 
gap junction protein, beta 6 (connexln 3 
ESTs, WeaWy similar to unknown pro tain 
ESTs 

doublesex and mab-3 related transcriptio 
Homo sapiens cDNA* RJ23494 fis, clone L 
gap junction protein, beta 2, 26kD (conn 
SRB7 (suppressor of RNA polymerase B, ye 
ESTs 

gb:EST382704 MAGE resequences, MAGK Homo 
ESTs 

gb:MR2-HT0377-150200-202-<s03 HT0377 Homo 
granin-fika neuroendocrine peptide precu 
hypothetical protein DKFZp434A1315 
endothelial differentiation, lysophospha 
novel protein 

integrin, alpha 3 (antigen CD49C alpha 
ESTs 

5 , -3 , exoribonuclease2 

small inducible cytokine subfamily A (Cy 

uropiakin 1B 

integrin, alpha 2 (CD49B, alpha 2 subtirri 
STG protein 

cadherin 3. type 1, P-cadherfn (placenta 
Homo sapiens mRNA; cDNA DKF2p434L1226 (f 
EST 

Transmembrane protease, serine 3 

Homo sapiens, done IMAG£:3544662, mRNA, 

phosphate cytidylyllransferase 1, chorm 

matrix metaBoproteinase 13 (coBagenase 

SCGKWke-protein 

hypothetical protein FLM0377 

hypothefcal protein RJ10244 

Homo sapiens cDNA FU11346 fis, done PL 

S100 calciunvtfinding protein P 

gb2i03f12xt Stratagene NT2 neuronal pr 

ESTs 

ESTs 

Homo sapiens cDNA: FU21274 fis, clone C 
ESTs, WeaWy similar to ALU8..HUMAN ALU S 
potassium channel TASK-4; potassium chan 
granzyme K (serine protease, granzyme 3; 
ESTs, Moderately similar to T03094 A-Wn 
ESTs, Weakly similar to JC7328 amino aci 
UDP4J-acetyi-alpha-t>galactosamlne:po!yp 
ESTs, Weakly slmflar to KIAA1074 prole&i 
Homo sapiens PRO0593 mRNA, complete cds 
Homo sapiens cDNA: RJ231 17 fis, done L 
hypothetical protein MGC44B5 
ESTs 

PRO0327 protein 



PCT/US02/12476 



69.95 


104.00 


1.25 


1.21 


1.00 


7.00 


11.80 


1.00 


1.00 


78.00 


1.98 


3.09 


1.00 


48.00 


69.27 


59.00 


1.00 


1.00 


1.00 


51.00 


1.10 


122 


1.00 


127.00 


3.80 


1.47 


1.00 


35.00 


92.31 


28.00 


1.18 


1.08 


5.28 


66.00 


16.76 


38.00 


1.63 


1.50 


1.00 


1.00 


1.64 


2.12 


63.35 


44.00 


247 


1.91 


1.00 


31.00 


12.28 


41.00 


4.75 


7.27 


1.00 


1.00 


1.00 


1.59 


1.72 


1.30 


0.90 


17.00 


1.00 


1.00 


1.62 


1.84 


1.58 


1.40 


90.28 


13200 


0.94 


1.28 


60.25 


28.00 


23.32 


941.00 


13.46 


63.00 


49.43 


62.00 


0.44 


2.20 


182.26 


101.00 


4.15 


13.97 


1.00 


86.00 


40.55 


200.00 


1.00 


53.00 


0.94 


1.14 


1.30 


1.25 


3.90 


26.00 


1.41 


1.87 


5.66 


15.00 


0.99 


1.44 


0.99 


3.51 


67.12 


91.00 


3.36 


4.71 


4.49 


2.51 


2.20 


132 


1.01 


1.04 


51.17 


46.35 


0.94 


1.65 


0.94 


47.00 


1.10 


2.24 


1.42 


1,45 


1.00 


1.00 


18.67 


1,00 


1.09 


1.21 


40.98 


58.00 


1.00 


214.00 


157.34 


37.00 


1.65 


1 nc 


73.71 


/5.QU 


56.35 


fZ.UU 


1.00 


24.00 


137.72 


no nn 


1.00 


31.00 


0.27 


35.18 


0 07 
/.Of 


622 


26.63 


56.00 


1.92 


5.29 


1.00 


48.00 


45.13 


31.00 


1.00 


68.00 


2.69 


3.67 


1.22 


1.34 


1.00 


1.00 


10.25 


6.62 



161 



WO 02/086443 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



432920 


U37689 


Hs.3128 


polymerase (RNA) II (DNA (Erected) polyp 


433001 


AF217513 


Hsi79905 


done HQ0310 PRO0310p1 


433023 


AWB64793 


HSJ7409 


thrombospcrtdin 1 


433042 


AW193534 


HsJ81B95 


Homo sapiens cDNA FU11660 fis, dona HE 


433091 


Y12642 


Hs.3185 


lymphocyte an Dg en 6 complex, locus D 


433159 


AB035898 


Hs.150587 


kinesW&e protein 2 


433183 


AF231338 


Ks.222024 


transcnpGon factor BMAL2 


433258 


AA622783 


Hs.203613 


ESTs, Weakly similar to ALUB_HUMAN !!U 


433409 


AI276802 


Hs^5661 


ESTs 


433437 


LT20536 


HsJ280 


caspase 6, apoptosis-retated cysteine pr 


433485 


A1493076 


Hs.201967 


aldo-keto reductase family 1, member C2 


433537 


AI733592 


Hs.1 12488 


ESTs 


433547 


W04978 


Hs.303023 


beta tubulin 1, class VI 


433556 


W56321 


Hs.1 11460 


caidum/catmoduBrwlependent protein kin 


433647 


AA603367 


Hs^22294 


ESTs 


433658 


L03678 


Hs.155110 


immunoglobulin kappa constant 


433800 


AI094221 


Hs.135150 


lung type*! eel) membrane-associated gty 


433819 


AW511097 


Hs.112765 


ESTs 


433862 


086960 


Hs^610 


KIAA0205 gene product 


433980 


AA137152 


HS.2B6049 


phosphoserine aminotransferase 


434088 


AF1 16677 


Hs-249270 


hypothetical protein PR01956 


434094 


AA305599 


Hs.238205 


hypothetical protein PRO2013 


434105 


AW952124 


Hs.13094 


presenifins associated rhomboid-like pro 


434217 


AWD14795 


Hs.23349 


ESTs 


434340 


AI193043 


Hs.1 28685 


ESTs, Weakly similar to T17226 hypolheti 


434360 


AA401369 


Hs.190721 


ESTs 


434414 


AI798376 




gb:tr34b07jc1 NCl_CGAP_.Ov23 Homo sapiens 


434424 


AI811202 


Hs.325335 


Homo sapiens cDNA: FU23523 fis, clone I 


434467 


BE552368 


Hs.231853 


Homo sapiens cDNA FU 13445 fe, done PL 


434551 


BE387162 


Hs^8085B 


ESTs, Highly simfiar to A35661 DNA exds 


434627 


A1221894 


Hs.39311 


ESTs 


434699 


AA643687 


Hs.149425 


Homo sapiens cDNA FU 11980 fis, done HE 


434769 


AA548884 


Hs.134278 


Homo sapiens cDNA FU12676 fis, done NT 


434792 


AA549253 


Hs.132458 


ESTs 


434808 


AF155108 


Hs.256150 


Homo sapiens, Similar to R1KEN cDNA 2810 


434828 


D90070 


Hs.96 


phorod-12^ydst8te-13^cetate-induced 


434876 


AF160477 


Hs.61460 


Ig superfam&y receptor LNIR 


434891 


AA814309 


Hs.123583 . 


ESTs 


434928 


AW015595 


Hs.4257 


Homo sapiens clones 24714 and 24715 mRNA 


435013 


K91923 


Hs.1 10024 


Target CAT 


435056 


BE261750 


Hs.4747 


dyskeratosis congenita 1, dyskerln 


435087 


AW975241 


Hs.23567 


ESTs 


435099 


AC004770 


Hs.4756 


flap structure-specific endonuclease 1 


435159 


AA668879 


Hs.116649 


ESTs 


435205 


X54136 


Hs. 181 125 


immunoglobulin lambda locus 


435232 


NVL001262 


Hs.4854 


cycOn-dependent kinase Inhibitor 2C (p1 


435304 


H10709 


Hs.269524 


ESTs 


435313 


AI769400 


Hs.169729 


ESTs 


435505 


AF200492 


HS.21123B 


interteukin-1 homolog 1 


435509 


A1458679 


Hs.181915 


ESTs 


435525 


AI831297 


Hs.123310 


ESTs 


435532 


AW291488 


Hs.1 17305 


Homo sapiens, done IMAGE:3682908, mRNA 


435550 


A1224456 


Hs.324507 


H.sapiens pdyA site DNA 


435602 


AF217515 


Hs.283532 


u recharacterized bone marrow protein BM03 


435766 


R11673 


Hs.186498 


ESTs 


435793 


AB037734 


Hs.4993 


KIAA1313 protein 


436069 ' 


A1056B79 


Hs.263209 


ESTs 


436170 


AW450381 


Hs.1 4529 


ESTs 


436211 


AKQ01581 


Hs.334828 


hypothetical protein FU10719; KIAA1794 


436213 


AA325512 


Hs.71472 


hypothetical protein FU10774; WAA1709 


435217 


T53925 


Hs.107 


RDrinogen-fike 1 


436238 


AK002163 


Hs.301724 


hypothetical protein FU11301 


436251 


BE515065 


Hs.296585 


nudeolar protein (KK&D repeat} 


436291 


BE568452 


Hs.344037 


protein regulator of cytokinesis 1 


436302 


AL355841 


Hs.99330 


hypothetical protein FU2358B 


436396 


AW992292 


Hs.152213 


wingless-type MMTV Integration site faml 


436414 


BE254633 


Hs.143638 


WD repeat domain 4 


436419 


AI948626 


Hs.1 71356 


ESTs 


436443 


AW138211 


Hs.1 28746 


ESTs 


436474 


AJ270693 


Hs. 199887 


ESTs 


436481 


AA379597 


Hs.5199 


HSPC150 protein similar to uDiquiwvcon 


436466 


AA742221 


Hs.120633 


ESTs 


436511 


AA721252 


Hs.291502 


ESTs 


436553 


X57809- 


• I i Ail nr 

Hs.181125 


irnrnunogtobulin lambda locus 


436557 


W15573 


Hs.5027 


ESTs, Weakly similar to A47582 B-ce/I gr 


436606 






□own synarorna cnuwfli icyiiwj ^iuioui i» 


436667 


AW025183 


Hs.127680 


ESTs 


436771 


AW975687 


Hs.292979 


•ESTs 


436S39 
436887 


AA401369 


Hs.190721 


ESTs 


AW953157 


Hs.193235 


hypothetical protein DKFZp547D155 


436944 


AW268614 


Hs.5840 


•ESTs 


436961 


AW375974 


Hs.156704 


ESTs 


436972 


AA284679 


Hs.25640 


daudin3 


437016 


AU076916 


Hs.5398 • 


guanine monphosphate synthetase 


437044 


AUJ35864 


Hs.69517 • 


cDNA for differenSally expressed C016 g 



1.44 


1.30 


154.79 


85.64 


20.96 


100.00 


1.00 


10.00 


1.20 


1.03 


13.82 


39.00 


1.00 


69.00 


1.00 


1.25 


44.81 


117.00 


70.39 


105.00 


11.55 


2.00 


8.66 


55.00 


25.16 


83.00 


1.00 


19.00 


20.30 


49.00 


5.92 


10.03 


2.29 


Z22 


a71 


8.00 


62.08 


104.00 


10191 


47.00 


1.00 


1.00 


121.27 


87.00 


1.22 


1.23 


14.11 


57.00 


Z10 


256 


40.98 


17.00 


1.48 


1.56 


1.00 


64.00 


54.91 


85.00 


2.46 


ZOO 


1.00 


1.00 


too 


23.00 


7.08 


56.00 


a 52 


44.00 


11.33 


1.00 


1.00 


1.00 


1.25 


1.29 


1.00 


6.00 


1.00 


1.00 


1.26 


1.10 


1.69 


1.37 


1.00 


1.00 


2,90 


1.93 


1.00 


1.00 


1.02 


1.46 


2.04 


Z70 


27.58 


139.00 


1.00 


14.00 


1.00 


38.00 


1.00 


1.00 


1.00 


56.00 


1.00 


ZOO 


142 


3.92 


195 


1.80 


1.00 


28.00 


23.68 


42.00 


1.00 


58.00 


1.00 


18.00 


5.84 


2Z0O 


1.42 


1.27 


57.97 


31.00 


251 


1.71 


2.33 


1.64 


108.99 


52.00 


0.75 


Z81 


60.01 


1.00 


Z50 


Z19 


0.95 


1.33 


1.12 


9.26 


1.00 


i.UU 


3.28 


1.56 


1.00 


19.00 


16.76 


14.00 


1.08 




19.20 - 


9.75 


oo.vi 




0.89 


1.19 


1.00 


10.00 


1.00 


17,00 


1.05 


1.15 


1.00 


1.00 


25.13 


25.00 


1.59 


1.46 


2.35 


1.78 


1.34 


1.13 



PCT/US02/12476 



162 



WO 02/086443 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



-437181 


AI306515 


Hs.125343 


ESTs, Weakly similar to KIAA0758 protein 


437204 


AL110215 


Hs^2826 


ESTs, Weakly simflar to 155214 salivary 


437205 


AL1 10232 


Hs.279243 


Homo sapiens mRNA; cDNA DKFZp564D2071 (f 


437259 


A1377755 


Hs.12Q895 


ESTs 


437270 


R18087 


Hs.323769 


dsplaSn resistance related protein CRR 


437271 


AL1 37445 


I&23846 


Homo sapiens mRNA; cDNA OKFZp56601 34 (fir 


437370 


AL359567 


Hs.161962 


Homo sapfens mRNA; cDNA DKFZp547D023 (tr 


437390 


AI125859 


Hs.1 12507 


ESTs 


437412 


BE069288 


Hs.34744 


Homo sapiens mRNA; cDNA DKFZp547C1 36 (fir 


437435 


AI306152 


Hs.27027 


hypofJte Beat protein DKFZp762H1311 


437444 


K45008 


Hs.31518 


ESTs 


437558 


AI954795 


Hs.156135 


ESTs 


437623 


063880 


Hs.5719 


chromosome conderisatioiKelated SWC-asso 


437789 


A1581344 


Hs.1 27812 


ESTs, Weakly similar to T17330 hypotheti 


437814 


AI088192 


Hs.1 35474 


ESTs, WeaWy similar to DDX9_HUMAN ATP-D 


437840 


AA884836 


Hs.292014 


ESTs 


437852 


BE001836 


Hs.256897 


ESTs, Weakly similar to (U3550121 {H.sa 


437879 


BE262082 


Hs.5894 


hypothetical protein FU 10305 


437915 


AI637993 


Hs.202312 


Homo sapiens clone N1 1 NTera2D1 leratoca 


437916 


BE566249 


Hs^0999 


hypothetical protein FU 23142 


437937 


AI917222 


Hs.121655 


ESTs 


437942 


AI888256 


Hs.307525 


ESTs 


438091 


AW373062 




nuclear receptor subfamily 1, group I, m 


438113 


AI457903 


Hs.8882 


ESTs 


438119 


AW963217 


Hs.203961 


ESTs, Moderately similar to AF1 16721 89 


438274 


AI918906 


Hs.55080 


ESTs 


438378 


AW970529 


Hs.85434 


hypothetical protein FU21B16 


438403 


AAB066O7 


Hs.292206 


ESTs 


438494 


AA908678 


Hs.130183 


ESTs 


438546 


AW297204 


Hs.125811 


ESTs 


438552 


AJ245820 


Hs.6314 


type 1 transmembrane receptor (seizure-! 


438702 


AI879054 


Hs£4618 


ESTs 


438724 


AW612553 


Hs.1 14670 


Human DNA sequence from done RP1 1-16121 


438746 


A1885815 


Hs.184727 


Human melartomaassoclated antigen p97 (m 


438779 


NM.003787 


Ks.6414 


nucleolar protein 4 


438821 


AA826425 


Hs.192375 


ESTs 


438885 


AJ886558 


Hs.184987 


ESTs 


438898 


AA401369 


Hs.190721 


ESTs 


43B915 


AA280174 


Hs2856B1 


watiams-Beuren syndrome chromosome regi 


438956 


W00847 


Hs.1 35056 


Human DNA sequence from done RP5-850E9 


439000 


AW979121 




gb:EST391231 MAGE resequences, MAGP Homo 


439023 


AA745978 


Hs.28273 


ESTs 


439024 


R95696 


Hs.35598 


ESTs 


439128 


A1949371 


Hs.1 53089 


ESTs 


439146 


AW138909 


Hs.156110 


immunoglobulin kappa constant 


439223 


AW238299 


Hs.250618 


UL16 binding protein 2 


439285 


AL133916 




hypothetical protein FU 20093 


439318 


AW837046 


Hs.6527 


G protein-coupled receptor 56 


439343 


AF086161 


Hs.114611 


hypothetical protein FU11808 


439394 


AA401369 


Hs.190721 


ESTs 


439410 


AA632012 


Hs.188746 


ESTs 


439451 


AF086270 


Hs.278554 


hetercchromatin-like protein 1 


439452 


AA918317 


Hs.57987 


B-cbH CLUymphorna 1 1 B (zinc finger pro 


439453 


BE264974 


Hs.6566 


thyroid hormone receptor interactor 13 


439477 


W69813 


Hs.58042 


ESTs, Moderately similar to GFR3_HUMAN G 


439492 


AF086310 


Hs.103159 


ESTs 


439523 


W72348 


Hs.185029 


ESTs 


439592 


AF086413 


Hs.58399 


ESTs 


439606 


W79123 


Hs.58561 


G protein-coupled receptor 87 


439670 


AF088O76 


Hs.59507 


ESTs, Weakly similar to AC004858 3 U1 sm 


439702 


AW085525 


HS.1341B2 


ESTs 


439706 


AW872527 


Hs^9761 


ESTs, WeaHy similar to DAP1_HUMAN DEATH 


439738 


BE246502 


Hs.959B 


sema domain, immunoglobufln domain (Ig), 


439750 


AL359053 


Hs.57664 


Homo sapiens mRNA fufl length Insert cDN 


439759 


AL359055 


Hs.67709 


Homo sapiens mRNA full length insert cDN 


439780 


AL1 09688 




gbiHomo sapiens mRNA full length insert 


439840 


AW449211 


Hs.1 05445 


GDNF family receptor alpha 1 


439926 


AW014875 


Hs.1 37007 


ESTs 


439963 


AW247529 


Hs.6793 


platelet-activating factor acetylhydrola 


439979 


AW600291 


Hs.6823 


hypothetical protein FU 10430 


440008 


AK000517 


Hs.6844 


hypothetical protein FU 20510 


440028 


AW473575 


Hs.125B43 


ESTs, Weakly simitar to T1 7227 hypotheti 


440106 


AA864968 


Hs.1 27699 


K1AA1 603 protein 


440138 


AB033023 


Hs.318127 


hypcthel/cal protein FU10201 


440273 


A3805392 


Hs325335 


Homo sapiens cDNA; FU 23523 fis, done L 


440289 




u- <Q0ft74 

ns.isAJn 


to IS 


440325 


NM.003812 


Hs.7164 


a disintegrin and me talio proteinase doma 


440492 


R39127 


Hs.21433 


hypothetical protein DKFZp547J036 


440527 


AV657117 


Hs.184164 


ESTs, Moderately similar to S65657 alpha 


440659 


AF134160 


Hs.7327 


daudlnl 


440704 


M69241 


Hs.162 


insulin-tike growth factor binding prole 


440943 


AW082298 


Hs.146161 


hypothetical protein MGC2408 


440994 


A1160011 


HS.Z72068 


ESTs 


441020 


AA401369 


Hs.190721 


ESTs • 


441031 


A1110634 • 


Hs.7645 


fibrinogen, B beta polypeptide 



1.00 


17.00 


40.55 


82.00 


1.00 


11200 


1.00 


20500 


1.56 


1.54 


113.25 


125.00 


1.82 


4.57 


1.35 


1.75 


156 


3.20 


3.03 


1.08 


1.00 


39.00 


1.00 


19.00 


t.95 


1.57 


1.00 


100 


1.00 


45.00 


1.07 


1.78 


1.68 


3.26 


1.87 


252 


74.05 


35.00 


23.15 


89.00 


1.00 


1.00 


12.28 


31.00 


1.53 


10.85 


1.80 


139 


22.67 


36.90 


1.00 


1.00 


38.92 


38.00 


1.00 


1.00 


2.05 


80.00 


100 


131.00 


1.43 


1.45 


1.00 


34.00 


1.33 


1.10 


242 


1.59 


1.00 


18.00 


£03 


257 


&42 


88.00 


22.41 


17.00 


1.00 


1.00 


120 


1.88 


278 


4.81 


1.17 


1.31 


1.00 


28.00 


1.00 


67.00 


1.38 


1.41 


1.93 


1.64 


46.23 


139.00 


2.00 


220 


6.10 


7.37 


3.39 


17.00 


1.83 


3.07 


23.28 


52.00 


18.76 


122.00 


2.78 


1.56 


1.22 


1.44 


7.46 


39.00 


1.00 


1.19 


1.00 


1.00 


33.61 


1.00 


1.00 


1.00 


4.30 


10.00 


86.55 


11.00 


2.36 


1.88 


2.02 


6.08 


too 


21.00 


7.27 


25.00 


1.00 


1.00 


32.58 


71.00 


21.28 


9.55 


68.83 


61.00 


1.83 


4.02 


1.42 


254 


1.00 


54.00 


24.18 


52.00 


3.21 


4.72 


3o,o3 


n'i nn 


62.88 


147.00 


235 


162 • 


10.84 


57.00 


118 


237 


2.89 


209 


202 v 


1.41 


1.29 


1.14 


14299 


17.00 


1.41 


99.00 



PCT/US02/12476 



163 



WO 02/086443 

441128 AA570256 

441230 W27501 HsJ9605 

441362 BE614410 HsJ3044" 

441377 BE218239 Hs.202656 

5 441390 AIS92550 Hs.131175 

441497 R51064 Hs.23172 

441525 AW241867 Hs.127728 

j 441553 AA281219 Hs.121295 

441607 NM.005010 Hs.7912 

10 441633 AW95B544 Hs.112242 

441636 AA081846 Hs.7921 

441737. X79449 Hs.7957 

441790 AA401369 Hs.190721 

441801 AV/242799 Hs.86366 

15 441919 AI553802 Hs.128121 

441937 R41782 Hs.22279 

441954 A1744935 Hs.8047 

442025 AW887434 Hs.11810 

442029 AW956698 Hs.14456 

20 442072 A1740832 Hs.12311 

442108 AW452649 Hs. 166314 

442117 AW664964 Hs.128899 

442137 AA977235 H5.128B3D 

442159 AW163390 Hs.278554 

25 442179 AA983842 Hs.333555 

442328 AI952430 Hs.150614 

442432 BE093589 Hs.38178 

442530 AI580830 Ks. 176508 

442547 AA306997 Hs.217484 

30 442556 AL137761 Hs.8379 

442619 AA447492 Hs.20183 

442710 A1015631 Hs.23210 

442717 R88362 Hs.180591 

442875 BE623003 Hs.23625 

35 442914 AW188551 Hs.99519 

442932 AA457211 Hs.8858 

442942 AW167087 Hs.131562 

443068 AJ188710 

443204 AW20587B Hs.29643 

40 443211 AI128388 Hs.143655 

443247 . BE614387 Hs.333893 

443324 R44013 Hs.154225 

443383 A1792453 Hs.166507 

443400 R28424 Hs.250648 

45 443426 AF098158 Hs.9329 

443572 AA025610 Hs.9605 

443575 A1078022 Hs.269636 

443614 AV655385 Hs.7645 

443633 AL031290 Hs.9654 

50 443648 A1085377 Hs.143610 

443715 AI583187 Hs.9700 

443723 A1144442 Hs.157144 

443802 AW504924 Hs.9805 

443859 NMJJ13409 Hs.9914 

55 443892 AA401369 Hs.190721 

443947 W24187 

443991 NMJJ02250 Hs.10082 

444006 BE395085 tte.10Q8Q 

444009 A1380792 Hs.135104 

60 444017 U04840 Hs.214 

444127 N63620 Hs.13281 

444129 AW294292 Hs.256212 

444279 U62432 Hs.89605 

444371 BE540274 Hs.239 

65 444378 R41339 Hs.12569 

444381 BE387335 Hs.283713 

444461 R53734 Hs.25978 

444471 AB020684 Hs.11217 

444489 AJ151010 Hs.157774 

70 444519 BE5380B2 Hs.8172 

444665 BE613126 Hs.47783 

444707 A1188613 Hs.41690 

444735 BE019923 Hs.243122 

444781 NMJ14400 Hs.11950 

75 444783 AK001468 Hs.62180 

445236 AK001676 Hs.12457 

445258 A1635931 Hs.147613 

445413 AA151342 Hs.12677 

445417 AK001058 Hs.12680 

80 445443 AV653838 Hs.322971 

445462 AA378776 Hs.288549 

445517 AF208855 Hs.12830 

445537 AJ245671 Hs.12844 

445580 AF167572 Hs.12912 

85 445654 X91247 Hs.13046 



ESTs, Weakly amDar to T23273 hypcthefi 

cholinergic receptor, nicotinic, alpha p 

RAD51 (S. cerevisiae) homotog (E coll Re 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

neuronal ceO adhesion moteciile 

norma) mucosa of esophagus specific 1 

Homo sapiens mRNA; cDNA DKFZp566E183 (fr 

adenosine deaminase, RHA-specific 

ESTs 

ESTs 

ESTs 

ESTs 

Fanconi anemia complementation group G 
CDA11 protein 

neural precursor cell expressed, develop 
Homo sapiens clone 23570 mRNA sequence 
ESTs 
ESTs 

ESTs, Weakly similar to Z192JWMAN ZINC 

helerochromafin-fike protein 1 

chromosome 2 open reading frame 2 

ESTs, Weakly similar to ALU4_HUMAN ALU S 

hypothetical protein RJ23463 

Homo sapiens cOMA FU14712 fis, clone NT 

ESTs, Weakly similar to ALU1.HUMAN ALU S 

Homo sapiens mRNA; cDNA DKFZp58SL2424 (f 

ESTs, WeaWy similar to AF164793 1 prote 

ESTs 

ESTs, WeaWy similar to T23976 hypothefl 
Homo sapiens done TCCCTA00142 mRNA sequ 
hypothetical protein FU14007 
bromodomain adjacent to zinc finger doma 
ESTs 
ESTs 

Homo sapiens cDNA FU13103 fis, clone NT 
ESTs 

c-Myc target JP01 

ESTs 

ESTs 

ESTs 

chromosome 20 open reading frame 1 

cleavage and poIyadenytaSon specific fa 

ESTs, WeaWy similar to ALULHUMAN ALU S 

fibrinogen, B beta polypeptide 

simflar to pregnancy-associated plasma p 

ESTs 

cycRn E1 

syntaxin6 

KIAA1291 protein 

foflistaiin 

ESTs 

gb:zb47f09./1 SoaresJetaUung_NbHL19W 
potassium fotermediate&mall conductance 
type I trarisrnembrane protein Fn14 
ESTs 

neuro-oncotogical ventral antigen 1 

ESTs 

ESTs 

cholinergic receptor, nicotinic, alpha p 

forkhead box Ml 

ESTs 

ESTs, WeaWy similar to S64054 hypotheti 
ESTs, WeaWy slmBar to 2109250A B cell 
WAA0877 protein 
ESTs 

ESTs. Moderately similar to A46010 X-to 
B aggressive lymphoma gene 
desmocoIlin3 

hypothetical protein FU13057 similar to 
GPi-anchored metastasis-associated prote 
anflSn (Drosophila Scraps homotog). act 
hypothetical protein FU10814 
ESTs 

CGH47 protein 

Homo sapiens cDMA FU10196 fis, clone HE 
ESTs 

hypothetical protein MGC3077 
hypothetica! protein 
. EGF-Ske-domain,mul(ipIe6 
skbl (a pombe) homotog 
thioredo»n reductase 1 



4.13 


3.50 


1.00 


1.00 


130.23 


43.00 


22.03 


1.00 


3.65 


7.70 


1.00 


1.00 


1.53 


1.42 


1.89 


1.57 


1.47 


Z.11 


216.22 


363.00 


231 


205 


1.30 


1.49 


44.15 


17.00 


1.00 


1.00 


1.00 


12200 


0.66 


1.37 


1.43 


1.39 


1.00 


46.00 


9.92 


45.00 


25.05 


77.00 


3.61 


3.14 


3.00 


5.49 


1.00 


1.00 


1.92 


1.66 


27.22 


50.00 


5.00 


3.42 


181.59 


76.00 


10.59 


144.00 


109.23 


98.00 


1.00 


53.00 


29.02 


50.00 


1.00 


19.00 


1.00 


5.00 


22.85 


50.00 


25.33 


8200 


3.18 


4.41 


8.45 


64.00 


1.00 


27.00 


1,00 


24.00 


1242 


200 


128.84 


96.00 


0.02 


4.59 


1.00 


47.00 


18.52 


61.00 


4.02 


1.75 


2.98 


257 


1.00 


29.00 


1.00 


16.00 


1.00 


39.00 


39.81 


70.00 


48.74 


7.00 


1.29 


1.30 


1.75 


1.61 


1.35 


1.13 


1.00 


17.00 


1.33 


1.64 


5.71 


6.87 


1.47 


1.92 


1.00 


77.00 


1.00 


1.00 


1.00 


29.00 


1.00 


1.00 


0.60 


7.80 


2.91 


1-14 


1.00 


i nn 


469.00 


556.00 


128B 


105.00 


24.91 


90.00 


1.00 


in.vu 


1.00 


7U.UU 


30.56 


1QQ nn 
Ids .W 


1.00 


4 An 
I.UU 


77.02 


an nn 


1.57 


i.Ji 


7755 


o nn 
<c.UU 


1 nn 


27.00 


1.00 


73.00 


28.14 


50.00 


1.81 


262 


1.00 


1.00 


209 


1.70 


1.87 


70.00 


1.71 


272 


1.52 


1.34 


1.51 


1.52 



PCT/US02/12476 



164 



5 

10 
15 
20 

25 446356 

30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

445669 AJ570330 Hs. 174870 

445818 BE045321 

445873 AA250970 

445885 AI734009 

445898 AF070623 

445903 A1347487 

445932 BE046441 

445982 BE410233 

446078 AJ339982 

446102 AW168067 

446157 BE270828 

446269 AW263155 

446292 AF081497 

446293 AI420213 
446423 AW139655 
446428 AW082270 
446432 AJ377320 
446528 AU076640 
446574 AI310135 
446519 AU076643 
446536 ACQD2563 
446783 AW138343 
446839 BE091926 
446B49 AU076617 

. A1814373 

445872 X97058 

446880 A1811B07 

446921 AB012113 

446989 AK00189S 

447022 AW291223 

447033 A1357412 

447078 AW885727 

447081 Y13896 

447131 NMJXM585 Hs.17466 

447149 BE299857 Hs.326 

447153 AA805202 

447164 AF026941 

447178 AW594641 

447250 A1878909 

447289 AW247017 

447342 A1199268 

447343 AA256641 
447350 AJ375572 
447377 N27687 
447415 AW937335 
447425 AJ963747 
447519 U4625B 
447532 AK000614 
447534 AA401369 
447635 Y10043 
447688 N87079 
447733 AF157482 
447769 AW873704 
447802 AW593432 
447850 AB018298 
447924 AI817226 
447973 AB011169 
448030 N30714 
448105 A1538613 
448243 AW369771 
448278 WQ7369 
448290 AK002107 
448296 BE622756 
448357 BE274395 
448390 A10354U 
448469 AW504732 
448569 BE382657 
448863 BE614599 
448672 AI955511 
448733 NM_005629 Hs.187958 
448741 8E614567 Hs.19574 
448757 AI366784 
448775 AB025237 
448826 A15B0252 

-448830 AL03165B 

448844 AI581519 

448988 Y09763 

448993 A1471630 

449003 X76342 

449029 N28989 

449040 AF040704 

449048 Z45051 

449053 AI625777 

449054 AF148848 
449101 AA205847 



Hs.136017 

Hs.251946 

Hs.127699 

Hs.13423 

Hs.132781 

Hs.333555 

Hs.13501 

Ks.156061 

Hs.317694 

Hs.131740 

Hs.14559 

Ks.279582 

Hs.149722 

Hs.150120 

Hs.12498 

Hs.150058 

Hs.15243 

Hs.335933 

Ks.313 

Hs.15767 

Hs.141867 

Ks.16244 

Ks.16251 

Hs.164175 

Hs-16362 

Hs.108646 

Hs.16530 

Hs.16740 

Hs.157573 

Hs.157601 

Hs.9914 

Hs.17287 



Hs.315562 

Hs.17518 

Hs.192417 

HS.17B83 

Hs.36978 

Hs.19322 

Hs.236894 

Hs.172634 

Hs.334334 

Hs.28149 

Hs. 18573 

Hs.339665 

Hs.18791 

Hs.190721 

Hs.19236 

Hs.19400 

Hs.320831 

Hs.161455 

Hs.19822 

Hs.313413 

Hs.20141 

Hs.325960 

Hs.298241 

Hs.52620 

Hs.11782 

Hs.20843 

Hs.10949 

Hs.108923 

Hs.21068 

Hs.21275 

Hs.21486 

Hs.106823 

Hs.225106 



Hs.48820 

Hs.388 

Hs.293246 

Hs.22181 

Hs.177164 

Hs^2785 

Hs.389 

Hs.22891 

Hs.149443 

Hs.22920 

Ms.344766 

Hs.22934 

Hs.23016 



ESTs 
ESTs 

pdy(A}-bindmg protein, cytopiasrrtic 1-t 
WAA1603 prolan 

Homo sapiens dons 24468 mRNA sequence 

class \ cytokine receptor 

Homo sapiens done 24859 mRNA sequence 

pescadiilo (zebrafish) homolog 1. contal 

ESTs 

ESTs 

Homo sapiens cDNA: RJ 22562 6s, done H 

hypothetical protein FU10540 

Rh type C glycoprotein 

ESTs 

ESTs 

ESTs, Weakrysim^toALU4JWMAN ALU S 
ESTs 

nucleolar protem 1(1 20kD) 
ESTs 

secreted phosphoprotsJn 1 (osteoponfin, 
citron Irtwwnteracfing, serinefthreonm 
ESTs 

mitotic spindle coiled-coil related prat 
cleavage and polyadenylafion specific fa 
ESTs 

pyrimtfinergic receptor P2Y, G-croiein c 

Homo sapiens cDNA FU14934 fis, done PL 

small inducible cytokine subfarrfly A (Cy 

typoffietical protein FU1 1036 

ESTs 

ESTs 

ESTs 

potassium InwanJIy-fectHying channel, s 
retrnote add receptor responder (tazaro 
TAR (HIV) RNA-blnding protein 2 
ESTs 

Homo sapiens dg5 mRNA, partial sequence 
ESTs 

protein phosphatase 1 G (formerly 2C). ma 

melanoma antigen, famRy A, 3 

Homo sapiens, Simflar to RIKEN cDNA 2010 

ESTs, Highly similar to S02392 alpha-2-m 

ESTs 

transcription factor AP-2 alpha (activat 
ESTs, WeaHy similar to KF3B..HUMAN WNES 
acylphosphatase 1. erythrocyte (common) 
ESTs 

hypothetical protein FU20607 
ESTs 

Wgfwnobffity group (nonhistone chromoso 
Target CAT 

MAD2 {mitotic arrest deficient, yeast, h 
Homo sapiens cDNAFU 14597 fis, done NT 
ESTs 

SEC24 (S. cerevlsiae) related gene famil 
ESTs, WeaWy similar to T23110 hypothec 
similar to S. cerevisiae SSM4 
membrane-spanning 4-domains, subfamily A 
Transmembrane protease, serine 3 
mtegrin, beta 8 
ESTs 

Homo sapiens cDNA FU11245 fis, done PL 
Homo sapiens cDNA FU 14 162 fis, done NT 
RAB38. member RAS oncogene f amity 
hypothetical protein 
hypothetical protein FU1 1011 
signal transducer and activator of trans 
hypothetical protein MGC14797 
ESTs 

solute carrier farnHy 6 (neurotransmltte 
hypothetical protein MGC5469 
TATA box binding protein (TBP>-associate 
nudSx (nucleoside diphosphate finked moi 
ESTs, Weakly simBar to putative p150 IH 
hypothetical protein dJ310O13.3 
ESTs 

garnma^rwbutync add (GABA) A recepto 

K1AA01 44 gene product 

alcohol dehydrogenase 7 (dass IV), mu o 

solute carrier famDy 7 (caEorcc amino 

putative tumor suppressor 

similar to S63401 (cattle) glucose indue 

ESTs 

myeneurin 

G proteuvcoupled receptor 



10.95 

1.00 

4942 

1.00 

1.00 

1.00 

Z41 

1.60 

1.00 

1.00 

1.70 

73.01 

1.55 

1.00 

1.10 

0.53 

1.00 

1.35 

3.89 

32.03 

4.19 

Z82 

110.28 

3.26 

6J8 

1.99 

94.90 

1.67 

Z82 

1.00 

7.15 

47.24 

0.12 

0.97 

1.24 

1.00 

1.00 

3.42 

1.60 

1.00 

28.63 

146.62 

1.00 

2.55 

0.91 

1.00 

59.89 

1.23 

1.00 

1.41 

1.00 

1.17 

6.47 

a73 

86.45 

1.00 

3.50 

4.13 

1.15 

15.84 

0.97 

1.00 

2.42 

1.44 

1.00 

263 

1.84 

3.29 

1.00 

1.82 

2.48 

23.53 

£34 

74.07 

1.37 

1.00 

1.84 

1.63 

1.00 

1.97 

0.97 

27.13 

8.33 

73.85 

158 



11.45 
1.00 
54.00 
13ZG0 
1.00 
36.00 
•Z88 
1.35 
42X0 
1.00 
1.53 
48.00 
1.26 
ZOO 
4.19 
3.26 
5.00 

I. 31 
7Z0O 
20.23 
5.07 
9.47 
28.00 
Z94 

II. 30 
Z03 
113.00 
3.90 

ai2 

170.00 

107.00 

24.00 

17.88 

1.48 

1.26 

54.00 

67X0 

50.00 

1.52 

1.00 

1.00 . 

51.00 

12.00 

63.00 

1.13 

35.00 

49.00 

1.63 

17.00 

Ml 

39.00 

1.12 

5.95 

Z34 

116.00 

1.00 

4.27 

14Z00 

Z24 

1.00 

1.90 

1.00 

Z17 

1.08 

43.00 

249 

Z53 

46.00 

21.00 

1.08 

1.92 

20.00 

1.97 

6Z67 

U1 

31.00 

1.95 

1.49 

1.00 

Z28 

1.56 

90.00 

44.00 

104.00 

27.00 



PCTAJS02/12476 



165 



WO 02/086443 



PCT/US02/12476 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



449167 


T05095 


Hs.1 9597 


WAA1694 protein 


449207 


AL044222 


Hs.23255 


nudeoporin 15ScO 


449228 


AJ403107 


Hs.1 48590 


protein related with psoriasis 


449230 


BE613343 


Hs211579 


metanoma ceD adhesion mdecute 


449305 


A1638293 




gtdtQ9b07jt1 NC1_CGAP_GC6 Homo sapiens 


449318 


AW236021 


Hs.78531 


Homo sapiens, Similar to R1KEN cDNA 5730 


449446 


060730 


Hs.57471 


ESTs 


449467 


AW205006 


Hs.1 97042 


ESTs 


449523 


NM_000579 


Hs.54443 


crremoktne (C-C modf) receptor 5 


449722 


BE260074 


Hs.23950 


cycfinBI 


449976 


H06350 


Hs.1 35056 


Human DNA sequence from done RP5-85QE9 


450001 


NMJXJ1044 


Hs.406 


solute carrier family 6 (neurotransmttta 


450098 


W27249 


Hs.8109 


hypothetical protein FU2108O 


450101 


AV649989 


Hs.24385 


Human hbc547 mRNA sequence 


450149 


AW959781 


Hs.132883 


Zic family member 2 (odd-paired OrosopW 


450193 


AI916071 


Hs-15607 


Homo sapiens Fanooni anemia comptementat 


450221 


AA328102 


Hs.24641 


cytoskeieton associated protein 2 


450372 


BH218107 


HsJ202436 


ESTs 


450375 


AA009547 


Hs.8850 


a disintegrin and metafloproteinase doma 


450447 


AF212223 


Hs.25010 


hypothetical protein P15-2 


450568 


AL050078 


Hs.25159 


Homo sapiens cDNA FLM0784 ns, done Ni 


450589 


AI701505 


Ks^02526 


ESTs 


450S84 


AA872605 


Hs.25333 


interieukin 1 receptor, type II 


450701 


H39960 


Hs.2B8457 


Homo sapiens cDNA FU 12280 6s, clone MA 


450705 


U90304 


Hs.25351 


iroquote homeobox protein 2A (1RX-2A) ( 


450832 


AA401369 


Hs.1 90721 


ESTs 


450937 


R49131 


Hs.26267 


ATP-dependant interferon response protei 


450983 


AA305384 


Hs.25740 


EROJ (S. carevisiaeHika 


451105 


AI761324 




gbwiGOM Ijct NQ_CGAP_Co16 Homo sapiens 


451110 


A1955040 


Hs.265398 


ESTs. Weakly similar to transfofmation-r 


451253 


H48299 


Hs26126 


daudinlO 


451291 


R39288 


Hs.6702 


ESTS 


451320 


AW498974 




dlacytgtyceroi kinase, zeta (104kD) 


451380 


H09280 


Hs.1 3234 


ESTs 


451386 


AB029008 


Hs.26334 


spastic paraplegia 4 {autosomal dominant 


451437 


H24143 


Hs.31945 


hypothetical protein FU 11071 


451462 


AK000367 


Hs.26434 


hypothefca) protein FU20360 


451524 


AK001466 


Hs26516 


hypothefica! protein FU10604 


451541 


BE279383 


Hs.26557 


ptakophiUn 3 


451592 


AI805416 


Hs.213897 


ESTs 


451635 


AAQ18899 


Hs.127179 


cryptic gene 


451743 


AA401369 


Hs.t90721 


ESTs 


451808 


NMJKJ3729 


Hs.27076 


RMA 3'-terminal phosphate cyclase 


451807 


W52854 




hypothetical protein FLJ23293 simitar to 


451871 


AI821005 


Hs.1 18599 


ESTs 


451952 


AL120173 


Hs.301663 


ESTs 


452012 


AA307703 


Hs.279766 


fcinesin family member 4A 


452046 


AB018345 


Ks27657 


K1AA0802 protein 


452194 


AI694413 


Hs.332649 


olfactory receptor, famfiy 2, subfamily 


452206 


AW340281 


Hs.33074 


Homo sapiens, clone IMAGE:3606519, mRNA, 


452240 


AA401369 


Hs.190721 


ESTs 


452256 


AK000933 


Hs.28561 


Homo sapiens cDNA FU 10071 5s, dene HE 


452281 


T93500 


Hs.28792 


Homo sapiens cDNA FU 11041 fis, clone PL 


452291 


AF015592 


Hs.28853 


COC7 (cell division cyde 7, S. cerevlsi 


452295 


BE379936 


Hs.28866 


programmed cell death 10 


452304 


AA025386 


Hs.61311 


ESTs, Weakly simBar to S 10590 cysteine 


452340 


NMJQ2202 


Hs.505 


ISL1 transcription factor, UM/homeodoma 


452349 


AB028944 


Hs.29189 


ATPase, Class VI, type 1 1 A 


452367 


U71207 


Hs.29279 


eyes absent (Drosophita) homoiog 2 


452401 


NMJQ7115 


Hs29352 


tumor necrosis factor, alpha-induced pro 


452410 


AL133619 




Homo sapiens mRNA; cDNA DKFZp434E2321 (f 


452461 


K7S223 


Hs.1 08105 


transcription factor 


452571 


W31518 


Hs.34665 


ESTs 


452613 


AA461599 


Hs23459 


ESTs 


452699 


AW295390 


Hs.213062 


ESTs 


452705 


H49805 


Hs246005 


ESTs 


452747 


AF160477 


Hs.61460 


tg superfamfly receptor LNIR 


452787 


AW294022 


Hs,222707 


KIAA1718 protein 


452795 


AW392555 


Hs.1 8878 


hypothetical protein FU2162U 


452823 


AB012124 


Hs.30696 


transcription factor-Eke 5 (basic helix 


452633 


BE559681 


Hs.30736 


KIAA0124 protein 


452838 


U65011 


Hs30743 


preferentially expressed antigen in meia 


452862 


AA401369 


Hs.190721 


ESTs 


452865 


AW173720 


Hs.345605 


ESTs, Weakly simuar to A4758Z Been gr 


452934 


AA581322 


Hs.4213 


hypothetical protein MGC16207 


45&40 




n5.0 Ivrac 




452976 


R442U 


Hs.101189 


ESTs 


453028 


AB005532 


Hs.31442 


RaQ protein-like 4 


453095 


AW295560 


Hs252756 


ESTs 


453102 


NM_007197 


Hs.31664 


frizzled (Drosophila) homolog 10 


453103 


A1301052 


Hs.153444 


ESTs 


453120 


AA292891 


Hs31773 


pregnancy-Induced growth inhibitor 


453153 


N53893 


Hs.24360 


ESTs 


453160 


A1253307 


Hs239884 


H2B histone famiry, member I 


453197 


AI916269 


Hs.109057 


ESTs, Weakly similar to ALU5_HUMAtf ALU S 



1.61 


2-35 


236 . . 


1.50 


1.15 


1.15 


206.65 


4ci nn 


1728 


45 00 


26,39 


OO.UU 


1.00 


4 nn 
1.00 


4 nn 
1.00 


4 nn 
I.UU 


56 80 


Li 0.00 


150-03 


I.UU 


Z16 


£.00 


1.17 


4 AS. 
1.40 


1.79 


LSo 


4 ATI 

1.00 


cq nn 


1.00 


4 nn 
l.UU 


29.85 


•Ji Art 


1.00 


4 nn 

l.UU 


1.00 


4 nn 
l.UU 


51.26 


oi nn 


12120 


181.00 


1.00 


40 nn 


i nn 
l.UU 


won 


4 nn 

1.00 


inn nn 
1UU.UU 


1.89 


4 CC 
1.00 


4 nn 
l.UU 


Ac nn 
•KJ.UU 




17 nn 


90.92 


on nn 


3.33 


4 7n 
l./U 


15.02 


124.00 


1.00 


143.00 


3.02 


Z29 


1.00 


1.00 


2.92 


18.00 


6.90 


0.67 


35.75 


72.00 


1.00 


69.00 


1.83 


Z10 


1.13 


1.07 


1.88 


1.33 


1.00 


1.00 


1.52 


1.92 


4.95 


17.00 


13.55 


31.00 


1.55 


35.00 


1.81 


2.53 


1.00 


22.00 


3.43 


2.26 


56.59 


19.00 


1.67 


4.09 


9.31 


53.00 


13.42 


17.00 


39.03 


oa nn 


153.01 


4An nn 
04U.UU 


1.95 


oi nn 


i«> 11 

42.33 


£4 nn 
ol.UU 


4 At 

1.17 


£.14 


1.00 


4i nn 
1J.UU 


1.09 


1 AO 

1.4Z 


54.49 


ci nn 
00. UU 


4 nf\ 
1.00 


19 nn 
o£UU 


1.26 


4 OQ 

i.yy 


24.47 


ic nn 


54.61 


mo nn 


4 1& 

1.39 


4 19 


4 nn 
I.UU 


oc nn 

ZD.UU 


4 fin 
I.UU 


1 nn 

l.UU 


1 lZ.Of 


1 90 


4 nn 
I.UU 


1 nn 
l.UU 


1 nn 
I.UU 


1 nn 
l.UU 




7c nn 




1 Q9 


47/ -ac 
1/4. 39 


1 no 

l.UU 


Qfl Oft 


17 nn 

II.W 


4 CC 
1.00 


1 nn 
l.UU 


1.73 


1 40 

1.18 


1 on 


1.00 


1.58 


198 


1.80 


1.60 


0.77 


1.50 


1.00 


1.00 


1.00 


1.00 


123 


120 


1.00 


83.00 


1.00 


30.00 


1.00 


134.00 



166 



WO 02/086443 PCT7US02/12476 



453210 


AL133161 


Hs.32360 


hypothefcai protein FU 10657 


1.69 


1.93 


453240 


AI959554 


Hs.166254 


hypdheflcaJ protein OKFZp5661133 


* 1.00 


1.00 


453317 


NM.0G2277 


Hs.41696 


keratin, hair, ackfic.1 


1.19 


1.27 


453323 


AHJ34102 


Hs.32951 . 


solute carrier family & {nucleoside tra 


4.90 


4.11 


453331 


A1240655 


Hs.8850 


ESTs 


199l42 


moo 


453392 


U23752 


HSJ2964 


SRY (sex determining region Y)-box 1 1 


1.00 


iaoo 


453431 


AF094754 


Hs.32973 


glycine receptor, beta 


1.00 


1.00 


453439 


AI572438 


Hs.32976 


• guanine nucleotide binding protein 4 


3.44 


5.17 


453459 


BE047032 


Hs.257789 


ESTs 


284 


5.58 


453563 


AW60890S.comp 


HS.18U63 


hypothetical protein MGC5629 


453533 


AA357001. 


Hs.34045 


hypothefcaJ protein FU20764 


1.74 


1.60 


453775 


NM.002916 


Hs.35120 


repGcaiion factor C (activator 1) 4 (37 


19.49 


1.00 


453830 


AA534296 


Hs.20953 


ESTs 


24.92 


25.00 


453857 


AL080235 


H&.35B61 


DKFZP586E1S21 protein 


167.59 


66.00 


453867 


AI929383 


Hs.33032 


hypothetical protein DKFZp434N185 


1.00 


39.00 


453883 


AI638516 


Hs.347524 


cefaclor required for Sp1 transcripSona " 


1.97 


1.58 


453884 


AA355925 


Hs.36232 


WAA0185 gene product 


63.89 


20.00 


453900 


AW003582 


Hsi26414 


ESTs, Weakly simDar to ALU8_HUMAN ALU S 


20.41 


16.00 


453922 


AF053306 


Hs.36708 


budding uninhibited by benzimidazoles 1 


7.09 


22.00 


453941 


U39817 


Hs.36820 


Blxm syndrome 


29.75 


19.00 


453964 


AJ561486 


Hs.12744 


ESTs 


1.00 


1.00 


453968 


AA847843 


Hs.62711 


Homo sapiens, clone 1MAGE:3351295, mRNA 


2.06 


1.81 


453976 


BE463830 


Hs.163714 


ESTs 


3.02 


131.00 


454024 


AA993527 


Hs^ 93907 


hypothetical protein FU23403 


1.00 


131.00 


454034 


NM.000691 


Hs.575 


aldehyde dehydrogenase 3 famfly, member 


1.23 


1.02 


454042 


TJ9228 


Hs.172572 


hypolheScai protein HJ20093 


30.63 


171.00 


454059 


NM_003154 


Hs.37048 


statherin 


1.00 


1.00 


454066 


X00356 




(^tontn/calcttonirHelated potypeptid 


1.01 


1.45 


454098 


W27953 


Hs.292911 


ESTs, Highly similar to S60712 band-&pr 


1.28 


111 


454241 


BE144666 




gb:CM24iT0176441099-017.c02 HT0176 Homo 


6.33 


5.04 


454417 


AI244459 


Hs.1 10326 


IrinucleoSde repeat containing 9 


4.30 


7.82 


454439 


AW819152 


Hs.1 54320 


DKFZP56601646 protein 


1.00 


1.00 


455175 


AW993247 




gb:RC2-8NQ033-1802G0-014-h09 BN0033 Homo 


13.75 


103.00 


455601 


AI368580 


Hsi16 


SRY (sex determining region Y)-box 2 


206.11 


1.00 


456237 






gb:zx52e07.r1 Soares Jeiai Jrver_s pteen_ 


1.00 


1.00 


456321 


NM 001327 


Hs.87225 


cancef/testis antigen 


l!l4 


110 


456475 


kju 000144 


Hs.95998 


Friedreich ataxia 


1.00 


48.00 


456508 


AA502764 


Hs 123469 


ESTs, WeaHy similar to AF208855 1 BM-01 


162.25 


189.00 


456534 


X91195 


Hs 100623 


phosphatase C, beta 3, neighbor pseudo 


2.12 


1.80 


456738 


AW248217 


Hs.1 61 9 


achaete-scute complex (Dmsophila) homol 


1.15 


194 


456759 


BE259150 


Hsj 27792 


delta (DrosophBaH&e 3 


1*00 


100 


456990 


NMJH4504 


Hs.171545 


HlV-1 Rev binding protein 


16.42 


84.00 


457200 


U33749 


Hs.197764 


thyroid transcription factor 1 


0.57 


176 


457234 


AW968360 


Hs.14355 


Homo sapiens cONA RJ13207 Ms, clone NT 


271 


4.15 


457465 


AW301344 


Hs.122908 


DNA repficafion factor 


46.37 


47.00 


457489 


A1693815 


Hs.127179 


cryptic gene 


1.12 


135 


457646 


AA725650 


Hs.112948 


ESTs 


1.55 


2.51 


457733 


AW974812 


Hs.291971 


ESTs 


1.00 


55.00 


457819 


AA057484 


Hs.35406 


ESTs, Highly similar to unnamed protein 


4.36 


3.18 


458092 


BE545684 


Hs.343566 


K1AA0251 protein 


1.00 


132 


458098 


BE550224 




metailothionein 1E (functional) 


100 


2200 


458207 


T28472 


Hs.7655 


U2 smafl nuclear ribonudeoprotef n auxii 


206 


1.88 


458242 


BE299588 


Hs.26465 


Homo sapiens cDNA: RJ21859 Rs, clone H 


1.00 


100 


458247 


R14439 


Hs.209194 


ESTs 


7.00 


9.85 


45B679 


AW975460 


Hs.142913 


ESTs 


1.00 


3,00 


458778 


AW451034 


Hs.326525 


aryteuifefase D 


1.31 


201 


456933 


AI638429 


Hs.24763 


RAN binding protein 1 


1.98 


171 


459352 


AW810383 


Hs.20682B 


ESTs 


1260 


63.00 


459670 


F01020 


Hs.172004 


mn 


1.00 


100 


459702 


AI204995 




gb:an03c03.x1 Stratageneschtzo brain S1 


1.00 


237.00 



TABLE 9B 



Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
407746. 



CAT Number 
10125J 



408070 


1036688L1 


408650 


107294J 


409522 


113735J 


409856 ' 


1156522J 


410032 


1170435J 


411089 


123175L1 


411152 


1234028J 


412537 


1304.1 



Accession 

AK001 962 R69415 BE464605 AA418699 AA053293 AA149075 AA058396 AW338226 AW272659 AA454607 AI139535 AW469852 A1275461 
AW271982 AA730033 AA576507 AA991217 AA782067 AI985851 AA805864 AA505598 AW469857 R69546 AA988279 AW001647 N63320 
D82661 T27343 AA305950 AA360989 R58778 
AW148852BE350895 

AA525775 AA056342 AI538978 AW975281 AA664986 

AA075382AA075431 

AW5O2152H41202 H29772 

BE055985 BE065944 BE0E6008 BE066083 BE066093 

AA456454 AA713730 AA091294 AA584921 N86077 AW836781 AA601 031 AA579876 AA551 1 06 AA6331 88 AW905577 AI955808 A1679386 
A1679895 AA514764 AA454562 AI082382 AA595822 AA551351 AA586369 AA656384 AA1 88934 AA665398 AA551297 AA565188 
BE069199 AW936012 AW877466 AW8197B2 AW935798 AW835546 AW936042 BE069121 AW835625 AW877536 AW935885 BEG69202 
AW820019AW935937BE160180AW935946BE059101 BE059125AW877527BE160316BE160398AIV935794AW835701 AW935784 
AL031778 X59711 NM 002505 K69079A1870439AI494259AW654010AM05£)63AA436132BE174516AA4 

T29403 BE079412 BE079428 N90322 A1631202 A1141758 A1016793 A1167566 AI862075 A1375230 A1208445 AW235763 AL044113 AA382556 
AW953918 AA927051 AA889823 BE003094 AW390155 AW360805 AW360823 AW360810 AA425472 AI694282 AU3441 14 A1684577 A1809865 



167 



WO 02/086443 



412811 132343J 



413630 
414883 



417324 
418574 

418712 
419443 
419502 



419938 
4215B2 



422128 
423034 
423816 
424200 
424999 
426966 
426991 
427260 
428023 



429220 
429978 



430439 
430935 
431089 
431322 
432407 



1383256.1 
15024.1 



156454J 
165714J 
17690J 

1784125J 
184788J 
18535J 



189181 1 
2041J 



211994J 

224122.1 

23234.1 

236595.1 

245835.1 

273896 1 

27415J 

276598.1 

28589_2 



301384J 
31 150 J 



31808J 
325772.1 
327B25J 
331543.1 
34624 1 



434414 38585J 



42361.3 



PCT/US02/12476 . 

AJ478773 A! 1 60445 AI674630 N69088 AW565529 N49278 AI1 29239 A1457830 AI621 264 AY/2971 52 A126821 5 AA907787 A12851 70 AI01 7982 
A1963541 A1469807 A1969353BE552356N66509AA736741 AA382555AW075811 AW292026 

H06382 AW957730 AA352014 R13591 AA121201 D60420 BE263253 BE047862 241952 AI424991 AJ693507 A1863103 AA599050 A1091 148 
AA59SS89 R39837 AA813482 AW016452 HQS383 R41807 AI354268 AA620528 A/241940 AVW89149 AW090733 AWC88575 Z38240 
AA121202R17734 
BE157489BE157560 

AA926960 AA926959 W76521 W24270 W21526 AA037172 BE267635 H83186 AA469909 N86395 AA001 348 BE535736 AA081745 8E566245 
AA082436 H72525 H77575 N49786 \V805S5 H78746 BE569085 W04339 R98127 T55938 BE279271 AW960304 T29312 AA476873 BE297387 
AA292753AA177048NM_001826 X54941 BE31 4366 AA9D87B3AI71 9075 BE2701 72 BE269819AA889955AI204530W25243AI93515O 
AAB72039 W72395 T99630 AI422691 H98460 N31428 BE25591 6 H03265 AI857576 AA776920 AA91 0644 AA459522 AA293140 AW514667 
K75953 AW6623S6 AA562522 AI865147 AI4231 53 AW262230 AA584410 AA583187 AW024595 AW069734 AI828996 AA282997 AA876046 
AW613002 AA527373 AW972459 A1831360 AA521337 AA100926 AA772418 AA594628 AI033892 W95096 AI034317 AA398727 AI085O31 
N95210 AI459432 A1041437 AA932124 AA627684 AA935829 A1004827 AI423513 AI094597 H42079 R54703 AI530359 M617681 AA978045 
AA643280 W44561 AI991988 AI537692 AI090262 AA740817 AI312104 AI91 1822 AA416871 AI185409 AA129784 AA701623 AI075239 
AI1 39549 AA633648 AI339996 AI336880 AA399239 AI078708 AI0B5351 AI362835 AI34661 8 AJ146955 AI989380 AI348243 N92892 AA765850 
AI494230 A1278887 AA962596 AI492600 W80435 AA001979 R97424 AI129015 N24127 AA157451 AA235549 AA459292 AA037114 AA129785 
AI49421 1 AW059601 AW886710 R92790 N59755 AI361 1 28 AW589407 H47725 H97534 H48076 H48450 T99531 AW300758 H03431 R76789 
AA954344 H77576 R96823 A/457100 N92845 N49682 H42038 BE220698 BE220715 H99552 M701624 N74173 R54704 H79520 H72923 
K03266BE261919M769633 M480310AA507454AA9105B6AJ203723AW1(W^ 
W95095 R97470 AA702275 T77551 AA91 1952 H82956 N83673 AA283672 

AI267700 AI720344 AA191 424 A1023543 A1469633 AA172056 AW958465 AA172236 AW953397 AA3550B6 

AW265494 AA455904 AA1 95677 AW2654 32 AW991605 AA456370 

N28754 N28747AI568146AJ979339AA322671AA322672AW955043^ 

W70051 AI038748 AAB31 327 AI925B45 AW945895 

Z42183T31621T97478 

D52703AA242966D79798 

AU076704 T74854 T74860 T72098 T73265 T73873 T691 80 T74658 T58786 T60385 T73410 T68781 T67845 T67593 T73952 T67864 T60630 
T68367T68401 T53959T72360T72099T60377T58961 T71712T7282I T64738 T74645 T72037 T68688 T72063 T73258 772826 T54242 
T68220 T74673 T71B0O T68355 T61227 T62738 T69317 T53850 T64692 T7376B T73962 T73382 T6B914 T70975 T73400 T60631 T73277 
T73203 T70498 T61409 T58925 NW.000508 M54982 T68301 T73729 T69445 T60424 T67922 T67736 T68716 T67755 T74765 T73819 T58719 
T74756 T60477 T74863 T61 109 T68329 T58B50 T71857 T73425 T53736 T68607 T58898 T64309 T72031 T72079 T64305 T71908 T68107 
T71916 T73787 T56035 T64425T71870 T60476 T61376 T67820 T71895 T41006 T69441 T68170 T74617T71958 T69440 T61875 R06796 
H48353 T71914 T53939 T64121 AA693996 T72525 T67779 T68078 AA011465 AA345378 AV654847 AV654272 AV556001 AI064740 T82897 
N33594 AA344542 AW805054 AI207457 T61743 AA026737 H94389 AA382695 AA918409 T68044 S82092 T39959 AIQ17721 AA312395 
AA312919T40156 H66239 AV652989 H38728 R98521 AV655200 R95790 W03250 W00913AA344136 AV660126 R97923AA343596 
AW470774 AV651256 N5441 7 AA812862 AW182929 AJ1 1 1 192 H61463 H72060 AA344503 H38639 AI27751 1 AV681 108 AI207625 T47810 
AA235252 T27853 T47778 R95746 H70620 AA701463 AW8271 66 R98475 C20925 AV657287 T71959 T71313 T73920 T73333 T61 618 T69293 
T69283 T73931 T721 78 T72456 AV645639 AV653476 T72957 172300 T58906 T71457 T70494 T72956 T70495 T68267 T74407 T85778 
AA344726 T27854 T74485 T74101 T73868 T71 518 T72304 AA343853 T73909 T6807O T72065 H721 49 T73493 T73495 AV645993 RD2293 
T70475 T64751 AA344441 AA343657 AA345732 AA344328 All 10639 AA344603 AF06351 3 T64696 T6851 6 T72223 T60507 T67633 R29500 
T72517 R02292 T60599 T69206 T70452 T74677 R29366 T61277 T74914 T60352 R29675 T74843 AV645792 AA344408 T691 97 T72057 
T69368 T69358 T6825B AV650429 T73341 T61702 T74598 T40095 K02272 T401 06 AA343045 AA341908 AA341907 AA342807 AA341 964 
T53747 T72042 T62764 AI064899 AA343060 T67832 T72440 T71770 T68091 T69108 T72449 T69167 T71289 T68251 AV654844 T64375 
AA345234 T67598 AA011414 T68036 H48262 AI207557 T68219 W86031 T69081 T64232 R93196 T62136 AV650539 H67459 772978 
AA3445B3 T60362 H58121 T95711 T72B03 T68055 T71715 R29036T72793 T69122 T64595 T62888 T69139 T68291 T64652 T67971 T46B62 
AA693592 AI248502 R29454 T64764 T57001 T73052 T71429 T51 176 T5B866 AV655414 H90426 AA342489 T73666 T67848 T72512 T53835 
T67837 T73317 T74273 T69420 T68245 T74380 T67862 T74474 T56068 
AI792788BE142230AA252019 

A191 0275 X00474 X52003 X05030 NM.003225 AA314326 AA308400 AA506787 AA314825 A1571948 AA507595 AA614579 AA587613 R8381 8 

AA568312 AA614409 AA307578 AI925552 AW950155 AI910083 M12075 BE074052 AW004668 M578674 AA582084 BE074053 BE074126 

BE074140AA514776AA588034BE074051 BE074068AW009769AW050690AA858276 R55389AI001051 AW050700AW750216AA614539 

BE074045 A1307407 AW602303 BE073575 AI202532 AA524242 AI970839 A1909751 BE076078 AI909749 R55292 

AW881145AM90718M85637AA304575 T06067AA331991 

AL119930 AA320696 AW752565 

AL031985 AL137241 AI792386 A/733664 A/857654 AI04991 1 

AA337221 AA335756AW966196 

AW953120 R56325 AA349562 

AJ493134 AJ498691 AW771508 AI498457 AI768408 AI783624 AJ383985 AI580267 D79813 AA393768 
AK001 536 AA1 91092 AW510354 AI554256 AL35396B AA134266 
M663848 AA400100 AA401424 

AL038843 AA161338 BE268213 AA425597 N87306 AA092969 BE566038 AA247451 N47392 AI928802 AW1 82584 AW027872 A1819831 
A1936994 W56258 A1653448 A127861 1 AI2B3557 AIB24306 AW338658 AW150899 AA68751 4 N47393 N29885 AA973469 AI03B904 A1292064 
AI034339 AW674593 N72156 AI079733 AI038683 AI291616 AA491599 AA993675 AA837380 BE008554 BE005473 AI087090 T33044 
AA652043 A1203503 AA583959 W35283 AI129926 Z41844 AW020925 AW57584B AI684603 AA493297 AJ140689 A12771 75 AA425444 
AI932767 W02632 BE398786 R37261 
AW207206 AW341473 AA448195 AI951341 

AA249027 AL038984 AK001993 ALOB0065 AV652725 BE566226 AA345557 AA315222 AA090585 AA375688 AA301092 AA298454 W05762 
AW507939 H51658 083880 N84323 BE296821 AW947007 D61461 AW079261 AA329482 AW901780 A1354442 AA772275 R31663A1354441 
A1767525 H92431 AI91 6735 H93575AJ394255AW01 4741 AI573090 C06195 AW612857 AW265195A1339558A1377532 AI308821 AI919424 
AI589705 AW055215 A1336532 A1338051 AA805547 C75509 C00618 AW071 172 AW769904 AA630381 A167801 8 AI863985 D79662 BE221 049 
AW265018 A1589700 AW196655 N76573 AI370908 BE042393 N75017 AI898870 AW9601 15 
AL133561 AL041090AL117481 AL122069 AW439292 AI968826 

AW072916 AI184913 AA489195AW466994 AW469044 N59350 AI819642 AI2B0239 AI220572 AA7B9302 AI473611 AW841126 D60937 

BE041395AA491826AA621946M715980AA666102 

AW970622 AA503009 AA502998 AA502989 AA502805 T92188 

AA221035 R87170 BE537068 BE544757 C18935 AW81 2058 T92565 AA227415 AA233942 AA223237 AA658403 AA601627 AW869639 
BE061833 BE000620 AW961 170 AW847519 AA308542 AW821833 AW9456B8 C04699 AA205504 AA377241 AW821667 AA055720 
AW81 7981 AWB56468 AA1 55719 AA1 79928 T03007 AW754298 AA227407 AA1 13928 AA307904 C1 6859 

A/798376 S46400 AW8 11617 AW8 11616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567 AA055556 AW65B231 
AW857541 AW814172 H66214 AW814398 AF1 341 64 AA243093 AA173345 AA199942 AA223384 AA227092 AA227080 T12379 AA0921 74 
T61 139 M1 49776 M699829 AW87918B AW813567 AW81K38 AOT^ 

AA1 57730 AA1 5771 5 M053524 AWB49581 AW854566 C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE1 56909 AA226824 
AI829309 AW991957 N66951 AA527374 H66215 AA045564 AI694265 H60808 AA149726 AW1 95620 BE081333 BE073424 AW817862 
AW817705AW817703AWBi7559Ba)81531 H59570 
AA628980A1126603BE504035 



168 



WO 02/086443 



PCTAJS02/12476 



5 
10 
15 
20 
25 
30 
35 
40 
45 



439000 
439285 

W780 
441128 

443068 
443947 
447636 



467716J 
470B5J ' 

47673.1 
51021.2 

558874.1 
586160.1 
7301J 



438091 44954 1 AW373062 T55S62AJ299190BE174210AW579001 H01811 W40186RS7100AI923888AW952164AA628440AW898507AW898616 

AA709126 AW898828 AWB98544 AA947932 AW898S25 AW898622 AI276125 AMB5720 AW510698 AAS87230 T52522 BE467708 AVV243400 
AW043542AI288245 All 86932 D52654 D55017DS2715 D52477 D53933 D54679 A1298739 A11 46984 AI922204 N98343 BE174213AA845571 
AI813854 AI21451 6 A1635262 AI139455 AJ707807 AJ698085 AW884S23 A1Q24768 A1004723 AW087420 AI565133 N94964 AI263939 
' AW513230AI061126AM35818A1859105A1360506AI024767AA513019AA757598 X56196AA9^ 

AW830091 AW513771 A1951391 AJ337671 T52499AA890205 A1640908 H75966AA463487 AA358688 A1961767 AJ865295AA780994 
AI985913BE174196AA029094AW592159T55581 N79Q72 A1611201 AA9KB12AI220713AW149306AI758412AA045713 R79750 N7609S 
AW979121 AAB47986 AA829098 

AL133916 N79113 AF086101 N76721 AW950828 AA364013 AW955684 AI346341 A1867454 N54784 AI655270 A1421279 AW014882 
AA775552 N62351 N59253 AA626243 A1341407 BE175639 AA456968 AI358918 AA457077 
AL109688R23S65R26578 

AA57Q256AW014761 AA573721 AI473237 AI0221 65 AA554071 AA127551 N90525AW973823 M447991 AA243852 BE328B50AI148171 
AI359827 A1005068 AJ356567 AA232991 AW01 6855 AA906902 AA2331 01 AA1 27550 BE512923 
AI188710A1032142AW078833 N30308 AW675532AI21902BAI341201 N22181 H95390 
W24187W24194R17789 

Y10043 NMJXJ5342 L05085 AL034450 BE614226 AW749053 AA379173 AA248230 BE514634 AA334622 R70656 AA367593 AA214649 
AA369318 AW957081 R05760 AA039903 AI886597 AW630122 AA906264 AA041 527 R01145 A1088688 BE463637 AA398795 A1354883 
AI768938 AI569996 AI452952 A) 168582 A11 89889 AI086670 AW262560 AW61 3854 AA862839 AA435840 AA670197 AI024032 A1990659 
AI990089 N81095AA847919 AW960150 AA211075AA044704 AA367594AW5825B7 AW858854AW818630AW818281 AW818433AW582595 
AA096002 N83992 

448993 79225.1 AI471630 BE540537 BE265481 AW407710 BE513882 BE546739 AA053597 BE140503 BE218514 AW956702 AI656234 AI636283 AI567265 

AW340358 BE207794 AA053085 R69173 AA292343 AA454908 AA293504 AI659741 A1927478 AA399460 AI760441 AA34641 6 BE047245 
AA730380 AA394063 AA454833 AI982791 A1567270 A1813332 AI767858 AA427705 D20284 AI221458 BE048537 AI263048 AA345417 
AA911497BE537702 
8Q4424J AI538293AW813561 
859083 1 AI761324AW880941AW880937 

86576 1 AW1 1 6072 AI631 982 T15734 AA224195 AI701458 W20198 F26326 AA890570 N90552AW071907 A1671352 AI375892 T03517 R88265 

AI124088 AA224388 AI08431 6 A1354686 T33652 A1140719 A172021 1 T03490 AI372637 T1 5415 AW205B36AA630384 T03515T33230 
AA017131 AA443303 T33623 AI222556 T33511 T33785 AI419606 D55612 

W52854 AL117600 BE20B116 BE208432 BE206239 BE082291 AW953423 AA351619 BE180648 BE140560 W60080 AA865478 N90291 
AW450652AW449519AA993634A1806539AA351618AW449522AI827626AA904788 M380381AA8 

AL133619 AA468118 AA383064 AI476447 T09430 AI673758 AA524B95 AI581345 A1300820 AW49B812 AA256162 A1559724 AI685732 
AA602400 AA905453 AI204595 AW166541 AA1 57456 AA156269 AA3B3652 AA431 072 AW592707 A1435410 AW272464 AI21 5594 AA622747 
R74039 N35031 AI804128 AW513621 AA86B351 AI026826 A1493388 AA614641 W81604 AI567080 A1214351 AA730140 AJ125754AI200813 
A1269603 A1565082 A1807095 AI476629 AA505909 AI368449 AI686077 AI582930 AW085038 AA757863 AA7301 54 AI767072 AA468316 
A1734130 AI734138 AA426284 AA433997 A1741241 AW043563 AI732741 AI732734 AA437369 AA425820 AA664048 R74130 
1067807 1 BE144666BE184942AW238414BE184946 
1257335 1 AW993247AWB61464 -■ • : 

168730 1 AA203682 R11958 

47395 1 BE550224 AA832519 N45402 AW885857 N29245 BE465409 W07677AW970089 AI299731 AA482971 BE503548 H18151 W79223 AF086393 

AA461301 W74510R34182AI090689 N46003BE071550 R28075AW134982^ 
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TABLE 9C 



449305 
451105 
451320" 



451B07 . 8865.1 
452410 9163.1 



454241 
455175 
456237 
458098 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Idenfifier (GI) numbers, "Dunham I. et aL" refers to the publication entitled The DNA 

5 0 sequence of human chromosome 2Z" Dunham I. et at, Nature (1 999) 402:489495. 

Strand: Iricficales DNA strand from which exons were predicted 

HI jJOsiQon: Indicates nucleotide positions of predicted exons. 
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Pkey 


Ref 


Strand 


NLposition 


400512 


9796593 


Minus 


1439-1615 


400517 


9796688 


Minus 


49996-50346 


400560 


9843598 


Plus 


94182-94323,97056.97243,101095-101236,102824-103005 


400664 


8118496 


Plus 


13558-13721,13942-14090,14554-14579 


400665 


8118496 


Plus 


16879-17023 


400566 


811B496 


Pius 


17982-18115,20297-20456 


400749 


7331445 


Minus 


9162-9293 


400763 


8131616 


Minus 


35537-35784 


401027 


7230983 


Minus 


70407-70554,71 060-71 160 


401093 


8516137 


Minus 


22335-23166 


401203 


9743387 


Minos 


172961-173056,173868-173928 


401212 


9858408 


Plus 


87833-88028 


401411 


7799787 


Minus 


144144-144329 


401435 


8217934 


Minus 


54508-55233 


401464 


66B2291 


NSnus 


170688-170834 . 


401714 


6715702 


Plus 


96484-96681 


401747 


9789672 


Minus 


118596-118816,119119-1192^119609-119761^ 
131932,132451-132575,133580-134011 


401760 


9929699 


Plus 


83126-83250,85320-35540.9471 9-95287 


401780 


7249190 


Minus 


28397-28617,28920-29045.29135-29296.29411-29567,29705-29787,30224-30573 


401781 


7249190 


Minus 


83215^3435,83531-83656,83740^901,84237-84393,84955^85W7,8629(W6814 


401785 


7249190 


Minus 


165776-16599S.166189-1B6314 ( 165408-1B6569,167112-16726ai67387-1S7469.168634-l68942 


401797 


6730720 


Plus 


6973-7118 


401961 


4581193 


Minus 


124054-124209 


401985 


2580474 


Plus 


61542-61750 


401994 


4153858 


Minus 


42904-43124.4321 1-43336.4460744763,4519945281,46337-46732 


402075 


8117407 


Plus 


121907-122035.122804-122921,124019-124161.124455-124610.125672-126076 


402260 


3399665 


Minus 


113765-113910.115653-115765.116808-116940 


402265 


3287673 


Plus 


21059-21168 


402297 


6598324 


Plus 


35279-35405.35573^5559 


402408 


9796239 


Minus 


110326-110491 
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402420 


9796339 


Pius 




402674 


8077108 


Minus 




402802 


32S7156 


Minus 


COOAO MAW 


402994 


2996543 


Minus ' 


HI ti-Wtw 


403137 


9211494 


Minus 


QTU(LQ3CT9 Q9QttLQ3flRd 9 O VJA9l9407^<U591 QJW1A.QM37 


403306 


8099945 


Plus 


Mliw-icIlOi 


403329 


8516120 


Plus 


OC4JIfLOfi£Qff 


403381 


9438267 


. Minus 


9RWKL9R17A 


403478 


9958258 


Plus, 


IlOWO-l COOW 


403485 


9966528 


Plus 




403S27 


8569879 


Minus 




403715 


7239669 


Phis 




404044 


9558573 


Minus 




404076 


9931752 


Minus 




404101 


8076925 


Minus 




404140 


9843520 


' Bus 


T77C1 M1/T7 

O/fDI-JOl**/ , 


404165 


9926489 


Minus 




404185 


4572584 


Minus 




404210 


5006246 


Plus 


t033tD- i rum 


404253 


9367202 


Minus 


900/ O-OOVOO 


404287 


2326514 


Plus 


531 34-53281 


404298 


9944263 


Minus 


/JOSI-f J/ZO 


404347 


9838195 


Plus 




404440 


7528051 


Plus 


OU4oU-b)DD) 


404721 


9656648 


Minus 




404794 


4826439 


Pius 


4n4A4G_1 M fiOfi 

luioiy-iuioyo 


404854 


7143420 


Plus 


14^oU-iHOa/ 


404877 


1519284 


Plus 


i0y>2lu/ 


404927 


7342002 


Plus 


booyo-oyobJ 


404996 


6007890 


Phis 




405449 


7622497 


Plus 


422JO-440/U 


40556B 


6006906 


Plus 


359i2-380oo 


405572 


3800891 


Plus 




405646 


4914350 


Plus 


741-969 


405676 


4557087 


Plus 


73195-73917 


405770 


2735037 


Plus 


61057-62075 


405932 


7767812 


Minus 


123525-123713 


406137 


9166422 


Mnus 


30487-31058 


406360 


9256107 


Minus 


7513-7673 


406399 


9256288 


Minus 


63448-63554 


405467 


9795551 


Plus 


182212-182958 
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TABLE 10A: Potential Therapeutic Diagnostic and Prognostic targets for Therapy of Lung Cancer and Non-malignant Lung Disease 

Table 2A shows about 307 genes up-regulated in non-ma&gnant lung disease relative to lung tumors and normal body tissues and/or dowiwegutated in King tumors relative to 
normal lung and noiwnafignant lung disease. These genes were selected from about 59680 probesets on the Eos/ARymatrix Hu03 Genechip array. 

Table 10B show the accession numbers for those Pke/s lacking UnigenelD's for tabte 1QA. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are feted in the 
"Accession" column. 

Table 10C show the genomic positioning for those Pkey's lacking Unigene ID'S and accession numbers In tabte 1QA. For each predicted exon, we have fisted the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 



Pkeyt| Unique Eos probeset identifier number 

ExAcca* Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Average of fung tumors (including squamous ceil carcinomas, afenwarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the 
average of normal lung samples 

R2 Average of non-maSgnanl lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples 



Pkey 


ExAccn 


UnigenelD 


Unigene Title 


404394 




ENSP00000241075:TRRAP PROTEIN. 


404916 






Target Exon 


405257 






Target Exon 


407228 


M25079 


Hs.155376 


hemoglobin, beta 


407568 


AA740964 


Hs .62699 


ESTs 


408562 


AI436323 


Hs.31141 


Homo sapiens mRNA for KIAA1568 protein, 


409031 


AA376836 


Hs.76728 


ESTs 


410434 


AF051152 


Hs.63663 


toPxe receptor 2 


4t0467 


AF102546 


Hs.63931 


dachshund (Drosophila) homotog 


410808 


T40326 


Hs.167793 


ESTs 


412351 


AL135960 


Hs.73828 


T-ceB acute lyrnphocytic leukemia 1 


412372 


R65S98 


Hs.285243 


hypothetical protein FU22029 


413795 


AL040178 


Hs.142003 


ESTs 


414154 


AW205314 


Hs.323050 


ESTs 


414214 


D49958 ' 


Hs.75819 


glycoprotein MSA 


414998 


NM002543 


Hs.77729 


oxidised low density lipoprotein (lectin 


415122 


060708 


Hs.22245 


ESTs 


415765 


NNL005424 


Hs.78824 


tyrosine kinase with immunoglobulin and 


415775 


K00747 


Hs.29792 


ESTs, Weakly similar to 138022 hypoiheS 


415910 


U20350 


Hs.76913 


chemoWne (C-X3-C) receptor 1 



R1 


R2 


0.79 


3.10 


1.00 


159.00 


1.00 


422.00 


0.47 


233 


1.00 


12100 


1.00 


'230.00 


1.00 


12&00 


39.65 


149.00 


1.00 


109.00 


1.14 


13.14 


0.37 


2.27 


1.00 


173.00 


0.10 


11.90 


062 


209 


0.03 


4.55 


0.64 


2.97 


0.07 


8.97 


067 


1.65 


029 


264 


1.00 


145.00 
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416319 AI815601 Hs.79197 

416402 NMJHC715 

417355 013168 

417421 AL138201 

5 417511 AL049176 

418489 U76421 

418728 BE241812 

418741 H83265 

418883 BE38703B 

.10. 419088 NM.000216 

419150 T29618 

419235 AW470411 

419407 AW410377 

420555 AA278300 

15 420655 AA279098 

420729 AW964897 

421177 AW070211 

422060 R20893 " 

422426 W79117 

20 422652 AW967969 

423099 NM.002837 

424433 H04607 

424585 AA464840 

424711 NWL005795 Hs.152175 

25 424973 X92521 Hs.154057 

425023 AW956889 

425654 AJ006276 

425998 AU076629 

426657 NM_015865 

30 426753 T89832 

427558 049493 

427983 M177Q6 

428467 AK002121 

428927 AA441837 

35 429496 AA453800 

430468 NM_004673 

431385 BE176536 

431728 NMJQ7351 

431848 A1378857 

40 432128 AA127221 

432519 AI221311 

433043 W57554 

433803 AI823593 

434730 AA644669 

45 435472 AW972330 

436532 AA721522 

437119 A1379921 

437140 AA312799 

437211 AA382207 

50 ' 437960 AI6695e6 

43B202 AW169287 

438873 AT302471 

438875 AA827640 

441048 AA913488 

55 441188 AW292830 

441499 AW298235 

444513 AL120214 

444527 NM 005408 

444561 NM.004469 

60 445279 R41900 

445017 N98238 

446984 AB020722 

446998 N99013 

447357 AI375922 

65 448108 A1800470 

448253 H25899 

449275 AW450848 

450400 AI694722 

450696 AI654223 

70 450726 AW204600 

451497 H83294 

451533 NM.0Q4657 

453636 R67837 

458332 AI000341 

75 459530 AA022888 
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Hs.1012 

Hs.82002 

Hs.82120 

Hs.82223 

Hs.65302 

Hs.87860 

Hs.8881 

Hs.1211 

Hs.89591 

Hs.89640 

Hs.288433 

Hs.41502 

Hs.124292 

Hs.187635 

Hs.290825 

Ks.102415 

Hs.325823 

Hs.58559 

Hs.1 18958 

Hs.123641 

Hs.9218 

Hs.131987 



Hs.154210 

Hs.159003 

Hs.165950 

Hs.171731 

Hs.170278 

Hs.2171 

Hs2233 

Hs.184465 

Hs.90250 

Hs.192793 

Hs.241519 

Hs.11090 

Hs.268107 

Hs.126758 

Hs.117037 

Hs.130704 

Hs.125019 

Hs.27688 

Hs.193042 

Hs.283022 

Hs.177043 

Hs.283689 

Hs.5509 

Hs.222194 

Hs^2588 

Hs.124292 

Hs.189059 

Hs.192102 

Hs.255609 

Hs.101689 

Hs.7117 

Hs.1 1383 

Hs.11392 

HS22245 

Hi55185 

Hs.16714 

Hs.16762 

Hs.159367 

Hs.171941 

Hs.201591 

Hsi054S7 

Hs^79744 

Hs.16026 

Hs^50505 

Hs.284122 

Hs.26530 

Hs.169872 

Hs.220491 

Hs.176065 



403421 
407570 
412295 
414517 
417204 
418307 
416935 
421502 
421798 



Z19002 

AW088826 

M24461 

N81037 

U70867 

T23499 

AF111855 

N74880 



Hs.37098 

Ks.117176 

Hs.76305 

Hs.1074 

Hs.83974 

Hs.89485 

Hs.105039 

Hs.29377 



C083 antigen {activated B lymphocytes, i 

comptemsnt component 4-binding protein, 

endothelin receptor type B 

nuclear receptor subtamHy 4. group A, m 

chordin-like 

adenosine deaminase, RNA-specfic, B1 (b 

protein tyrosine phosphatase, norwecept 

ESTs, Weakly similar to S41044 chrornosom 

acid phosphatase 5, tartrate resistant 

Kallmann syndrome 1 sequence 

TEK tyrosine kinase, endothelial (venous 

neurotrimln 

rrypotheScaJ protein FU21276 

Homo sapiens cDNA: FU23123 fis, cfone L 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp586N0121 (f . 
ESTs, Moderately similar to ALU5_HUMAN A 
ESTs 

syntaxin 1 1 

protein tyrosine phosphatase, receptor t 

ESTs 

ESTs 

calcitonin receptor-tike 
matrix rnetaBoproteinase 1 9 
endothe&al ditterenrjatron, sphmgofipi 
transient receptor potential channel 6 
fibroblast growth factor receptor 4 
solute carrier family 14 (urea transport 
ESTs 

growth differentiation factor 10 
colony stimulating factor 3 (granulocyte 
hypothetical protein FIJ1 1259 
ESTs 
ESTs 

angtopoietin-fike 1 

membrane-spanning 4-domains, subfamily A 
muffimwin 

ESTs. Highly similar to AF175283 1 zinc 
ESTs 

ESTs, Weakly similar to BCHUIA S-100 pro 
lymphoid nuclear protein (LAM) mRNA 
ESTs 
ESTs 

' triggering receptee expressed on myeloid 
gb:nv54h12r1 NCI_CGAP_Ew1 Homo sapiens 
ESTs 

activator of CREM in testis 
ecotropic viral integration site 2B 
ESTs . 
ESTs 

Homo sapiens cONA: FD23123 fis, clone L 

ESTs 

ESTs 

ESTs . 

ESTs 

ghJtamate receptor, lonotroplc AMPA 1 
small inducible cytokine subfamily A (Cy 
c-fos induced growth factor (vascular en 
ESTs 
ESTs 

Rho guanine exchange factor (GEF) 15 

Homo sapiens mRNA; cDNA DKFZp564B2062 (f 

ESTs 

ESTs 

ESTs 

periaxin , 
ESTs 

hypothetical protein FU23I91 

reOnoic acid receptor, alpha 

WntinhiWtOTyfactor-1 

serum deprivation response (phosphatidyl 

ESTs 

ESTs 

ESTs 

Eos Control 

NM_016369°*tomo sapiens claudin 18 (CLON 
zlncfingar protein 145 (KruppeWxe, e 
poty(A)-binding protein, nuclear 1 
surfactant, pulmonary-associated prolan 
surfactant pirfnwnary-assoriated protein 
solute carrier family 21 (prostapjamfin 
carbonic anhydrase IV 
solute carrier family 34 (sodium phospha 
K-acytsphingosine amsdohydrolase (add c 



15.32 


237.00 


0.64 


4.00 


OL01 


^90 


36.30 


357.00 


1.00 


179.00 


0.02 


6.00" 


1.00 


- 11&0Q 


0.44 


1.90 


0.96 


2.04 


0.62 


274 


0.03 


6.90 


1.48 


5.13 


37.55 


336.00 


aeo 


3.65 


1.65 


8.07 


2.99 


25.82 


a46 


1.95 


1.00 


156.00 


ao3 


7.44' 


ai4 


3.62 


aoi 


3.16 


0.75 


141.75 


1.00 


167.00 


0.43 


3.01 


a37 


19.45 


0.14 


a35 


1.00 


94.00 


058 


1.42 


0.03 


3.74 


1.00 


141.00 


1*00 


117.00 


0.75 


2.20 


0.76 


225 


0.01 


3.62 


1.00 


138.00 


too 


132.00 


1.00 


157.00 


1.00 


157.00 


0*34 - 


224 


0.00 


1.15 


0.01 


2.06 


1.00 


267.00 


1.00 


105.00 


105 


3.15 


0.83 


1.94 


1.00 


218.00 


1.00 


133.00 


0.67 


122.67 


1.00 


14200 


1.00 


147.00 


1.00 


141.00 


0.71 


3.66 


23.32 


370.00 


0.77 


8.50 


3.43 


1636 


1.00 


167.00 


1.00 


151.00 


46.47 


153.00 


0.01 


3.08 


0.60 


141.00 


0.18 


2.39 


0.10 


2.16 


0.01 


2.53 


0.46 


264 


18.05 


296.00 


1.00 


141.00 


0.56 


1.38 


0.88 


4.33 


0.52 


208 


0.79 


2.01 


0.35 


203 


0.13 


225 


1.00 


116.00 


1.00 


19200 


0.20 


298 


0.40 


240 


0.53 


1.77 


0.01 


3.18 


0.56 


1.74 


0.64 


1.50 


033 


1.16 


0.53 


1.55 


O20 


1.28 


0.78 


1.90 


0.59 


1.54 
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425438 T62215 Hs.270840 

426828 NMJM0020 Hs.172570 

427019 AA001732 Hs. 173233 
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Pkey; Unique Eos probeset identifier number 
CAT number. Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 

408074 103684 1 R9X)723AA2630<)3AA333976AA334725AA334151 AW96 
C06094AW104534 

411667 1253334 1 BE160198 AW935898T11520AW935930AW856073 AW861034 

413533 1375344~1 BE146973 BE146972 BE147042 BE147018 BE146783 BE147020 BE146781 BE147019 BE146766 BE147021 BE146952 BE146767 BE147044 
BE146797 BE146776 BE146985 BE146793 BE146768 BE146771 BE146954 BE146760 BE147048 BE147025 BE147030 

423387 22779 1 AJ012074U11M7L13288 X75299 L20^AWM0780H14880 T28037AI872991 R72136 AW449839 T81622 T79697T29519 R94105 T83923 
" R73300 A1797007 R73390 AA961 010 H741 68 A1689932 BE045543 AI808418 AI608912 AI806573 AWB84084 AW872978 AW872985 AA565655 
AI022915 R50647 R73210 H45098 R46451 AW1 66269 T71 132 AI264547 R52146 AI304920 R73391 AW8B4059 AW884085 H73241 T6003B 
T79612R73145f^549AI094557AI668793 R72302Al564366M1956 M418962m R72840 H45409 R72085 R46356 R46758 
AA50B805AA418798TB3751 R94072T16182AA928785AA903896 

423696 23112 1 Z92546 AA330586 AI570568 AW341487A1827050 AW298658AI792189 AI015693AI733599 AI572251 AI 672488 AW1 93262 A1244716 
A1864375 A1206100 AA912444 AI269365 A1640254 AW772466 AI867336 AA627604 H1 6914 AA358477 AA338009 

430212 314437 1 AA469153AI718503AA469225 

436532 421B02L1 AA721522AW975443 T93070 

453531 97026J AA417940 AA036735T07025 

454741 1232559J BE1 54396 AW817959 BE1 54393 



TABLE 10C 



Pkey Unique number corresponding to an Eos probeset 

Ret ' Sequence source. The 7 digit numbers in this column am Genbank Identifier (Gl) numbers. Dunham t. et al/ refers to the publication entitled The DNA 

sequence of human chromosome 22." Dunham I. et aL, Nature (1 999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted 
NLpositon: Indicates nucleotide posifions of predicted exons. 



Pkey 

400754 

401045 

401083 

402474 

402808 

403021 

403421 

403438 

403687 

403764 

404277 

404288 

404394 

404518 

404916 

405106 

405257 

405381 



Ref 

7331445 
8117619 
3242744 
7547175 
$456148 
7547270 
9665041 
9719679 
7387384 
7717105 
1834458 
2769644 
3135305 
8151988 
7341826 
8079395 
7329310 



Strand reposition 

Plus 144559-1446B4 

Plus 90044-90184,91111-91345 

Plus 33192-33360 

Minus . 53526-53628,55755-55920,57530-57757 

Minus 114964-115136,115461-115585,115931-116047,117665-117771.118004-118102 

Plus 120799-120956 

Minus 126609-126773,139986-140205 

Plus 90792-90938 

Plus 9009-9534 

Minus 116692-118853 

Minus . 91665-91946 

Plus . 3512-3691 

Minus 37121-37205,37491-37762,4105^41140,4132241593,4177341919 

Plus 84494-84603 

Plus 91057-91188 

Minus. 80877-81418 

Plus 73121-73273 

Minus 7536-8054 
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i 



WO 02/086443 * PCT/US02/12476 ...... . 

406387 9256180 Plus 116229-116371,117512-117651 
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TABLE 1 1 A: Genes Dis&nguisWng Adartocardnonrta from Other Lung Diseases and Konrtai Lung 

Table \ 1A shows about 84 genes unregulated in king adenocarcinomas relative to other lung tumors, nonpregnant lung disease, and normal lung- These genes were selected 
from about 5S6B0 probesets on the Ec^Affymetrix Hu03 Genechip array. 

Table 118 show the accession numbers for those Pke/s lading UnrgenelErs for table 1 1 A For each probeset we hare listed the gene duster number from vtochjhs 
aBgonucteofides were designed. Gene dusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were dustered based on sequence 
similarity using Clustering snd Alignment Tods (DoutteTwisL Oakland CaWomia). The Genbank accession numbers for sequences comprising each duster are fisted m the 
"Accession" column. 

Table 11C show the genomic posifioning for those Pke/s lacking Unigene tD*s and accession numbers in table 1 1 A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locafions of each predlded exon are also Dsted. 



Pkey: 
ExAccm 
Unigendft 
Unigene Title: 
R1: 

R2: 



Unique Eos probeset identifier number 

Exemplar Accession number, Genbank accession number 

Unigene number 

Average oHung^ors (including squamous cell carcinomas, adenocarcinomas, small cell cardnomas, granulomatous and carcinoid tumors) divided by the 
average of normal lung samples 



Pkey 


ExAccn 


UnlgenelD 


403329 






406399 






406590 


M29540 


Hs.220529 


407889 


A1827976 


H&2439V 


407881 


AW0720O3 


Ks.40968 


408908 


BE296227 


Hs.250822 


409103 


AF251237 


Hs.1 12208 


409187 


AF154830 


Hs.50956 


409269 


AA576953 


Hs.22972 


410076 


T05387 


Hs.7991 


410102 


AW248508 


Hs.279727 


410399 


BE068869 




411908 


L27943 


Hs.72924 


412612 


NM_000047 


Hs.74131 


414075 


U11862 


Hs.75741 


416208 


AW291168 


Hs,41295 


417542 


J04129 


Hs.82269 


419163 


U60669 


Ks.89663 


419502 


AU076704 




419631 


AW188117 


Hs-303154 


420931 


AF044197 


Hs.100431 


421155 


H87879 


Hs.102267 


421190 


U95031 


Hs.102482 


421474 


U76362 


Hs.104637 


421515 


Y11339 


Hs.105352 


421582 


A1910275 




422026 


U80738 


Hs.1 10826 


422095 


AIB68872 


Hs.282804 


422311 


AF073515 


Hs.114948 


422867 


L32137 


Hs.1584 


423472 


AF041260 


Hs.129057 


423554 


M90516 


Hs.1674 


424502 


AF242388 


Hs.1 49585 


424544 


M88700 


Hs.150403 


424905 


NMJJ02497 


Hs.153704 


424960 


BE245380 


Hs.153952 


425523 


AB007948 


Hs.158244 


426230 


AA367019 


Hs.241395 


427701 


AA411101 


Hs.243886 


4285B5 


AB007863 


Hs.185140 


428758 


AA433988 


Hs.98502 


429170 


NJO01394 


Hs.2359 


429263 


AA019004 


Hs.198396 


429610 


AB024937 


Hs.211092 


430508 


A1015435 


Hs.104637 


43D985 


M490232 


Hs.27323 


431548 


A1834273 


Hs.9711 


431566 


AF176012 


Ks.26072O 


431986 


AA536130 


Hs.149018 


432375 


BE536069 


Hs.2962 


432677 


NMJ»44B2 


Hs.27B611 


433556 


W56321 


Hs.111460 


433819 


AW511097 


Hs.1 12765 


434001 


AW950905 


Hs.3697 


434424 


A1811202 


Hs.325335 


434792 


AA649253 


Hs.132458 


436217 


T53925 


Hs.107 


436749 


AA584890 


Hs.5302 


436972 


AA234679 


Hs.25640 


437866 


AA156781 




437935 


AW939591 


Hs.5940 


438915 


AA280174 


Hs.285681 


439451 


AF086270 


Hs.278554 



Unigene Tide ■ 
Target Exon 

NM_0031 22*'.Homo sapiens serine protease 
carci r>o embryonic antigen-related cell ad 
hypotheticaJ protein FU1361 2 
heparan sulfate (glucosamine) 3-O-suIfot 
serine/threonine kinase 15 
XAGE-1 protein 

carbamoyl-phosphate synthetase 1, mitoch 

hypo Weal protein RJA3352 

ESTs 

Homo sapiens cONA FU14035 fc. clone HE 
synudeln, gamma (breast cancer-specific 
cytidine deaminase 

arytsulfatase E (chondrodysptasia puncta 
amiJoride binding protein 1 (amine oxida 
ESTs, Weakly similar to MUC2.HUMAN MUCIN 
progestagen- associated endometrial prate 
cytochrome P450, subfamily XXIV (vitamin 
fibrinogen, A alpha polypeptide 
popeye protein 3 

small inducible cytokine B subfamily (Cy 
lysyl oxidase 

mucin 5, subtype B, tracheobronchial 
solute carrier farnfly 1 (glutamate trans 
GalNAc alpha-2, 6-sialyltransferase 1, 1 
trefoil factor 1 (breast cancer, estroge 
trinudeoiide repeat containing 9 
hypothetical protein FU22704 
cytokine receptor-like factor 1 
cartilage digomeric matrix protein (pse 
breast carcinoma amplified sequence 1 
gHrtemme-frodose-e^rosr^ate transamin 
lengsrn 

dopa decarboxylase (aromatic L-amino ad 
NIMA (never in mitosis gene a)-related k 
5 1 nudeotidase(CD73) 
K1AA0479 protein 
protease, serine, 1 (trypsin 1) 
nudear autoanCgenlc sperm protein (his 
KIAA0403 protein 
hypothetical protein FU 14303 
dual specificity phosphatase 4 
ATP-binding cassette, sub^amiiy A (ABC1 
UUrMX protein; PLUNC (palate lung and nas 
ESTs 

ESTs, Weakly similar to 178885 serina/th 
novel protein 

J domain containing protein 1 

Novel human gsne mapping to chomosome 20 

S100 calcium-binding protein P 

UDP-N-acetyl-alpha-agalado^ne:polyp 

cafaurnrcafrnodulin-dependent protein kin 

ESTs 

serine (or cysteine) proteinase tnhibito 
Homo sapiens cDNA: FU23523 fis, done I 
ESTs 



lectin, galactoside-binding, soluble, 4 
ctaudin 3 

metaUothiondn 1E (functional} 
mucin 13, epitheBal transmembrane 
WiEams-Beirren syndrome chromosome n 
hetercchromatirvtike protein 1 



R1 


R2 


1.00 


61.00 


1.00 


39.00 


226.37 


350.00 


0.77 


1.18 


1.00 


10.00 


7.76 


1.00 


80.44 


40.00 


1.00 


1.00 


1.00 


1.00 


1.12 


1.50 


9.89 


1.00 


0.92 


1.06 


1.00 


1.00 


1.02 


1.03 


0,84 


1.07 


a67 


1.00 


1.28 


1.35 


1.00 


1.00 


13.05 


115.00 


1.00 


13.00 


1.00 


aoo 


1,00 


15.00 


1.17 


1.55 


1.46 


1.76 


1.00 


aoo 


1.23 


1.00 


1.00 


52.00 


4.37 


Z34 


1.15 


1.78 


1.69 


3.17 


48.13 


72.00 


1.00 


50.00 


4 Art 

1.00 


1 nn 
l.UU 


1.00 


59.00 


21.35 


1.00 


1.00 


1.00 


1.00 


35.00 


1.00 


83.00 


7.41 


'34.00 


1.00 


aoo 


1.05 


1.13 


16.18 


105.00 


1.07 


1.00 


1.59 


1.69 


4.75 


7.27 


0.94 


1.28 


5.66 


15.00 


49.76 


37.00 


1.19 


1.47 


1.65 


1.08 


1.00 


48.00 


1.00 


19.00 


171 


aoo 


29.31 


72.00 


1.00 


64.00 


8.52 


44.00 


57.97 


31.00 


1.10 


1.41 


1.59 


1.46 


3.62 


101.00 


1.60 


1.39 


1.00 


1.00 


23.28 


52.00 
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439759 


AL359055 


Ks.67709 


441031 


AJ110684 


Hs.7645 


441377 


BE218239 


Hs.202658 


443614 


AV655386 


Hs.7645 


443813 


AA876372 


Hs^3961 


443991 


NMJ102250 


Hs.10082 


444570 


H58373 


Hs332938 


444931 


AV652C66 


Hs.75113 


446102 


AW168067 


Hs.317694 


446163 


AA026880 


Hs.25252 


446469 


BE094848 


Hs.15113 


447388 


AW630534 


Hs.76277 


447532 


AK000614 


Hs.18791 


448243 


AW369771 


Hs.52620 


448844 


AI581519 


Hs.177164 


449444 


AW818436 


Hs^3590 


451807 


W52B54 




452689 


F33868 


Hs.284176 


463392 


U23752 


Ks.32964 


453464 


M884911 


Hs.32989 


453735 


AI066529 


Hs.125073 



Homo sapiens mRNA fuD fenglh insert cDN 1.00 21.00 

fiirmogen, B beta poJypepfide 1.41 99.00 

ESTs 22.03 1.00 

fibrinogen, B beta porypepflde 1.00 16.00 

Homo sapiens mRNA; cONA DKFZp657D095 (fr 1.20 1.99 

potassium intarmediate/smafl conductance 5.71 6.87 

hypotneScal protein MGC5370 1.98 38.00 

general transcription factor lilA 1.00 54.00 

ESTs LOO 1.00 

Homo sapiens cONA FU13603 fis, clone PL ,1.00 35.00 

homogenfcate 1,2-dbxygenase {homogenti 1 .00 11 .00 

Homo sapiens, done MGC9381. mRNA, comp 1.24 1.16 

hypothetical protein FU20607 1.23 1.63 

fotegrin.beta8 15.84 1.00 

ESTs 1.00 31.00 

solute carrier family 16 (monocarboxyfic 1.00 83.00 

hypothetical protein FU23293 similar to 1.55 35.00 

transferrin 1.54 . 1.44 
SRY(sexdetemuwng region Y)-box 11 • 1.00 . 16.00 

receptor (caldtonin) acMy rnodilying 1.55 2.45 

ESTs 1.01 1.30 



TABLE 11B 

Pkey: Unique Eos probeset WenSfier number 
CAT number. Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 

410399 11995 V BE068889 BE068882 AF044311 AF017256 NM_003087 AF037207 AF010126 AA633976 AA872836 BE298825 BE299889 AI016464 A1684600 

A1936527 AA804675 AA394097 A1139933 AA946606 BE17131 3 AA722407 AA293803 AK6B480AA055035 AA055968 AW796957 AI637713 
AM10737 H49348 M486472 M411094AA235594AM02624AA4436^AW452137AA421708Ara 

419502 1 B535_1 AU076704 T74854 T74860 T72098 T73265 T73873 T591 80 T74658 T58786 T60385 T73410 T6B781 T67845 T67593 T73952 T67864 T60530 

T68367 T68401 T53959 T72360 T72099 T60377 T58951 T7171 2 T72821 T64738 T74645 T72Q37 T68688 T72063 T73258 T72826 T64242 
T6B220 T74673 T71800 T68355 T61227 T62738 T69317 T53850 T64692 T73768 T73962 T73382 T68914 T70975 T73400 T60531 T73277 
T73203 T70498 T61409 T58925 NM 000508 M64982 T68301 T73729 T69445 T60424 T67922 T67736 T68716 T67755 T74765 T73819 T58719 
T74756 T60477 T74863 T61109 T68329 T5B850 T71857 T73425 T53736 T68607 T58898 T64309 T72031 T72079 T64305 T71908 T68107 
T71916 T73787 T55035 T64425 T71870 T60476 T61376 T67820 T71895 T41006 T69441 T6B170 T74617 T71958 T69440 T61 875 R06796 
H48353 T71914 T53939 T64121 AA693996 T72525 T67779 T68078 AA01 1465 AA345378 AV654847 AV654272 AV65SQ01 A1064740 T82897 
N33594 AA344542 AW805054 AI207457 T61743 AA026737 H94389 AA382695 AA918409 T68044 S82092 T39959 AI01 7721 AA31 2395 
AA31291 9 T40156 H66239 AV652989 H38728 R98521 AV655200 R95790 W03250 W0091 3 AA344136 AV660126 R97923 AA343598 
AW470774 AV551256 N54417AA812862AW1S2929AM1192H61463 H72060AA344503 H38639AI277511 AV661108AI207625 T47810 
AA235252 T27853 T4777B R95746 H70620 AA7D1 463 AW827166 R98475 C20925 AV657287 T71959 T71313 T73920 T73333 T61 618 T69293 
T692B3 T73931 T721 78 T72456 AV645639 AV653476 T72957 T72300 T58906 T71457 T70494 T72956 T70495 T68267 T74407 T85778 
AA344726 T27854 T74485 T74101 T73868 T71518 T72304 AA343853 T73909 T68070 T72065 H72149 T73493 T73495 AV645993 R02293 
T70475 T64751 AA344441 AA343657 AA345732 M344328 All 10639 AA344603 AF05351 3 T64696 T6651 6 T72223 T60507 T67633 R29500 
T72517 R02292 T60599 T69208 T70452 T74677 R29366 T61277 T74914T60352 R29675 T74843 AV645792 AA344408 T69197 T72057 
T69368 T69356 T68258 AV650429 T73341 T61702 T74598 T40095 K02272 T40106 AA343045 AA341 908 AA341 907 AA342807 AA3419S4 
T53747T72042T62764AIOS4899AA343060T67832T72440 T71770T68091 T69108T72449T69167 T71289T68251 AV654844 T64375 
AA345234 T67598 AA011414T68036 H48262 AI207557 T68219 W86031 T69081 T64232 R93196 T62136 AV650539 H67459 T72978 
AA344583 T60362 H58121 T9571 1 T72803 T68055 T71715 R29036 T72793 T69122 T64595 T62888 T691 39 T68291 T64652 T67971 T46862 
AA693592 AI248502 R29454 T64764 T57001 T73052 771429 T51176 T58866 AV655414 H90426 AA342489 T73666 T67848 T72512 T53835 
T67837 T7331 7 T74273 T69420 T68245 T74380 T67862 T74474 T56068 

421582 2041 1 AJ910275 X00474 X52003 X05030 NM 003225 AA314326 AA308400 AA505787 AA31 4825 AI571948 AA507595 AA614579 AA58761 3 R8381B 

AA568312 AA614409 AA307578 AI925552 AW950155 AI910083 M12075 BE074052 AW004668 AA578674 AA582084 BEO74053 BE074126 
BE074140 AA514776 AA588034 BE074051 BE074068 AW009769 AW050690 AA858276 R55389 AI001051 AW050700 AW750216 AA614539 
BE074045 AI307407 AW602303 BE073575 AI202532 AA524242 AI970839 AI909751 BE076078 A1909749 R55292 

437866 44433 2 AA1 56781 AW293839 U52054 AA024963 AA778446 BE073977 AW444904 AW602574 BE1 64040 BE1 6401 2 BE163972 BE163974 BE1 63992 

AA837481 AW468444 BE185091 AW468002 AA687333 AA81 1830 AA581806 AI666686 AI572124 AA043777 AA040926 020160 Ai536733 
AA812489 AW874142 AI471883 W84421 AA156350 

451807 8865 1 W52854 ALII 7600 BE208116 BE208432 BE206239 BE082291 AW953423 AA351619 BE180648 BE140560 W60080 AA865478 N90291 

AW450652 AW44951 9 AA993534 AI806539 AA351618 AW449522 AI827526 AA904788 AA380381 AA886045 AA774409 BE003229 241756 



TABl£11C 

Pkey: 
Ret 

Strand: 



Unique number corresponding to an Eos probeset 

Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gl) numbers. 'Dunham I. et aT refers to the publication entitled The ONA 
sequence of human chromosome 22* Dunham I. et aL, Nature (1 999) 402:489495 
Indicates DNA strand from which exons were predicted. 



NLposiSon: Indicates nucleotide positions of predicted exons. 



Pkey 

403329 

406399 



Ref Strand 
8516120 Plus 



NLposffion 



63448-63554 



177 



WO 02/086443, 
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TABLE 12A: Genes Distinguishing Squamous CeD Carcinoma fromOfter Lung Diseases and Normal Lung • 

Table 12A shows about 72 genes upreguiated in squamous cefl carcinomas of the lung relative to other lung tumors, norwnafignant lung disease, and norma) lung. These genes 
were selected from about 59580 probesets on tiie EcsJAflymetrix Hu03 Genechip array. 

Table 1 2B show the accession numbers for those Pke/s lacking Urfcenetrjs tar table 1 2A. For each probeset we have listed the gene cfcister number tan i which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were dustered based on sequence 
simBarity using Clustering and Alignment Toots (DoufateTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed m me 
'Accession* column. 



Table 12C show the genomic positioning for those Pke/s lacking Unigene ID'S and accession numbers in table 12A For each predicted 
sequence source used for prediction. Nucleotide locations of each predicted exon are also fisted 



exon, we have listed the genomic 



Pkey. Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number. Genbank accession number 

UnigenelOc Unigene number 

Unigene Title: Unigene gene title „ . ' .. . K „ fKo * 

R1: Average of lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by tne 

R2: 



average of normal lung samples . , , 

Average of norwnaDgnant lung disease samples finduding bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples 



Pkey 
400289 
400656 
401760 
401781 
401785 
401994 
402075 
404996 
407839 
408000 
408522 
410561 
-415091 
415817 
416656 
417034 
417366 
418663 
418678 
419121 
420783 
421773 
421948 
421978 
422158 
422440 
423534 
423725 
423736 
424012 
424046 
424098 
424834 
425650 
427099 
427335 
428182 
428645 
428748 
429259 
42953B 
429903 
430486 
430890 
431009 
431846 
433091 
434360 
434880 
435505 
435793 
436511 
438403 
439285 
439606 
439670 
439706 
440325 
441525 
443162 
444378 



ExAccn 
X07820 



UnigenelD 
Hs^258 



AA045144 

L11690 

A1541214 

BE540255 

AL044872 

U88967 

U03272 

NM.006183 

BE185289 

AK001100 

NM.001327 

AA374372 

A1659838 

W69233 

L42583 

AJ243662 

L10343 

NMJKW812 

AW959908 

AJ403108 

AB002134 

AW36B377 

AF027866 

AF077374 

AK001432 

NfVL001944 

AB032953 

AA448542 

BE386042 

AA431400 

AW593205 

M420450 

BE182592 

AL134197 

BE062109 

X54232 

BE149762 

BE019924 

Y12642 

AW015415 

U023B8 

Af 200492 

AB037734 

AA721252 

AA806607 

AL133916 

W79123 

AF08B076 

AW872527 

NMJJ03812 

AW241867 

T49951 

R41339 



Hs.161566 

Hs.620 

Hs.46320 

Hs.6994 

Hs.77910 

Hs.78867 

Hs.79432 

Hs.80962 

Hs.1076 

Hs.41690 

Hs.87225 

Ks.89626 

Hs.99923 

Hs.1 12457 

Hs.334309 

Hs.110196 

Hs.1 12341 

Hs.116724 

Hs.1 690 

Hs.1 32127 

Hs.1 32195 

Hs.1 37569 

Hs.1 38202 

Hs.1 39322 

Hs.1 53408 

Hs.1925 

Hs.173560 

Hs^51677 

Hs.293317 

Hs.98729 

Hs.98785 

Hs.292911 

Hs.11261 

Hs.93597 

Hs.241551 

Hs.2699 

Hs.48956 

Hs.271580 

Hs.3185 

Hs.127780 

Hs.101 

Hs^11238 

Hs.4993 

Hs^91502 

Hs.292206 

Hs.58561 

Hs.59507 

Hs.59761 

Hs.7164 

Hs.127728 

Hs.9029 

Hs.12569 



matrix metalloproteinase 10 {stromerysin 
NMJ»2425:Homo sapiens matrix metaHopro 
NMJM>5557\Homo sapiens keratin 16 (foca 
Target Exon 

NM_002275*:Homo sapiens keratin 15 (KRT1 
Target Exon 

ENSPO000025 1 Q55*:Ptasma membrane calcium 

Target Exon 

ESTs 

bullous pemphigoid antigen 1 (230/240kD) 
Small prollne-rich protein SPRK [human, 
Homo sapiens cDNA* FU 22044 fis, done H 
34n/droxv-3^thylgtutaryl-Coenzyma A sy 
protein tyrosine phosphatase, receptor-t 
fibrillin 2 (congenita) contracture) ara 
neurotensin 

small praline-rich protein IB (comifin) 
desmocollin3 
cancer/testis antigen 
parathyroid hormone-like hormone 
lectin, galactoside-binding, solubte, 7 
ESTs 
keratin 6A 
NICE-1 protein 

protease Inhibitor 3, skMerived (SKAL 
aJdo-keto reductase family 1 , member B1 0 
heparin^mding growth factor binding pr 
hypothetical protein LOC57822 
airway trypsb-fike protease 
tumor protein 63 kDa with strong homolog 
serine (or cysteine) proteinase inhibito 
small proGne-rich protein 3 
Homo sapiens cDNA FU10570 fis, clone NT 
desmogtein 3 (pemphigus vulgaris antigen 
odd Ozrten-m homolog 2 (Drosophfla, mous 
G antigen 7B 

ESTs. Weakly similar to GGC1_HUMAN G ANT 
ESTs, Weakly similar b 2017205A dihydro 
Ksp37 protein 

ESTs, Highly similar to S6071 2 band-6-pr 
small praline-rich protein 2A 
cydin-dependent kinase 5, regulatory su 
chloride channel, calcium activated, tarn 
gtypicanl 

gap junction protein, beta 6 (connexin 3 
uroplaWn IB 

lymphocyte antigen 6 complex, locus D 
ESTs 

(^tocnrome P450, subfamily IVF, porypept 

interleukin-1 homcteg 1 

WAA1313 protein 

ESTs 

ESTs 

hypothetical protein FU20093 

G protein-coupled receptor 87 

ESTs, Weakty simaar to AC004858 3 U1 sm 

ESTs, Weakly simaar to DAP INHUMAN DEATH 

a distntegrin and metalloproteinase doma 

ESTs 

DKFZP434G032 protein 
ESTs 
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446292 AHJ81497 Hs.2795B2 1% type C gf>toproteb 

447078 AW885727 . Hs.9914 • ESTs 

447342 AI199258 Hs.19322 Homo salens, Similar to RIKEN cONA 2010 

449003. X76342 . Hs.3B9 . alcohol dahyrfrogenase 7 (class IV}, mu o 

449101 AA2B847 rk23016 G protein-coupled receptor 

450832 AW970602 Hs.1 05421 ESTs 

452240 A1591147 - Hs.61232 EST$ 

453317 NM.002277 Hs.41696 keratin, hair. addic.1 

453830 AA534296* Hs20953 ESTs . 

45409B W27953 Hs.292911 ESTs,r^WysirrilartoS60712band4^ 

455601 AI3S8680 Hs,816 SRY (sex determining region YJ-bcx 2 

TABLE 12B 

Pkey: Unique Eos probeset idenffiar number 
CAT number Gene cluster number 
Accession: . Genbank accession numbers 

439285 OTisT^ ALlSt6N79113AF0B6101 N76721 AW950828 AA364013AW955684AI346341 AI867454 N54784 AI655270AJ421279 AW014882 
" M775552 N82351N59253M626243A)341407BE175639AM5^ 

TABLE 12C 

sequence of human duomosome 22.' Dunham I. et at., Nature (1999} 402:489-495. 
Strand: Indicates DNA strand from which excns were predicted. 
NLposifon: Indicates nucleotide positions of predicted axons. 

Pkey Ref Strand NLpositkm 

400666 8118496 Plus 17982-18115,20297-20456 _ , 

401780 7249190 Minus 2B397-28617 l 2892f>29045 t 2913S-2929S.29411-29567 ) 29705-29787.30224-30573 



40 785 7249190 Minus 16577&-165996J66189-1663H166408-166569.167112.167268.167387-157469J68634-168 

401994 4153858 Minus 4290443124,43211-43335,44607-44763,45199^5281,46337-46732 

402075 ' 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 

404996 6007890 Plus 37999^8145,38652-38998.39727-39872.40557-40674,4235142450 
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TABLE 13A: Genes Distinguishing Iton-Mafignanl Lung Disease Iron Lung Tumors and Normal lung , 

Tab3e'l3A shows about 23 genes unregulated In non-maOgnant lung disease relative to lung tumors and normal rung. These genes were selected from about 59880 probeseb on 
foe Eos/Afiymetrix Hu03 Gsnechip array. 

Table 13B show the accession numbers for those Pkey*s lacking UnjgeneJD's for table ISA. For each probeset we have listed the gene cluster nurrMta which the 
oDgonudeofides were designed. Gene clusters were compiled using sequences derived ton Genbank ESTs and mRNAs. These sequences were ctetoed based on sequence 
scanty using Clustering and Alignment Tools (DoubteTwfet Oakland California). The Genbank accession numbers for sequences comprising each duster are listed In the 
'Accession* rohimn. 

Table 13C she* the genomic positioning for those Pke/s lacking Unigene ID'S and accession numbers in table 13A. For each predicted exon, we have Bsted the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also fisted. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Trtte: Ur^ne J™^^ {mMng ^ uamaus ceB £asdm3Sr adenocarcinomas, small ceH cardnomas, granulomatous and carcinoid tumors) divided by the 

R2: Avera^ of nc^aBgS samples ftnduding branchais, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal hmg samples 



Pkey 


ExAccn 


UnigenelD 


Unigene Title 


408562 


A1436323 


Hs.31141 


Homo sapiens mRNA for K1AA1568 protein, 


409031 


AA376836 


Hs.76728 


ESTs 


412372 


R65998 ■ 


Hs.285243 


hypothetical protein FU22029 


415910 


U20350 


Hs.78913 


chemokine (C-X^C) receptor 1 


417511 


AL049176 


Hs.B2223 


chordin-like 


418819 


AA228776 


Hs.191721 


ESTs 


422060 


R20893 


Hs^25823 


ESTs, Moderately similar to ALU5.HUMAN A 


424585 


AA464840 


Hs.131987 


ESTs 


426753 


T89832 


Hs.170278 


ESTs 


429496 


AA453800 


Hs,192793 


ESTs 


430719 


AA488988 


Hs.293796 


ESTs 


431089 


BE041395 




ESTs, Weakly similar to unknown protein 


431385 


BE178538 


Hs.11090 


membrane-spanning 4-domalns, subfamily A 


431728 


NM07351 


Hs.268107 


muitlmerin 


435532 


AA721522 




gb:m/54h12j1 NCLOGAP__Ew1 Homo sapiens 


437960 


AI66958S 


Hs.222194 


ESTs 


438202 


AW1692B7 


Hs.22588 


ESTs 


441499 


AW298235 


Hs.101689 


ESTs 


444513 


AL120214 


Hs.7117 


gtutamate receptor, fonotropic. AMPA 1 


448253 


H25899 


Hs.201591 


ESTs 


453636 


R57837 


Hs.169872 


ESTs 


458332 


A1000341 


Hs.220491 


ESTs 


459587 


AA031956 




gbzk15eQ4.s1 Soares_pregnanLuterus_NbH 



R1 


R2 


1.00 


230.00 


1.00 


128.00 


1.00 


173.00 


1.00 


145.00 


1.00 


179.00 


1.00 


140.00 


1.00 


156.00 


1.00 


167.00 


1.00 


141.00 


1.00 


138.00 


1.00 


133.00 


23.32 


941.00 


1.00 


157.00 


1.00 


157.00 


1.00 


218.00 


1.00 


147.00 


1.00 


141.00 


1.00 


167.00 


1.00 


151.00 


1.00 


141.00 


1.00 


116.00 


1.00 


192.00 


1.00 


154.00 



TABLE 13B 

Pkey: Unique Eos probeset identifier number 
CAT number. Gene duster number 

Accession: Genbank accession numbers 

Pkey CAT Number Accession 

4310B9 327825 1 BEO41395AA491826AA621946AA715980AA666102 

436532 421802J AA721522AW975443T93070 



TABLE 13C 

Pkey: 
Reft 

Slrand: 



Unique number corresponding to an Eos probeset 

Sequence source. The 7digit numbers in this column are Genbank Identifier (Gl) numbers. -Dunham L et al. 
sequence of human chromosome 22' Dunham t. et a)., Nature (1999) 402:489-495. 
Indicates DNA strand from which exons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons. 

Ref Strand NLposihn 

8117407 Ptus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 



refers to the publication entitled "The DNA 



Pkey 
402075 
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TABLE 14A: Preferred UHity and Subcellular Localization for Potential Lung Disease Targets . 

Table 14A shows the subcellular IccaQzafion and prefened uSIry for the genes appearing in Tables 9A and 10A> mAb symbolize rn^^ S/?£?^u M 

diagnostic, s.m. symbolizes small molecule, and CTL symbofizes cytotoxic lymphocyfic Rgand. These genes were selected from 59680 probesets on the Eos/AflymeWc HuOJ 
Genechip array. 

Table 14B show the accession numbers for Ihcss Pkey's lacking UnigenelD's for table 14A. For each probesetwe have fisted Ihegene cluster raimb» to which Uie 
oQgocu^evSdes were designed Gene dusters were combed using sequences derived from GenbanJt ESTs and mRNAs. These sequences im cfcstered based on sequence 
similarity using Clustering and Alignment Toots (DoubleTwist, Oakland CaBfbrnia). The Genbank accession numbers for sequences comprising each duster are feted in the 
'Accesston* column. 

Table 14C show the genomic posBoning for those Pkey's lacking Unlgene ID'S and accession numbers In table 14A. For each predicted exon, we have Bsted the genomic 
3 source used for prediction, NucteoSde locations of each predicted exon are also Bsted. 



Ptey: • Unique Eos probeset identifier number 

ExAccn: ' Exemplar Accession number, Genbank accession number 

UnigenelD: Unigera number 

Unigene Title: Unigene gene title 

Pref.Utility: Preferred Unity 

PreiLoc: Pretficted subcellular localization 



Pkey 


ExAccn 


UnigenelD 


400289 


X07820 


Hs.2258 


400303 


AA242758 


Hs.79136 


402075 






407811 


AW190902 


Hs.40098 


408243 


Y00787 


Hs.624 . 


408790 


AW580227 


Hs.47860 


408908 


BE296227 


Hs.250822 


409041 


AB033025 


Hs.50081 


409103 


AF251237 


Hs.112208 


409420 


Z15008 


Hs.54451 


409632 


W74001 


Hs.55279 


409757 


NMJ301898 


Hs.123114 


409893 


AW247090 


Hs.57101 


409956 


AW103364 


Hs.727 


410001 . 


AB041036 


Hs.57771 


410407 


X66B39 


Hs.63287 


410418 


D31382 


Hs.63325 


412140 


AA219691 


Hs.73625 


412719 


AW016610 


HS.B16 


414774 


X02419 


Hs.77274 


414883 


AA926960 




415138 


C16356 


Hs.295944 


415669 


NM_005025 


Hs.78589 


415817 


U88967 


Hs.78867 


416658 


U03272 


Hs.79432 


417034 


NM_006183 


Hs.80982 


417079 


U55590 


Hs.81134 


417308 


H60720 


Hs.81892 


417389 


BE260964 


Hs.82045 


417433 


BE270266 


Hs.82128 


417933 


X02308 


Hs.82962 


418478 


U38945 


Hs.1174 


418506 


AA084248 


Hs.85339 


418678 


NM.001327 


Hs.167379 


419121 


AA374372 


Hs.89626 


419171 


NM.002846 


Hs.89655 


419183 


U60669 


Hs.89663 


419216 


AU076718 


Hs.164021 


419235 


AW470411 


Hs.288433 


419452 


U33635 


Hs.90572 


419556 


U29615 


Hs.91093 


420610 


AI683183 


Hs.99348 


421110 


AJ250717 


Hs.1355 


421379 


Y15221 


Hs.103982 


421474 


U76362 


Hs.104637 


421552 


AF026692 


Hs.105700 


421753 


BE314828 


Hs.107911 


421817 


AF146074 


Hs.108660 


422109 


S73265 


Hs.1473 


422158 


L10343 


Hs.1 12341 


422282 


AF019225 


Hs.1 14309 


422283 


AW411307 


Hs.1 14311 


422424 


A1186431 


Hs.296633 


422765 


AW409701 


Hs.1578 


422809 


AK001379 


Hs.121028 


422867 


L32137 


Hs.1584 


422956 


BE545072 


Hs.122579 


423634 


AW959908 


Hs.1690 


423673 


BE003054 


Hs.1695 


423951 


D13666 


Hs.136348 


424046 


AF027866 


Hs.138202 


424381 


AA285249 


Hs.146329 



Unigene Title 

matrix metalloproteinase 1 0 (stromerysin 
UV-1 protein, estrogen regulated 
ENSP00000251056*J , lasma membrane calcium 
cysteine knot superfamfly 1, BMP antagon 
interteukin 8 

neurotrophic tyrosine kinase, receptor, 
serine/threonine kinase 15 
Hypothetical protein. XPJ51860 (WAA1 19 
XAGE-1 protein 

laminin, gamma 2 (nicein (lOOxD), kaBni 
serine (or cysteine) proteinase Inhibito 
cystatinSN 

minichrornosome maintenance deficient (S. 
Inhibin, beta A (acfivin A, acSvtn AB a 
kallikrein11 
carbonic anhydrase IX 
transmembrane protease, serine 4 
RAB6 interacting, Idnesin-Hke (rabkines 
ESTs 

plasminogen activator, urokinase 
CDC28 protein kinase 1 
tissue factor pathway inhibitor 2 
serine (or cysteine) proteinase inhibito 
protein tyrosine phosphatase, receptaM 
fibrillin 2 (congenital contractural ara 
neurotensin 

interieuJdn 1 receptor antagonist 
KIAA0101 gene product 
midkine (neurite grovrth-promoting factor 
5T4 oncofetal trophoblast glycoprotein 
tnyrntdylate synthetase 
cycfin-dependent kinase inhibitor 2A (me 
G protein-coupled receptor 39 
cancer/tesfis antigen (NY-ESO-1) 
parathyroid hormone-flke hormone 
protein tyrosine phosphatase, receptor t 
cytochrome P450, subfamily XXIV (vitamin 
small inducible cytokine subfamily B (Cy 
neurotrimin 

PTK7 protein tyrosine kinase 7 
chilinase 1 (chitotriosidase) 
distal-less homeo box 5 
cathepsin E 

small inducible cytokine subfamfly B (Cy 
sohite carrier family 1 (glutamate trans 
secreted frtezled-related protein 4 
ATP-Wnding cassette, sub-family B (MOW 
ATP-binding cassette, suttfamBy C (CFTR 
gastrin-releaslng peptide 
protease inhibitor 3, sxirwierrved (SKAL 
apolipoprotein L 

COC45 (cell division cycle 45, S.cerevrs 
prostate differentiation factor 
bacubviral IAP repeat-containing 5 (sur 
hypothetical protein FU 10549 
carblage oligomeric matrix protein (pse 
ECT2 protein (Epithelial cell transformi 
herjarirhbinding growth factor binding pr 
matrix metalloproteinase 12 (macrophage 
periostin (OSF-2os) 
serine (or cysteine) pi 
protein kinase ChJc2 
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424502 


AF242388 


Hs.149585 


424503 


NMJJ02205 


Hs.1 49609 


4246B7 


J05070 


Hs.1 51738 


425247 


KM-005940' 


Hs.1 55324 


425322 


U63S30 


Hs.1 55637 


425650 


NM.001944 


Hs.1925 


425734 


AF056209 


Hs.159335 


42577B 


U25128 


Hs.1 59499 


425852 


AK001504 


Hs.1 59551 


426215 


AW963419 


Hs.1 55223 


426427 


M86699 


Hs.1 69840 


426514 


BE616633 


Hs.170195 


427335 


AA448542 


Hs.251677 


427747 


AW41 1425 


Hs.1 80655 


428242 


H55709 


Hs^250 


428330 


L22524 


Hs.2256 


428450 


NMJJ14791 


Hs.1 84339 


428479 


Y00272 


Hs.334562 


428484 


AF1 04032 


Hs.1 84601 


428664 


AK001665 


Hs.1 89095 


428698 


AA852773 


Hs.334838 


428748 


AW593206 


Hs.98785 


428758 


AA433988 


Hs.98502 


428969 


AF120274 


Hs.1 94689 


429211 


AF052693 


Hs.1 98249 


429263 


AA019004 


Hs.1 98396 


429547 


AW009168 


Hs.99376 


429510 


AB024937 


Hs.211092 


429903 


AL1 34197 


Hs.93597 


430486 


8E0S2109 


Hs.241551 


431462 


AW583672 


. Hs.256311 


431515 


NM 012152 


HS.25B5B3 


431846 


8E019924 


Hs.271580 


431958 


X63629 


Hs.2877 


432201 




Hs.298241 


433001 


AF217513 


Hs.279905 


435505 


AF700492 


Hs^11238 


436481' 


AA379597 


Hs 5199 


437016 


AU076916 


Hs.5398 


437044 


AL035864 


Hs.69517 


437789 


AI581344 


Hs.127812 


437852 


BE001836 


Hs.256897 


439223 


AW238299 


Hs.250618 


439477 


W69813 


Hs.58042 


439606 


W79123 


Hs.58561 


439738 


BE246502 


Hs.9598 


440006 


AK000517 


Hs.6844 


441362 


BE614410 


Hs.23044 


442117 


AW664964 


Hs.1 28899 


443247 


BE614387 


Hs.333893 


443426 


AF098158 


Hs.9329 


443859 


NM_013409 


Hs.9914 


444006 


BE395085 


Hs.1 0085 


444371 


BE540274 


Hs.239 


444381 


BE387335 


Hs.283713 


444781 


NM_014400 


Hs.1 1950 


445537 


AJ245671 


Hs.1 2844 


446619 


AU076643 


Hs.313 


446921 


AB012113 


Hs.1 6530 


447033 


AI357412 


Hs.1 57601 


447342 


AI199268 


Hs.1 9322 


448243 


AW369771 


Hs.52620 


448844 


AI581519 


Hs.177164 


449048 


Z45051 


. Hs.22920 


449722 


BE280074 


Hs.23960 


450001 


NMJW1044 


Hs.406 


450375 


AA009547 




450701 


K39960 


HS.2B8467 


450983 


AA3053B4 


Hs.25740 


451668 


Z43948 


Hs.325444 


452281 


T93500 


Hs.28792 


452401 


NM.007115 


Hs.29352 


452747 


BE1 53855 


Hs.61460 


452838 


U65011 


Hs.30743 


,453968 


AA847843 


Hs.62711 


457489 


AI693815 


Hs.127179 
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Ksp37 protein 

CA1 25 antigen; mucin 16 

artemin 

gap junction protein, beta 5 (connexin 3 
ATP-birufing cassette, sub-family A (ABC1 
ESTs 

LUNX protein; PLUNC (palate lung and nas 
cydln-dependent kinase 5, regulatory su 
chloride channel, calcium activated, fern 
granin-fika neuroendocrine peptide precu 
endothelial differentiation, tysophospha 
uroplaWn 1B 

cadherin 3. type 1 r P-cadharin (placenta 
Transmembrane protease, serine 3 
clone HQ031QPRO0310p1 
mterleukin-1 homolog 1 
HSPC150 protein similar to uMquIfin-con 
guanine mon phosphate synthetase 
differentially expressed in Fanconfs an 
ESTs, Weakly similar to T17330 frypolheti 
ESTs, Weakly similar to dJ365012.1 |H.sa 
UL1 6 binding protein 2 

ESTs, Moderately similar to GFR3.HUMAN G 
G protein-coupled receptor 87 
sema domain, Immunoglobulin domain (Ig), 
MALP2 protein; PYRiN-Contaning APAF1-C 
RAD51 (S. cerevisiae) homolog (E coD Re 
ESTs; hypothetical protein for IMAGE447 
c-Myc target JP01 

chromosome 20 open reading frame 1 
foffistatin 

type I transmembrana protein Fn14 
forkheadboxMI 

ESTs, Weakly simBar to S64054 hypotheti 
GPt-ancnored metastasis-associated prate 
EGF-Gke-domain, multiple 6 
secreted phosphoprotein 1 (osteopontin, 
small Inducible cytokine subfamfly A (Cy 
ESTs 

Homo sapiens, Similar to RIKEN cDMA 2010 

integrin, beta 8 

ESTs 

similar to S68401 (cattle) glucose Indue 
cycBn B1 

solute carrier family 6 (neurotransmitte 
a disintegrin and metaltoproteinase doma 
hypothetical protein XP.038151 (bucme- 
ER01 (S. cerevtsiaeHike 
cartOags acidic protein 1 
Homo sapiens cONA FU1 1041 fis. done PL 
tumor necrosis factor, alpha-induced pro 
Ig superfamfly receptor LN1R 
preferentially expressed antigen in mala 
High mobility group (nonhistone chrcmoso 
cryptic gene 



SJTL 

mAb&sm 
orag 

mAb& drag & s.m. 

SJTL 

mAb 

SJTL 

mAb&diag 

mAb&sm 

mAb&diag 

CTL&sm 

mAb&diag 

CU 

s.m. 

diag 

mAb&diag &sm 

SJ1L 

sm 

mAb&sjTi. 
CTL&sm 
mAb 



diag 

mAb & s.m 
mAb&sm 
diag 

mAb&diag 
s.m. 

mAb&sm 
diag 

mAb&sm 

mAb&diag 

mAb & diag 

mAb & diag & s.m 

sm 

diag 

sm 

sm 

CTL 

CTL 

mAb&sm 
mAb 

mAb&sm 

mAb&sm 

mAb&sm 

sm 

sm 

mAb&sm 

CU 

CTL 

diag 

mAb 

sm 

diag 

mAb&diag 
mAb&diag 
diag 
diag 

CTL&diag 
CTL 

mAb&sm - 

mAb&sm 

mAb 

sm 

mAb & sm 
mAb & diag & sm 
mAb&diag 

mAb&diag 

(Sag 

diag 

mAb 

CTL 

CTL&sm 
diag 



cytoplasmic 
ptema membrane 
extracellular 
secreted 
cytoplasmic 



plasma membrane 
plasma membrane 
secreted 



secreted 

cytoplasmic 

cytoplasmic 

extracellular 

nuclear 

nuclear 

plasma membrane 
nuclear 

extracellular 

mttochodria* 

extracellular 

plasma membrane 

plasma membrane 

secreted 

secreted 

plasma membrane 
extracellular 
plasma membrane 
plasma membrane 
plasma membrane 
plasma membrane 
nuclear 
secreted 

cytoplasm 
ER 

nuclear 

plasma membrane 
plasma membrane 

plasma membrane 
plasma membrane 
nuclear 

plasma membrane 
extracellular* 

extracellular 
plasma membrane 
nuclear 
secreted 

plasma membrane 
secreted 



secreted 

plasma membrane 

plasma membrane 
cytoplasm 
plasma membrane 
plasma membrane 
plasma membrane 
secreted 

plasma membrane 

extracellular 
plasma membrane 
nuclear 



secreted 



TABLE 14B 



Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 



CAT Number Accession 
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WO 02/086443 
414883 15024 J 



450375 83327,1 



PCTAJS02/12476 

AA92S960 AA926959 W76521 W24270 W21526 AA037172 BE267636 H 831 86 AA4 69909 N86396AA001348 BE535736AA081745BE565245 
AA082436 H72525 H77575 M978B W80565 H78746 BE569085 W04339 R981 27 T55938 BE279271 AW960304 T29812 AA476873 BE297387 
AA292753AA177048NM_C01B25X54941 BE3 14356 AA9D9783 AI71 9075 BE270172 BE28981 9 AA8B9955 A1204530 W25243 AI935150 
AA872039 VV72395 T99630 AI422691 H98480 N31428 BE255916 H03265 AI857576 AA776920 AA910644 AA459522 AA293140 AW514567 
R75953 AW562396 AA662522 AI865147 AI423153 AW262230 AA584410 AA583187 AW024595 AW069734 AI828995 AA282997 AA876046 
AW61 3002 AA527373 AW9724S A1831360 AAS21 337 AA1 00926 AA77241 8 AA594628 A! 033892 W95096 AI034317 AA398727 AI085031 
N95210 AI459432 AI041437 AA932124 AA627684 AA935829 AI004827 AJ423513 AI094597 H42079 R54703 AI530359 AA517681 AAS78045 
AA643280W44561A1991988AI537592A109Q262AA740817AI312104AI911822M416871 AI1 85409 AA1 29784 AA701 623 AI075239 
AI139549 AA633648AI339996A1338880 AA399239 A1078708AI085351 AI362835 AI346618 A1146955 A1989380 AI348243 N92892 AA765850 
AM94230 AI278887 AA962596 AW92600 VW0435 MM 

AM9421 1 AW059501 AW886710 R92790 N59755 A136112B AW589407 H47725 H97534 H48076 H48450 T99631 AW30075B H03431 R76789 
AA954344 H77576 R96823 AI457100 N92845 N49682 H42038 BE220698 BE220715 H99552 AA701624 N74173 R54704 K79520 H72923 
H03266 BE261919 AA769633 AA480310 AA507454 AA910586 A1203723 AW104725 VV2561 1 W25071 T88980 HD3513 T77589 R99156 
W950S5 R97470 AA702275 T77551 AA91 1952 H82956 N83673 AA283672 

/WX)9647M131254AA374293AW954405H(mi0AW606284M151166BE157457BE157601 H04384 W46291 AW563674 H04021 H01532 
M190993 H03231 H59605 H01 642 AA852876 M1 13758 M62691 5 M746^^ 



TABL£14C 

Pkey: 
Ref: 

Strand: 
NLposilion: 

Pkey 

402075 



Unique number corresponding to an Eos probeset 

Sequence source. The 7 digit numbers In this column are Genbank identifier (GI) numbers. "Dunham I. et af refers to the publication entitled The DNA 
sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489495. 
Indicates DNA strand from which exons were predicted. 
Indicates nucleotide positions of predicted exons. . 

Ref Strand NLposifcn 

8117407 Plus 121907-122035,122804.122921.124019-124161,124455.124610,125672-126076 
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WO 02/086443 PCT/US02/12476 
TABLE 15A: InfonnaSon for aD sequences in Table 16 

TaUe 15A shows the Seq ID No, Pkey, ExAccn, UrtfgenelD, and Unigene Title for ail of the sequences In Table 16. 

.Table 15B show the accession numbers for those Pte/s lacking Unigenelffs for bote ISA For each probes* twe ^f^^^^^^^ ue ^ 
dfaonucteofides were designed. Gene dusters were compiled using sequsnces derived from Genbank ESTs and mRNAs. These sequences ^^^ b ^^ s ^ uenC8 
S^g aXinTand Alignment Tools {DoubJeTwfeToaWand CaEromia). The Genbank accession numbers for sequences compnsmg each cluster are fisted « the 
'Accession" column. 

Table 15Cshowttegenorrfc^^ For each predicted exon, we have Hsted the genomic : 

sequence source used for prediction. Nucleotide locations of each predicted excn are also listed 



Seq ID No: Sequence ID number 

Pkey: Unique Eos probeset identifier "number 

ExAccn: Exemplar Accession number, Genbank 

UnigenelD: Unigene number 

Unigene TitlaUnigene gene title 



Seq ID No: 



Seq ID No: 14 2 
SeqlDNa344 
SeqlDNo:S&6 
SeqIDNa748 
Seq!DNo:9&10 
Seq ID No: 11 & 12 
Seq ID Na 13 4 14- 
Seq ID No: 15 & 16 
Seq ID No: 17418 
Seq ID No: 19 & 20 
Seq ID No: 21& 22 
Seq ID No: 23 & 24 
Seq ID No: 25 & 26 
Seq ID No: 27 & 28 
Seq ID No: 29 & 30 
Seq ID No: 31&32 
Seq ID No: 33 & 34 
Seq ID No: 35 & 36 
Seq ID Na 37 4 38 
Seq ID No: 39 & 40 
Seq ID No: 41 & 42 
Seq ID Na 43444 
Seq ID No: 45 & 46 
Seq ID No: 47 & 48 
Seq ID No: 49 
Seq ID No: 50 & 51 
Seq ID No: 52 & 53 
Seq ID No: 54 & 55 
Seq ID No: 56457 
Seq ID No: 58 & 59 
Seq ID No: 60 & 61 
Seq ID No: 624 63 
Seq ID No: 644 65 
Seq ID Na 66 4 67 
Seq ID No: 68 4 69 
Seq ID No: 70 4 71 
Seq ID No: 72 4 73 
Seq ID No, 74 4 75 
Seq ID No: 76 4 77 
Seq ID No: 78 4 79 
Seq ID No: 80 & 81 
Seq ID No: 82483 
Seq ID No: 84 4 85 
Seq ID No: 86 4 87 
Seq ID No: 88489 
Seq ID Na 90 4 91 
Seq ID Na 92493 
Seq ID No: 94 4 95 
Seq ID No: 96 4 97 
Seq ID No: 98 4 99 
Seq ID Na 1004 101 
Seq ID Na 102 4 103 
Seq ID Na 104 4 105 
Seq ID Na 106 4 107 
Seq ID Na 108 4 109 
Seq ID Na 1104 111 
SeqtDNa1124113 
Seq ID Na 114 & 115 
Seq ID Na 116 
SeqlDNa1174118 
Seq ID Na 119 4 120 
Seq ID Na 121 4 122 
Seq ID Na 1234124 
Seq ID Na 1254 126 



Pkey 


ExAccn 


UnigenelD 


410407 


X66839 


U_ COOB7 


412719 


AW016510 


Hs.616 


417034 


NMJJ08183 


Ks.o0902 


430486 


BE062109 


Hs24155l 


407788 


BE514982 


KS.30391 


407788 


BE514982 


Hs.38991 


4077B8 


BE514982 


Hs.33S91 


407788 


BE514982 


Hs.38991 


439285 


AL1 33916 




413753 


U17760 


Hs.75517 


120486 


AW368377 


MS. 137509 


425650 ! 


NM_001944 


Hs.1 925 


412140 


AA219691 


U_ "70COC 

HS.736Z5 


423673 


BE003054 


Hs.1695 


452838 


U55011 


HS-30743 


418663 


AK001100 


Hs.41690 


418663 


AK00110Q 


HS.4109U 


409632 


W74001 


U- CC770 


429610 


AB024937 


u_ 01 moo 
HSuii 1U9£ 


406690 


M29540 


U» 'WICOO 

Hs.zzoozy 


431846 


BE0199Z4 


u. 07icnn 


418830 


BE51 373 1 


ns.ooyoy 


424098 


AF077374 


Lie 1*J(W99 


443648 


AI085377 


nS.i*rJOiw 


311034 


BE56713Q 


LI— OHO OQ 


408522 


AI 541214 




422158 


L 10343 


Ue 1 1 9141 

ns. i \com 


435505 


ArZUlWyZ 


u. 9110%) 


417366 




Vkt 107R 
ns. \vi\> 


431958 




Lie 9077 


441020 






423217 




Hs.1640 


4<iSWo 


DC. \a CO 3d. 


Hs.11261 


440/ JO 




Hs.1 87958 
rip. ivi J* 


444371 


BE540274 


Hs.239 


444371 


BE540274 


Hs.239 


444371 


BE540274 


Hs.239 


422168 


AA586894 


Hs.1 12408 


422168 


AA5B6894 


Hs.1 12408 


429259 


AA420450 


Hs.292911 


426440 


BE382756 


Hs-169902 


437044 


AL035864 


Hs.69517 


423662 


AK001035 


Hs.130881 


428484 


AF104032 


Hs.184601 


429211 


AF052693 


Hs.198249 


417389 


BE260964 


Hs.82045 


423634 


AW959903 


Hs.1690 


417515 


L24203 


Hs.82237 


441362 


BE614410 


Hs.23044 


425322 


U63630 


Hs.155637 


449003 • 


X76342 


Hs.389 


431009 


BE149762 


Hs.48956 


409103 


AF251237 


Hs.112208 


417542 


J04129 


Hs.82269 


428471 


X57348 


HS.1B4510 


418004 


U37519 


Hs.87539 


414761 


AU077228 


Hs.77255 


418203 


X54942 


Hs.83758 


447343 


AA255641 


Hs.235894 


437016 


AU076916 


Hs.5398 


449230 
446989 


BE613348 


Hs.211579 


AK001898 


Hs.16740 


457819 


AA057484 


Hs.35406 


424687 


J05070 


Hs.151738 



Unigene Title 

carbonic anhydrase IX 
ESTs 

neurotensin 

chloride channel calcium activated, fern 
S100 calcium-binding protein A2 
S100 calcium-binding protein A2 
$100 caldum-binding protein A2 
S100 calcium-binding protein A2 
hypothetical protein FU20093 
laminin, beta3(nicein (125kD), kafinin 
tumor protein 63 kDa with strong homolog 
desmogtein 3 (pemphigus vulgaris antigen 
RAB6 interacting, kinesin-like (rabkines 
matrix metaBoproteinase 12 (macrophage 
preferentially expressed antigen in mela 
desmocoDIn3 
desmocollin3 

serine (or cysteine) proteinase inhibit© 
LUNX protein; PLUNC (palate lung and nas 
carttooembryonic anSgeiHetated cell ad 
uroplakmlB 

hypothetical protein MGC481 6 
small praline-rich protein 3 
ESTs 

ESTs, Highly similar to NKGD JiUMAN NKG2- 

Small praline-rich protein SPRK (human, 

protease inhibitor 3, skin-derived (SKAL 

interleukin-1 homolog 1 

small profine-rich protein 1B (comifin) 

cadherin 3, type 1, P-cadherin (placenta 

ESTs 

collagen, type VII, alpha 1 (epidermorys 

small pronne-rich protein 2A 

solute carrier family 6 (neumtransmitte 

forkhead box M1 

forkhead box M1 

forkhead box M1 

S100 calcium-binding protein A7 (psorias 
S100 calcium-binding protein A7 (psorias 
PlakophiDn 

solute carrier family 2 (facilitated ghi 
differenuaify expressed in FanconTs an 
&*eH OUtymphoma 11A (zinc finger pro 
solute carrier family 7 (cafionic amino 
gap junction protein, beta 5 (connexin 3 
midkine (neurile growth-promoting factor 
heparin-Wnding growth factor binding or 
ataxia-telangiectasia group D-associated 
RAD51 (S, cerevlsiae) homolog (E cdi Re 
protein kinase, DNA^ctivated, catalytic 
alcohol dehydrogenase 7 (class IV), mu o 
gap junction protein, beta 6 (connexin 3 
XAGE-1 protein 

progestagen-associated endometrial prote 
strafifin 

aldehyde dehydrogenase 3 family, member 

enhancer of zeste (Drosophila) homolog 2 

CDC28 protein kinase 2 

ESTs, Highly similar b S02392 aIpha-2-m 

guanine monphosphate synthetase 

melanoma cell adhesion molecule 

hypothetical protein FU1103S 

ESTs, Highly similar to unnamed protein 

matrix metanoproteinase 9 (gelatinase B 
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WO 02/086443 
SeqlDNo:127&128 
Seq ID No: 129 & 130 
Seq ID No: 131 & 132 • 
Seq IDNa 133 & 134 
5 SeqlDNoc135&136 
SeqlDNa: 137 & 138 
SeqtDNa 139 & 140 
SeqlDNa 141 & 142 
Seq IDNa 143 & 144 

10 SeqlDNa 145 & 146 
Seq IDNa 147 & 148 
SeqlDNa 149 & 150 
SeqlDNa 151 & 152 
SeqlDNa 153 & 154 

IS Seq IDNa 155 & 156 
SeqlDNa 157 & 158 
SeqlDNa 159 & 160 
SeqlDNa 161 & 162 
Seq IDNa 163 & 164 

20 Seq IDNa 165 & 166- 
SeqlDNa 167 & 168 
SeqlDNa 169 & 170 
Seq IDNa 171 & 172 
Seq IDNa 173 & 174 

25 SeqlDNa 175 & 176 
SeqlDNa 177 & 178 
SeqlDNa 179 & 180 
Seq D No: 181 & 182 
Seq ID No: 183 & 184 

30 Seq ID No: 185 & 186 
Seq ID No: 187 & 188 
Seq ID No: 189 & 190 
SeqlDNa 191 & 192 
Seq ID No: 193 & 194 

35 Seq ID No: 195 & 196 
SeqlDNa 1974 198 
Seq ID No: 199 & 200 
Seq ID No: 201 & 202 
SeqlDNa 203 & 204 

40 Seq ID No: 205 & 206 
Seq ID No: 207 & 208 
Seq IDNa 209 & 210 
Seq ID No: 21 1 & 212 
Seq ID No: 213 & 214 

45 Seq IDNa 215 & 216 
Seq ID No: 217 & 218 
SeqlDNa 219 & 220 
Seq ID No: 221 & 222 
Seq ID No: 223 4 224 

50 SeqlDNa 225 4 226 
Seq ID No: 2274 228 
Seq IDNa 229 4 230 
Seq IDNa 231 4 232 
Seq ID Na 233 

55 Seq IDNa 234 4 235 
Seq ID No: 236 4 237 
Seq ID No: 238 
SeqlDNa 239 4 240 
Seq IDNa 241 4 242 

60 Seq IDNa 243 4 244 
Seq ED No: 245 
Seq IDNa 246 4 247 
Seq IDNa 248 4 249 
SeqlDNa 2504251 

65 Seq IDNa 252 4 253 
Seq ID No: 254 4 255 
Seq ID No: 256 4 257 
Seq IDNa 258 4 259 
Seq ID No: 260 4 261 

70 Seq IDNa 262 4 263 
Seq IDNa 264 4 265 
Seq IDNa 266 4 267 
Seq IDNa 268 4 269 
Seq IDNa 270 4 27.1 

75 Seq ID No: 272 4 273 
Seq ID No: 2744 275 
Seq ID No: 276 4 277 
Seq ID No: 278 4 279 
Seq IDNa 280 4 281 

80 SeqlDNa 282 

Seq ID No: 283 4 284 
Seq IDNa 2854286 
Seq ID Na 2874288 
SeqlDNa2B94 290 

85 Seq IDNa 291 4 292 



414430 


AI346201 


Hs.76118 


418462 


BE001596 


Hs.85265 


100668 


105424 


Hs.169610 


458933 


AI63B429 


Hs^4763 


418478 


U38945 


Hs.1174 


418478 


U38945 


Hsll174 


418478 


U38945 


Hs.1174 


418478 


U38945 


Hs.1174 


446269 


AW263155 


Hs.14559 


422765 


AW409701 


Hs.1578 


436481 


AA379597 


Hs.5199 


440325 


NMJJ03B12 


Hs.7164 


439606 


W79123 


Hs.58561 


453884 


AA355925 


Hs.36232 


4531384 


AA355925 


Hs.36232 


453884 


AA355925 


Hs.36232 


453864 


AA355925 


Hs.36232 


404877 






413129 


AF292100 


Hs.104613 


413281 


AA851271 


Hs^22024 


444781 


NM_014400 


Hs.11950 


416819 


U77735 


Hs.80205 


451320 


AW1 18072 




418543 


NM_005329 


Hs.85962 


454034 


NM_000691 


Hs.575 


425397 


J04088 


Hs.156346 


415817 


U88967 


Hs.78867 


415817 


U88967 


Hs.78867 


415817 


U88967 


Hs.78867 


415817 


U88957 


Hs.78867 


415817 


U88967 


Hs.78867 


419121 


AA374372 


Hs.89626 


448993 


A1471630 


Hs.8127 


101/ 


AF1 46074 


Hs. 108660 






Hs.241305 


425057 


AAB26434 


Hs.1619 




nnwu ill 


Hs.97932 


MOOR** 


X02404 


■ Hs.274534 


100576 


XD0356 


Hs.37058 


101175 


U82671 


Hs.36980 


429038 


AL023513 


Hs.1 94766 


1 100/0 


MM 001177 


Hs.1 67379 


418678 


NM 001327 


Hs.1 67379 


131927 


AJ003112 


Hs.34780 


428182 


RF1MM2 


Hs.293317 


427335 


AA448542 


Hs>251677 


409420 


215008 


Hs.54451 


114346 


All 37256 


Hs.1 30489 


438956 


W00847 


Hs.1 35056 


404440 






415669' 


NM 00SJ25 


Hs.78589 


103312 


Y12642 


Hs.3185 


320843 


RP069288 


HS34744 


429065 


AJ753247 


Hs.29643 


446102 


AW1 68067 


Hs.317694 


330495 


U47924 


Hs.71642 


413573 


AI733859 


Hs.1 49089 


428479 


Y00272 


Hs.334562 


428479 


Y00272 


Hs.334562 


332180 


AF134160 


Hs.7327 


437915 


A1537993 


Hs.202312 


441553 


AA281219 


Hs.121296 


331692 


AI683467 


Hs.152213 


429413 


NM 014058 


Hs^01877 


422283 


AW411307 


Hs.114311 


448357 


N20169 


Hs.103923 


446292 


AF081497 


Hs^79682 


416209 


AA236776 


Hs.79078 


453922 • 


AF053306 


Hs^8708 


424046 


AF027866 


Hs.138202 


439223 


AW238299 


Hs.250618 


429228 


AI553633 


Hs.326447 


409757 


NMJJ01898 


Hs.123114 


411089 


AA456454 


Hs^14291 


436511 


AA721252 


Hs^91502 


428969 


AF120274 


Hs.194689 


428969 


AF120274 


Hs.194689 


428969 


AF120274 


Hs.194689 


428959 


AF120274 


Hs.194689 


407137 


T97307 




412723 


AA648459 


Hs.335951 


450701 


H39960 


HS.2S8457 


405770 






439453 


BE264974 


Hs.6566 


.414774 


X02419 


Hs,77274 



uhtquifa) carboxyUerminal esterase L1 
intsgrin, beta 4 

CD44 antigen (homing function and Indian 
RAN binding protein 1 
cydin-dependent kinase inhibitor 2A (me 
cydin-dependent kinase inhibitor 2A (me 
cycfin-dependent kinase inhibitor 2A (me 
cycfin-dependenl kinase inhibitor 2A (me 
hypothetical protein FU 10540 . ' 
bacutoviral IAP repeat-containing 5 (sur 
HSPC1 50 protein sirmlar to ubiquifin-con . 
a disintegrin and metafloprotelnase doma 
G protein-coupled receptor 87 
K1AA0185 gens product 
K1AA0186 gene product 
WAA018S gene product 
K1AA0186 gene product 
NM_005365:Homo sapiens melanoma antigen, 
RP42homotog 
transcription factor BMAL2 
GR-anchored metastasis-associated prote 
pim-2 oncogene 

diacytgtycerd kinase, zeta (104kD) 
hyaturonan synthase 3 
aldehyde dehydrogenase 3 My, member 
topcisomerase (DNA) 11 alpha (170kD) 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
parathyroid hormona-fike hormone 
KIAA0144 gene product 
ATP-birufing cassette, sub-family C (CFTR 
estrogen-responsive B box protein 
achaete-scuta complex (DrosaphSa) homol 
chondromoduDn I precursor 
caidtoniiwelated polypeptide, beta 
caldtorur^caldtonln-relaled pdypeptid 
melanoma antigen, family A, 2 
seizure related gene 6 (mouseHike 
cancer/tesiis antigen (NY-ESO-1) 
cancer/tesSs antigen (NY-ESO-1) 
doublecortex; lissencephaly, X-Hnked (d 
ESTs, Weakly similar to GGC1_HUMAN 6 ANT 
G antigen 7B 

laminin, gamma 2 (nicein (100kD), kalini 
ATPase, ammophospholipid transporter-fl 
Human DNA sequence from clone RP5-850E9 
NMJ)21048:Homo sapiens melanoma anligen, 
serine (or cysteine) proteinase inhibit© 1 
lysosomal 

Homo sapiens mRNA; cDNA DKFZp547C136 (fr 
Homo sapiens cDNA RJ 13103 fis, done NT 
ESTs 

guanine nucleotide binding protein (G or 
ESTs 

cell division cycle 2, G1 to S and G2 to 
cell division cyde 2, G1 to S and G2 to 
daudin 1 

Homo sapiens clone N1 1 NTera2D1 teratoca 
ESTs 

wingless-type MMTV integration site faml 
DESC1 protein 

CDC45 (cell division cyde 45, S.cerevis 
RAB38, member RAS oncogene family 
Rh type C glycoprotein 
MAD2 (mitotic arrest deGcient, yeast, h 
budding uninhibited by benzimidazoles 1 
serine (or cysteine) proteinase inhibito 
U116 binding protein 2 
ESTs 

cystatin SN 

cell Division cyde 2-Bke 1 (PITSLRE pr 

ESTs 

artemin 

artemin 

artemin 

artemin 

gb:ye53h05^1 Soares fetal Ever spleen 
hypothefical protein AF301 222 
hypothetical protein XPJJ98151 (teucine- 
NM_002362ttomo sapiens melanoma antigen, 
thyroid hormone receptor Intefactor 13 
plasminogen activator, urokinase 



PCTAJS02/12476. 



185 



WO 02/086443 

Scq 10 No: 293 & 234 424629 

Seq ID Na 295 & 295 

Seq ID No: 297 & 298 

Seq ID Na 299 A 300 
5 Seq ID No: 301 & 302 

Seq ID Na 303 & 304 

Seq ID Ma 305 & 306 

Seq ID No: 307 & 308 

Seq ID No: 309 
10 Seq ID No: 310 & 311 

Seq ID No: 312 & 313 

Seq ID No: 314 & 315 

Seq ID Na 316 & 317 .„„. 

Seq ID Na 318 & 319 409228 
15 Seq ID No: 320 & 321 425734 

Seq ID Na 322 & 323 413582 

Seq ID No: 324 & 325 438403 

Seq ID Na 326 & 327 403329 

Seq ID Na 328 & 329 409893 
20 Seq ID Na 330 & 331 

Seq ID Na 332 4333 

Seq ID No: 334 & 335 

Seq ID No: 336 & 337 

Seq ID Na 338 & 339 
25 Seq ID Na 340 & 341 

Seq ID Na 342 & 343 

Seq ID Na 344 & 345 

Seq ID No: 346 & 347 

Seq ID Na 348 4349 
30 Seq ID Na 350 & 351 

Seq ID Na 352 & 353 

Seq IDNa354 &355 

Seq (DNa356 &357 

Seq ID Na 358 & 359 
35 Seq ID Na 360 & 351 

Seq ID Na 362 & 353 

Seq ID No: 364 & 365 

Seq ID No: 356 4 367 

Seq ID Na 366 & 369 
40 Seq ID Na 370 & 371 

Seq ID No: 372 4 373 

Seq ID Na 374 4375 431565 

Seq ID Na 376 4 377 132354 

Seq ID Na 378 4 379 424441 
45 Seq ID Na 380 4 381 103768 

Seq ID Na 382 4 383 417512 

Seq ID Na 384 4 385 425266 

Seq ID Na 386 4 387 424503 

Seq ID No: 388 4 389 
50 Seq ID Na 3904 391 

Seq ID Na 392 4 393 

Seq ID Na 394 4 395 

Seq ID Na 396 4 397 

SeqIDNa39B 4 399 418506 
55 Seq ID Na 400 4 401 423961 

Seq ID No: 402 4 403 414612 

Seq ID Noc 4044405 417433 

Seq ID Na 406 4 407 417433 

Seq ID Na 408 4 409 422867 
60 Seq ID Na 4104 411 428227 

Seq ID Na 412 4 413 444381 

Seq ID Na 4144 415 400303 

Seq ID Na 4164 417 411769 

Seq ID Na 418 4 419 
65 Seq ID Na 420 4 421 

Seq ID Na 422 4 423 

Seq ID No: 424 4 425 452747 

Seq ID No: 426 4 427 450375 

Seq ID Na 428 4 429 
70 Seq ID Na 430 4 431 

Seq ID No: 4324433 

Seq ID No: 4344435 

Seq ID No: 436 4 437 

Seq ID No: 438 4 439 
75 Seq ID No: 440 4 441 

Seq ID No: 442 4 443 

Seq ID No: 4444445 

Seq ID No: 446 4 447 

Seq ID No: 448 4 449 
80 Seq ID No* 450 4 451 

Seq ID No: 452 4 453 

Seq ID Na 4544455 

Seq ID Na 456 4 457 

Seq ID No: 458 4 459 41655B 
85 Seq ID No: 460 4 461 407811 



PCT/US02/12476 



437789 
437789 
437789 
437789 
437789 
453968 
403478 
441525 
434105 
428810 
413691 
423934 



119073 
113195 
102283 
101345 
103280 
102012 
105729 
134299 
412719 
422158 
128924 
100486 
419121 
409459 
330493 
417866 
418113 
437016 
429612 
440704 
431221 
431565 



400289 
418007 
418007 
418738 
415138 



.450098 
421552 



426215 
425247 
432201 
427585 
442117 
431211 
447033 
447033 
447033 
115522 
410418 
409041 
409041 
452461 
412420 



M90S56 Hs.151393 gtutamate-cystetoe Ogasa, catalytic sub 

AI581344 Hs.1 27812 ESTs, Weakly similar to T1 7330 hypotheS 

AI581344 Hs.127812 ' ESTs, Weakly similar to T1 7330 rrypotheS 

AI581344 Hs.127812 ESTs, Weakly similar to T1 7330 hypotheS 

AI581344 Hs.127812 ESTs, Weakly similar to T1 7330 hypotheS 

AI581344 Hs.127812 ESTs, Weakly similar to T17330 hypotheti 

AA847B43 . Hs.62711 High mob2ity group (nonhistone chromoso 

NM_022342:Homo sapiens ta'nesin protein 9 

AW241867 Hs.127728 ESTs 

AW952124 Hs.13094 prasenffins associated rhomboid-Eke pro 

AF068236 Hs.1 93788 nitric oxide synthase 2A (inducible, hep 

AB023173 Hs.75478 ATPase, Class VI, type 11B 

U89995 Hs.159234 forkhead box E1 (thyroid transcription f 

R16811 Hs.22010 ESTs, Weakly similar to 210926QA B ceO 

AF056209 Hs.159396 peptidylglycinQ atpha-arrBdatirig monooxyg 

AW295647 Hs.71331 hypothetical protein MGC5350 

AA806607 Hs.292206 ESTs 

unnamed protein product (Homo sapiens J 

AW247090 Hs.57101 minichromosome maintenance deficient (S. 

BE245360 Hs.279477 v-ets erythroblastosis virus E26 oncogen 

H83265 Hs.8881 ESTs, Weakly similar to S41044 chromosom 

AW161552 Hs.83381 guanine nucteoMe binding protein 11 

NMJW5795 Hs.152175 calcitonin receptor-like 

U84722 Hs.76205 cadherin 5, type 2, VE-cadherin (vascula 

BE259035 Hs.1 18400 singed (Drosoph3a)-iIke (sea urchin fas 

H46612 Hs.293815 Homo sapiens HSPC285 mRNA, parfial cds 

AW580939 Hs.97199 complement component Clq receptor 

AW016610 Hs.816 ESTs 

L10343 Hs.1 1 2341 protease inhibitor 3, skin-derived (SKAL 

BE279383 Hs.26557 plakophiKn3 

T19006 Hs.10842 RAN, member RAS oncogene family 

AA374372 Hs.89626 parathyroid hormone-like hormone 

D86407 Hs.54481 low density lipoprotein receptor-related 

M27826 endogenous retroviral protease 

AW067903 Hs.82772 collagen, type XI, alpha 1 

AI272141 Hs.83484 SRY (sex determining region Y>box 4 

AU076916 Hs.5393 guanine monphosphate synthetase 

AF062649 Hs.252587 pituitary tumor-transforming 1 

M69241 Hs.1 62 insuOn-nke growth factor binding prate' 

AA449015 Hs^66145 SRB7 {suppressor of RNA polymerase B, ye 

AF161470 Hs^60822 butyrate-induced transcript 1 

AF161470 Hs^60622 butyratetoduced transcript 1 

BE1 85289 Hs, 1076 small praline-rich protein 1 B (comifin) 

X14850 Hs.1 47097 H2A histone family, member X 

AF086009 Hs.296398 gb:Homo sapiens full length insert cONA 

X76534 Hs.82226 glycoprotein (transmembrane) nmb 

J00077 Hs.155421 alpha^etoprotein 

NM.002205 Hs.149609 Integrin, alpha 5 (fibronectin receptor, 

X07820 Hs.2258 matrix metatloproteinase 10 (strome)ysin 

M13509 Hs.83169 matrix metatloproteinase 1 (interstitial 

M13509 Hs.83169 matrix melalloproteinase 1 (interstitial 

AW388633 Hs.6682 ' solute carrier family 7, (caSonic amino 

C 1 8356 Hs.295944 tissue factor pathway inhibitor 2 

M084248 Hs.85339 G proteto-coupled receptor 39 

D13668 Hs.136348 periosfin (OSF-2os) 

X72755 Hs.77367 monokine Induced by gamma interferon 

BE270266 Hs.82128 5T4 oncofetal trophoblast glycoprotein ■ 

BE270268 Hs.82128 5T4 oncofetal trophoblast glycoprotein 

L321 37 Hs. 1 584 cartilage oSgomeric matrix protein (pse 

AA321649 Hs^248 small Inducible cytokine subfamily B (Cy 

BE387335 Hs^83713 ESTs, Weakly similar to S64054 hypotheti 

AA242758 Hs.79136 UV-1 protein, estrogen regulated 

AF245505 Hs.72157 AcHcan 

AA852773 Hs.334833 K1AA1B66 protein 

W27249 Hs.8109 hypothetical protein FU21080 

AF026692 Hs.105700 secreted frizzled-related protein 4 

BE153855 Hs.61460 Ig superfamily receptor LNIR 

AA009647 a disinlegrin and metalloproteinase doma 

AW963419 Hs.155223 stanrttocafctn 2 

NMJ>D5940 Hs.1 55324 matrix melalloproteinase 11 (stromefysin 

AI538613 Hs.298241 Transmembrane protease, serine 3 

D31 1 52 Hs, 179729 collagen, type X, alpha 1 (Schmid metaph 

AW664964 Hs.128899 ESTs; hypothetical protein for IMAGE:447 

M86849 Hs.323733 gap Junction protein, bela 2, 26kD (conn 

A1357412 Hs.157601 ESTs 

AI357412 Hs.157601 ESTs 

AI357412 Hs.157601 ESTs 

BE614387 Hs.333893 c-Myc target JP01 

D31382 Hs.63325 transmembrane protease, serine 4 

AB033025 Hs.50081 Hypothetical protein, XPJJ51860 (WAA1 1 9 

AB033025 Hs.50081 Hypothetical protein, XPJJ51860 (K1AA1 1 9 

N78223 Hs.108106 transcription factor 

AL035568 Hs.73853 bone morphogenefic protein 2 

U03272 Hs.79432 fibrillin 2 (congenita! contractural ara 

AW1 90902 Hs.40098 cysteine knot surjerfarnfly 1, BMP antagon 
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Seq ID No: 462 & 463 
Seq ID No: 464 & 465 
Seq ID Na 465 & 467 
Seq ID No: 468 & 469 
Seq ID No: 470 & 471 
Seq ID No: 472 & 473 
Seq ID No: 474 & 475 
Seq ID No: 476 4477 
Seq ID No: 4784479 
Seq ID No*. 480 & 481 
Seq ID No: 482 & 483 
Seq ID No: 484 & 485 
Seq ID No: 486 & 487 
Seq ID No: 488 & 489 
Seq ID No: 490 & 491 
Seq ID No: 492 & 493 
Seq ID No: 494 & 495 
Seq ID Not 498 & 497 
Seq ID No: 498 & 499 
Seq ID No: 500 & 501 
Seq ID No: 502 & 503 
Seq ID No: 504 & 505 
Seq ID No: 508 &507 
Seq ID No: 508 &509 
Seq ID No: 510 & 511 
Seq ID No: 512 & 513 
Seq ID No: 514 & 515 
Seq ID Na 516 & 517 
Seq ID No: 518 & 519 
Seq ID No: 520 & 521 
Seq ID No: 522 & 523 
Seq ID No: 524 & 525 
Seq ID No: 526 & 527 
Seq ID No: 528 & 529 
Seq ID No: 530 & 531 
Seq ID No: 532 & 533 
Seq ID No: 534 & 535 
Seq IDNo:536 &537 
Seq IDNo:538 &539 
Seq ID No: 540 & 541 
Seq ID No: 542 & 543 
Seq ID No:544&545 
Seq ID No: 546 & 547 
Seq ID No:548&549 
Seq ID No: 550 & 551 
Seq ID No: 552 & 553 
SeqlDNo:554&555 
Seq ID No: 556 & 557 
Seq IDNo:558&559 
Seq ID No: 560 & 561 
Seq ID No: 562 & 563 
Seq ID No: 564 & 565 
Seq ID No: 566 & 567 
Seq ID No: 568 & 569 
Seq ID No: 570 & 571 
Seq ID No: 572 & 573 
Seq ID No: 574 & 575 
Seq ID No: 576 & 577 
Seq ID No: 578 & 579 
Seq ID No: 580 & 581 
Seq ID No: 582 & 583 
Seq ID No: 584 & 585 
Seq ID No: 586 & 587 
Seq ID No: 588 & 589 
Seq ID No: 590 & 591 
Seq ID No: 592 & 593 
Seq ID No: 594 & 595 
Seq ID No: 596 & 597 
Seq ID No: 598 & 599 
Seq ID No: 600 & 601 
Seq ID No: 602 & 603 
Seq ID No: 604 & 605 
Seq ID No: 606 & 607 
Seq ID No: 608 & 609 
Seq ID No: 610 & 611 
Seq ID No: 612 & 613 
Seq ID No: 614 & 615 
Seq ID No: 616 & 617 
Seq ID No: 618 & 619 
Seq ID No: 620 & 621 
Seq ID No: 622 & 623 
Seq IDNo:624 &625 
Seq ID No: 626 & 627 
Seq IDNo:628&629 
Seq IDNo:630 &631 



437852 
402075 
421110 
451668 
451666 
451668 
422282 
425852 
439738 
427747 
420281 
405932 
405932 
444342 
421379 
417079 
430890 
419721 
444471 
413063 
433800 
452401 
452401 
450001 . 
410407 
309931 
412719 
417034 
430486 
413753 
425650 
423673 
.418663 
418663 
429610 
406690 
431846 
422158 
431958 
437044 
428484 
429211 
417389 
431009 
417542 
449230 
410555 
410555 
424687 
418462 
410274 
439606 
404877 
4447B1 
418543 
415817 
415817 
415817 
415817 
415817 
415817 
421817 
418678 
418678 
409420 
332180 
408790 
408790 
439223 
409757 
428969 



AJ250717 

Z43948 

Z43948 

Z43948 

AF019225 

AK001504 

BE246502 

AW411425 



428969 
428969 
450701 
450701 
414774 
407944 
407944 
457489 
429547 
407242 
407242 
407242 
444005 



NM014398 

Y15221 

U65590 

X54232 

NNL001650 

AB020684 

AL035737 

AI034361 

NM.007115 

NM_007115 

N^001044 

X66839 

AW341683 

AW016610 

NM.006183 

BE062109 

U17760. 

NM.001944 

BE003054 

AK001100 

AK001100 

AB024937 

M29540 

BE019924 

L10343 

X63629 

AU035864 

AF104032 

AF052693 

BE260964 

BE149762 

J04129 

BE613348 

U92649 

U92649 

J05070 

BE001596 

AA381807 

W79123 

NM.014400 

NMJH5329 

UB3967 

U88967 

U88957 

U88957 



U88957 
AF146074 
NW.001327 
NMJJ01327 
Z15008 
AF134160 
AW580227 
AW580227 
AW238299 
NMJXH898 
AF120274 
AF120274 
AF120274 
AF120274 
H39960 
H39960 
XD2419 
R34008 
R34008 
A1593815 
AW009166 
M18728 
M18728 
M18728 

BE395085 Hs.10086 



Hs^56897 

Hs.1355 

HsJ26444 

Hs.326444 

HsJ26444 

Hs.1 14309 

Hs.159651 

Hs3598 

Hs.180655 

Rs.323494 



Hs.10887 

Hs.103982 

Hs.81134 

Hs.2699 

Hs.288650 

Hs.11217 

Hs.75184 

Hs.135150 

Hs.29352 

Hs^9352 

Hs.406 

Hs.63287 

Hs.816 

Hs.80962 

Hs.241551 

Hs.75517 

Hs.1925 

Hs.1695 

Hs.41690 

Hs.41690 

Hs.211092 

Hs.220529 

Hs.271560 

Hs.1 12341 

Hs.2877 

Hs.69517 

HS.1B4601 

Hs.198249 

Hs.82045 

Hs.48956 

Hs.82269 

Hs.211579 

Hs.64311 

Hs.64311 

Hs.151738 

Hs.85265 

Hs.61762 

Hs.58561 

Hs.1 1950 
H&.85962 
HS.7BB67 
Hs.78867 
Hs.78867 
Hs.78867 
Hs.78867 
Hs.78867 
Hs.1 08660 
Hs.167379 
Hs.167379 
Hs.54451 
Hs.7327 
Hs.47860 
Hs.47860 
Hs^50618 
Hs.123114 
Hs.194689 
Hs.194689 
Hs.194689 
Hs.194689 
Hs^88467 
Hs^88467 
Hs.77274 
Hs.239727 
Hs.239727 
Hs.127179 
Hs.99376 



ESTs, Weakly similar to dJ36501Z1 [H.sa 
ENSP00000251056*J , l3sma membrane calcium 
cathepstn E 

cartilage acidic protein 1 
cartilage acidic protein 1 
cartilage acidic protein 1 
apoDpoprotan L 

death receptor 6. TNF superfamBy member 
sema domain, irrtfTUinogtabuSn domain (Ig), 
serineAhreonine kinase 12 
Predicted cation efflux pump 



Cl5000305^3806122|gb|AAC®198Ll| (AF0 
similar to lysosome-associated membrane 
small inducible cytokine subfamily B (Cy 
irrterteuWn 1 receptor antagonist 
glypicanl 
aquaporin 4 
K1AA0877 protein 

chitinase 3-flke 1 (cartilage glycoprote 
lung type-l cell membrane-associated gty 
tumor necrosis factor, alpha-induced pro 
tumor necrosis factor, alpha-induced pro 
solute carrier family 6 (neuratransmitte 
carbonic anhydrase IX 

gb:hd13d0lJ(1 Soares.NFU_T_GBC_S1 Homos 
ESTs 

neurotensin 

chloride channel, calcium activated, tarn 
laminin, beta 3 (nicein (125kD), kaTmln 
desmoglein 3 (pemphigus vulgaris antigen 
matrix metaGoproteinase 12 (macrophage 
desmocollin 3 
desmocdlin 3 

LUNX protein; PLUNC (palate lung and nas 
carcinoembryonic antigen-related cell ad 
uroptakin 1B 

protease inhibitor 3, skin-derived (SKAL 
cadherin 3, type 1, P-cadherin (placenta 
differential expressed in FanconPs an 
solute carrier family 7 (cationic amino 
gap junction protein, beta 5 (connexin 3 
midkine (neurite growth-promoting factor 
gap junction protein, beta 6 (connexin 3 
pro gestagen- associated endometrial prote 
melanoma cell adhesion molecule 
adisintegrin and metalfoproteinase doma 
a disintegrin and metalloprotsinase doma 
matrix metalloprotelnase 9 (gelatinase B 
integrin, beta 4 
hypoxia-inducible protein 2 • 
G protein-coupled receptor 87 
NM_005365:Homo sapiens melanoma antigen, 
Gpfanchored metastasis-associated prote 



protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-^ 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase. receploM 
ATP-binding cassette, sub-family C (CFTR 
cancer/testis antigen (NY-ESO-1) 
cancer/testis antigen (NY-ESO-1) 
tembiin, gamma 2 (nicein (100kD). kalini 
claudlnl 

neurotrophic tyrosine kinase, receptor, 

neurotrophic tyrosine kinase, receptor, 

UL16 binding protein 2 

cystab'n SN 

artemin 

artemtn 

artemin 

artemin 

hypothetical protein XP.098151 (leucine- 
hypothetical protein XP_098151 (leucine- 
plasminogen activator, urokinase 
desmocollin 2 
desmocollin 2 
cryptic gene 
ESTs 

gb:Human nonspecific crossreacfing anOg 
gb:Human nonspecific crossreacOng antig 
gb^uman nonspecific cross reacting antig 
type I transmembrane protein Fn14 
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Seq ID Na 632 4633 
Seq ID No: 634 & 635 
Seq ID Na 536 4 637 
Seq ID Na 638 4 639 
Seq ID Na 640 & 641 
Seq ID Na 642 4 643 
Seq ID Ha 644 & 645 
Seq ID Nix 646 & 647 
Seq ID Na 648 & 649 
Seq ID Ncs 650 & 651 
Seq ID N<x 652 & 653 
Seq ID No: 654 & 655 
Seq ID Na 656 & 657 
Seq ID No: 658 & 659 
Seq ID Na 660 & 661 
Seq ID Na 652 & 663 
SeqIDNa 664 & 665 
Seq ID Na 666 & 667 
Seq ID Na66B 4 669 
Seq ID Na 670 & 671 
Seq ID Na 672 & 673 
Seq ID Na674 &675 
Seq ID Na 676 & 677 
Seq ID No: 678 4 679 
Seq ID Na680 &681 
Seq ID Na 682 4 683 
Seq ID Na 684 4 685 
Seq ID No: 686 & 687 
Seq ID Na 688 & 689 

TABLE 15B 

Pkey: Unique Eos probesel identifier number 
CAT" number Gene duster number 
Accession: Genbank accession numbers 
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429597 


NMJJ03816 


KS2442 


422109 


S73265 


Hs.1473 


419235 


AW470411 


Hs^88433 


449048 


Z45051 


Hs-22920 


419216 


AU076718 


Hs. 164021 


431462 


AW583672 


Hs.256311 


448243 


AW369771 


Hs.52620 


426427 


M86699 


.- Hs.169840 


445537 


" AJ245671 


Hs.12844 


422278 


AF072873 


Hs.1 14218 


428450 


NM_014791 


Hs.184339 


446619 


AU076643 


HU13 


453392 


U23752 


Hs.32964 


426514 


BE616633 


Hs.170195 


425776 


U25128 


Hs.159499 


425776 


U25128 


Hs.159499 


431515 


NMJJ12152 


Hs.258583 


419452 


U33635 


Hs.90572 


432653 


N62096 


Hs.293185 


432653 


N62096 _ 


Hs^93185 


432653 




HSJ293185 


432653 


N62096 


Hs.293185 


410001 


AB041036 


Hs.57771 


426501 


AWD43782 


Hs.293616 


408369 


R38438 


Hs.182575 


445413 


AA151342 


Hs.12677 


422424 


AI186431 


Hs.295638 


428330 


122524 


Hs.2256 


420610 


AI663183 


Hs.99348 



a tfisintegrin and mstaBoproteinase doma 
gasbuHeteasing peptide 
neurofairrdn 

similar to S68401 (cattle) glucose indue 
small inducible cytokine subfamily B {Cy 
granin-Oke neuroendocrine pepfide precu 
integrin, beta 8 
TTK protein kinase 
EGF-IBte-domain, multiple 6 
frizzled (DrosophOa) homolog 6 
KtAA0175gene product 
secreted phosphoprotein 1 (osteopontin, 
SRV (sex determining region Y^bc* 11 
bone morphogenetic protein 7 (osteogenic 
parathyroid hormone receptor 2 
parathyroid hormone receptor 2 
endothelial differentiation, tysophospha 
PTTC7 protein tyrosine kinase 7 
ESTs, Weakly similar to JC7328 amino aci 
ESTs, WeaMy similar to JC7328 amino aci 
ESTs, WeaMy similar to JC7328 amino aci 
ESTs, Weakly similar to JC732B amino aci 
kalfikreinll 
ESTs 

solute carrier family 1 5 (H??? transport 
CGM47 protein 
prostate differentiation factor 
matrix metalloproteinase 7 (mabitysin, 
distal-less homeo box 5 



Pkey 
309931 
330493 
439285 

450375 



CAT Number Accession 
AW341683 
33264,5 
47065J 



M27826 R7B41 6 AA307645 AW957879 AW957800 AA533529 H03652 ^..^^..i^im, 
AL133916 N79113 AF086101 N76721 AW950828 AA364013 AW955684 AI346341 A1867454 N547B4 AI655270 A1421279 AW014882 
M775552 N62351 N59253AA626243AI341407BE175639AA456968A1358918AA457077 
83327 1 AA009647 AA131254 AA374293 AW954405 H04410 AW606284 AA151166 BE157467 BE157601 H04384 W4S291 AW663674 H04021 H01532 

M190993 H03231 H596O5H01642AA852876M113758M62W15M746952AI161014M0S95M Bnrt «« 4 , M «« 
451320 86576 1 AW118072 AI631982 T15734 AA224195 AI701458 W20198 F26326 AA890570 N90552 AW071907 AI671352 AI375892 T03517^8265 

451320 .86676.1 S^^SbI^BW AI0B4316 AI354G86 T33652 AI140719 A172021 1 TO3490 AI372637 T15415 AW20S83B AAfiaH384T03515 733230 

AA017131 AA443303 T33623 AI222556 733511 T33785 AI419606 D55612 



TABLE 15C 

Ret* Su^mmS!^Srm nunita to IhSumn are Genbank Identifier (GO numbers. "Dunham I et at" refers to the publication enutled "The DNA 

sequence of human chromosome 22/ Dunham L et aL, Nature (1 999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposition: indicates nucleotide positions of predicted axons. 

pkey Ref Strand NLposifion 

402075 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 

403329 851612) Plus 96450-96598 

403478 9958258 Plus . 116458-116564 

404440 7528051 Plus 80430-81581 

404877 .1519284 Plus . 1095-2107 

405770 2735037 Plus 61057-62075 

405932 . 7767812 Minus 123525-123713 
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Table 16 

Seq ID NO: 1- DNA sequence 

Nucleic Acid Accession S: NM_00121S 

Coding sequence: 43'. .1422- 
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GCCCGTACAC 
AGCCCCTGGC 
CTGTCACTGC 



AGTGAAGAGG 
GAGGATCTAC 
TCCCTGAAGT 
AATAATGCCC 
CCGCCCTGGC 
CGCCCCCAGC 
CTCCCGCCGC 
CCTCCTGGGC 
CTGCACTGGG 
CCTGCCGAGA 
GGGCGCCCGG 
AGTGCCTATG 
CAGGTCCCAG 
TATGAGGGGT 
CAGACAGTGA 
GGTGACTCTC 
GAGGCCTCCT 
AATTCCTGCC 
ACCAGCGTCG 
GTGAGCTACC 
TGTGAGAAGC 
ATGCCACTTC 



11 

I- 

ACCGTGTGCT 
TCCCTCTGTT 
TGCTTCTGAT 
GAGGAGGCTC 
ATTCACCCAG 
CTGGAGAGGA 
TAGAGGATCT 
ACAGGGACAA 
CCCGGGTGTC 
TCGCCGCCTT 
TCCCAGAACT 
TAGAGATGGC 
GGGCTGCAGG 
TCCACGTGGT 
GAGGCCTGGC 
AGCAGTTGCT 
GACTGGACAT 
CTCTGACTAC 
TGCTGAGTGC 
GGCTACAGCT 
TCCCTGCTGG 
TGGCTGCTGG 
CGTTCCTTGT 
GCCCAGCAGA 
CAGCCAGAGG 
CTTTTAACTG 




TCGTCCGGGC 
TCACCTCAGC 
CGTGTTGGCC 
GTCTCGCTTG 
ATCTGCACTC 
ACCGCCCTGT 
TAAGCAGCTC 
GAACTTCCGA 
AGTGGACAGC 
TGACATCCTA 
GCAGATGAGA 
GGTAGCCGAG 
CATCTGAGGG 
CCAAGAAATT 



31 
I 

ACAGTCAGCC 
CCTGCTCCAG 
CCCCAGAGGT 
GATGACCCAC 
CCACCCGGAG 
GAAGTTAAGC 
GAGGCTCCTG 
GACCAGAGTC 
GCGGGCCGCT 
CTGCGCCCCC 
AACAATGGCC 
GGGCGGGAGT 
TOGGAGCACA 
ACCGCCTTTG 
GCCTTTCTGG 
GAAGAAATCG 
CTGCCCTCTG 
GCCCAGGGTG 

cacaccctct 
gcgacgcagc 
agtcctcggg 

GCCCTGGTTT 
AGGCAGCACA 
ACTGGAGCCT 
GGAGCCGGTA 
TTTTAAAATA 



41 
I 

GCATGGCTCC 
GCCTCACTGT 
TGCCCOGGAT 
TGGGCGAGGA 
AGGAGGATCT 
CTAAATCAGA 
GAGATCCTCA 
ATTGGCGCTA 
TCCAGTCCCC 
TGGAACTCCT 
ACAGTGTGCA 
ACCGGGCTCT 
CTGTGGAAGG 
CCAGAGTTGA 
AGGAGGGCCC 
CTGAGGAAGG 
ACTTCAGCCG 
TCATCTOGAC 
CTGACACCCT 
CTTTGAATGG 
CTGCTGAGCC 
TTGGCCTCCT 
GAAGGGGAAC 
AGAGGCTGGA 
ACTGTCCTGT 
AATATTTATA 



51 
I 

CCTGTGCCCC 

GCAACTGCTG 

GCAGGAGGAT 

GGATCTGCCC 

ACCTGGAGAG 

AGAAGAGGGC 

AGAACCCCAG 

TGGAGGCGAC 

GGTGGATATC 

GGGCTTCCAG ' 

ACTGACCCTG 

GCAGCTGCAT 

CCACCGTTTC 

CGAGGCCTTG 

GGAAGAAAAC 

CTCAGAGACT 

CTACTTCCAA 

TGTGTTTAAC 

GTGGGGACCT 

GCGAGTGATT 

AGTCCAGCTG 

TTTTGCTGTC 

CAAAGGGGGT 

TCTTGGAGAA 

CCTGCTCATT 

AT 



Seq ID NO: 2 Protein sequence: 
Protein Accession #: NP_001207 



1 
I 

MAPLCPSPWL 



11 
I 

PLUPAPAPG 



DPQEPQNNAH 
ELLGFQLPPL 
VEGHRFPAEI 
EEGSETQVPG 
DTLWGPGDSR 
GLLFAVTSVA 



RDKEGDDQSH 
PELRLRNNGH 
HWHLSTAFA 
LDISALLPSD 
liQLNFRATQP 
FLVQMRRQHR 



21 

I. 

LTVQLLLSLL 
EDLPGEEDLP 
WRYGGDPPWP 
SVQLTLPPGL 
RVDEALGRPG 
FSRYFQYEGS 
LNGRVIEASP 
RGTKGGVSYR 



LLMPVHPQRL 



41 

I 

PRMQEDSPLG 



51 
I 



RVSPACAGRF 
EMALGPGREY 
GLAVLAAPLE 
LTTPPCAQGV 
PAGVDSSPRA 
PAEVAETGA 



Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession #: BC013923 

Coding sequence: 438-1391 



AGCGGGGTTG 
GTGTTTGCAA 
AGAGGAGAGA 
TAATAATAAC 
TGATCCTGAT 
TCCTCGCGGA 
CCCGCGGGCC 
CCGCGCACAG 
AGCAAACTTC 
AAAACAGCCC 
AGCGGCGCAA 
TGGGCGCCGA 
AGCGGCTGCG 
AAACCAAGAC 
GCGGCAATAG 
AGCGCATGGA 
AGGACCAGCT 
AGCCCATGCA 
CCTACATGAA 
TGGCTCTTGG 
TTACCTCTTC 
GCATGTATCT 
CCCAGCACTA 
CACACATGTG 
GGAAATGGGA 
TCAAAAAAAA 
AACACCAATC 
GAGAGATCCT 
GGGGAGGGCG 
TTTAAAAGTT 
TAATATTTAG 
TTGTACAGTA 



11 

I ' 

TCTATTAACT 
AAGGGGGAAA 
GAAAGAAAGG 
AATCATCGGC 
TCCAGTTTGC 
GCCCTGCGCT 
CCCCAAAGTC 
OGCCCGCATG 
GGGGGGCGGC 
GGACCGCGTC 
GATGGCCCAG 
GTGGAAACTT 
AGCGCTGCAC 
GCTCATGAAG 
CATGGCGAGC 
CAGTTACGCG 
GGGCTACCCG 
CCGCTACGAC 
CGGCTCGCCC 
CTCCATGGGT 
CTCCCACTCC 
CCCCGGCGCC 
CCAGAGCGGC 
AGGGCCGGAC 
GGGGTGCAAA 
AAAAAAAAAA 
CCATCCACAC 
GGACTTCTTT 
GGGGAATGGA 
CTAGTGGTAC 
AGCTAGTCTC 
TTTATOGAGA 



21 
I 

TGTTCAAAAA 
GTAGTTTGCT 
GAGAGAAGTT 
GGCGGCAGGA 
CTCTCTCTTT 
CCCGACACCC 
CCGGCCGGGC 
TACAACATGA 
GGCGGCAACT 
AAGCGGCCCA 
GAGAACCCCA 
TTGTCGGAGA 
ATGAAGGAGC 
AAGGATAAGT 
GGGGTCGGGG 
CACATGAACG 
CAGCACCCGG 
GTGAGCGCCC 
ACCTACAGCA 
TCGGTGGTCA 
AGGGCGCCCT 
GAGGTGCCGG 
CCGGTGCCCG 
AGCGAACTGG 
AGAGGAGAGT 
AAAATCCCAT 
TCACGCAAAA 
TKGGGGGACT 
CCTTGTATAG 
GGTAGGAGCT 
CAAGCGACGA 
TAAACATGGC 



31 

I 

GTATCAGGAG 
GCCTCTTTAA 
TGAGCCCCAG 
TCGGCCAGAG 
TTTTCCCCCA 
CCGCCCGCCT 
CGAGGGTCGG 
TGGAGACGGA 
CCACCGCGGC 
TGAATGCCTT 
AGATGCACAA 
CGGAGAAGOG 
ACCOGGATTA 
ACACGCTGCC 
TGGGCGCCGG 
GCTGGAGCAA 
GCCTCAATGC 
TGCAGTACAA 
TGTCCTACTC 
AGTCCGAGGC 
GCCAGGCCGG 
AACCCGCCGC 
GCACGGCCAT 
AGGGGGGAGA 
AAGAAACAGC 
CACCCACAGC 
ACCGCGATGC 
ATTTTTGTAC 
ATCTGGAGGA 
TTGCAGGAAG 
AAAAAATGTT 
AATCAAAATG 



QSPVDIRPQL 
RALQLHLHWG 
EGPEENSAYE 
IWTVFNQTVM 
AEPVQLNSCL 



41 

I 

TTGTCAAGGC 
GACTAGGACT 
GCTTAAGCCT 
GAGGAGGGAA 
AATTATTCTT 
CCCCTCCTCC 
CGGCCGCCGG 
GCTGAAGCCG 
GGCGGCCGGC 
CATGGTGTGG 
CTCGGAGATC 
GCCGTTCATC 
TAAATACOGG 
CGGCGGGCTG 
CCTGGGCGCG 
CGGCAGCTAC 
GCACGGCGCA 
CTCCATGACC 
GCAGCAGGGC 
CAGCTCCAGC 
GGACCTCCGG 
CCCCAGCAGA 
TAACGGCACA 
AATTTTCAAA 
ATGGAGAAAA 
AAATGACAGC 
CGACAAGAAA 
AGAGAAAACC 
AAGAAAGCTA 
TTTGCAAAAG 
TTAATATTTG 
TCCATTGTTT 



EDLPTVEAPG 
AAFCPALRPL 
AAGRPGSEHT 
QLLSRLEEIA 
LSAKQLHTLS 
AAGDILALVF 



51 
I 

AGAGAAGAGA 
GAGAGAAAGA 
TTCCAAAAAA 
GCGCTTTTTT 
CGCCTGATTT 
TCTCCCCCCG 
CGGGCCGGGC 
CCGGGCCCGC 
GGCAACCAGA 
TCCCGCGGGC 
AGCAAGCGCC 
GACGAGGCTA 
CCCCGGCSGA 
CTGGCCCCCG 
GGCGTGAACC 
AGCATGATGC 
GCGCAGATGC 
AGCTCGCAGA 
ACCCCTGGCA 
CCCCCTGTGG 
GACATGATCA 
CTTCACATGT 
CTGCCCCTCT 
GAAAAACGAG 
CCCGGTACGC 
TGCAAAAGAG 
ACTTTTATGA 
TGGGGAGGGT 
CGAAAAACTT 
TCTTTACCAA 
CAAGCAACTT 
ATAAGCTGAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



189 



WO 02/086443 



PCTAJS02/12476 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 

85 



AATTTGCCAA 
TTAGGACAGT 
TAAAAATTGT 
GTTTAAAAAG 
AAATGGCCAT 
TTCCATTTTG 
GTTTGTAATA 
GTAGTTGTAT 
GTATATATTT 
CCATTATGCA 
AAAAAAACAA 
CACAACACAA 
CCACAACACA 



TATTTTTCAA 
TGCAAACGTG 
ACAAAAGGAA 
GGCAAAAGTT 
GCAGGTTGAC 
TTCAGATAAA 
TTTCTGTAAA 
TTTAAAAGAT 
GAACTAATAT 
CAGTTTGAGA 
AACAAAAAAA 
AAACAAAAAA 
AACAACAACA 



GGAGAGGCTT 
AAAAGAAGAA 
AAAATTAGAA 
TTAGACTGTA 
ACCGTTGGTA 
AAAAACCATG 
TTTATTGTGA 
TCGGCTCTGT 
CATCCTTATA 
TAAATAAATT 
CAAAAAACAA 
AAAAAAAAGA 
CACAGAGGG 



CTTGCTGAAT 
AATTATTCAA 
TAAGTACTGG 
CTAAATTTTA 
ATTTATAATA 
AAATTACTGT 
TATTTTAAGG 
ATTATTTGAA 
ACAGGTACAT 
TTTGAAATAT 
AAACAGAAAA 
AACAAACACA 



TTTGATTCTG 
ATTTGGACAT 
CGAACCATCT 
TAACTTACTG 
GCTTTTGTTC 
GTTTGAAATA 
TTTTCCCCCC 
TCAGTCTGCC 
TTTCAACTTA 
GGACACTGAA 
AACAAAAAAA 
CAACACAACA 



CAGCTGAAAT 
TTTAATTGTT 
CTGTGGTCTT 
TTAAAAGCAA 
GATCCCAACT 
TTTTCTTATG 
TTTATTTTCC 
CAGAATCCAT 
AGTTTTTACT 
AAAAAAAAAA 
AAAACAAAAC 
CAACACAAAA 



Seg ID NO: 4 Protein sequence: 
Protein Accession ft:CAAB3435.1" 

21 



51 



1 * 11 21 31 41 

I I.I I I v 

MYNMMETELK PPGPQQTSGG GGGNSTAAAA GGNQKNSPDR VKRPMNAFMV WSRGQRRXMA 
QENPKMHNSE ISKRLGAEWK LLSETEKRPF IDEAKRLRAL HMKEHPDYKY RPRRKTKTLM 
KXDKYTLPGG IiIAPGGNSMA SGVGVGAGLG AGVNQRMDSY AHMNGWSNGS YSMKQDQLGY 
PQHPGLNAHG AAQMQPMHRY DVSALQYNSM TSSQTYMNGS PTYSMSYSQQ GTPGMALGSM 
GSWKSEASS SPPWTSSSH SRAPCQAGDL RDMISMYLPG AEVPEPAAPS RLHMSQHYQS 
GPVPGTAING TLPLSHM 

Seq ID NO: 5 DNA sequence 
Nucleic Acid Accession # : U9161B 
Coding sequence: 29-541 



CGGACTTGGC 
CATGCTACTC 
AGCATTAGAA 
TCCCTCTTGG 
AGCTGAGGAA 
TGCTTTAGAT 
TCACAGCAGG 
TGACAAAAAT 
GCTGTATGAG 
AGAGAATAAA 
ATTATATTTG 
ATTGAATGTG 
TCTTCAAAAA 



11 
! 

TTGTTAGAAG 
CTGGCTTTCA 
GCAGATTTCT 
AAGATGACTC 
ACAGGAGAAG 
GGCTTTAGCT 
GCTTTTCAAC 
GGAAAGGAAG 
AATAAACCCA 
TCATTTATTT 
TGTGAAAATG 
TTTTTCTGCA 
AAAAAAAAAA 



21 
I 

GCTGAAAGAT 
GCTCCTGGAG 
TGACCAATAT 
TGCTAAATGT 
TTCATGAAGA 
TGGAAGCAAT 
ACTGGGAGTT 
AAGTCATAAA 
GAAGACCCTA 
ACATGTGATT 
TGACAAACAC 
CTAATAGAAA 
AAATGGGGCC 



31 

f 

GATGGCAGGA 
TCTGTGCTCA 
GCATACATCA 
TTGCAGTCTT 
GGAGCTTGTT 
GTTGACAATA 
AATCCAGGAA 
GAGAAAAATT 
CATACTCAAA 
GTGATTCATC 
ACTTATCTGT 
TTAGACTAAG 
GCAATT 



41 
I 

ATGAAAATCC 
GATTCAGAAG 
AAGATTAGTA 
GTAAATAATT 
GCAAGAAGGA 
TACCAGCTCC 
GATATTCTTG 
CCTTATATTC 
AGAGATTCTT 
ATCCCTTAAT 
CTCTTCTACA 
TGTTTTCAAA 



; S1 
I 

AGCTTGTATG 
AGGAAATGAA 
AAGCACATGT 
TGAACAGCCC 
AACTTCCTAC 
ACAAAATCTG 
ATACTGGAAA 
TGAAACGGCA 
ACTATTACTG 
TAAATATCAA 
ATTGTGGTTT 
TAAATCTAAA 



Seq ID NO: 6 Protein sequence: 
Protein Accession #: AAB50564 

11 



21 31 41 51 

I I I I I I 

MMAGMKIQLV CMLLIAFSSW SLCSDSEEEM KALBADFLTN MHTSKISKAH VPSWKMTLLN 
VCSLVNNLNS PAEETGEVHE EELVARRKLP TALDGFSLEA MLTIYQLHKI CHSRAFQHWE 
LIQEDILDTG NDKNGKEEVI KRKIPYILKR QLYENKPRRP YILKRDSYYY 

Seq ID NO: 7 DNA sequence 

Nucleic Acid Accession ft: NM_006536.2 

Coding sequence: 109-2940 



ACCTAAAACC 
ATGTATGCAG 
. AGCATTGCAG 
GAACTCCCAT 
ATTGCAATTA 
ATAACTGAAG 
ATAAAGATTT 
TCATATGAAA 
TACACCCTAC 
TTCCTACTGA 
GAATGGGCCC 
ATAAATGGGC 
GTGTGTGAAA' 
GGATGCACCT 
AGTTTATCTT 
CTACAGAACC 
TTTCACCACA 
GTACAGGCTG 
GCTGACAGAC 
ATTCATACCT 
CACCAAATTA 
TCAGCTAAAA 
AAACTGAATG 
CTTCTTGGCA 




31 
I 

CATCTGCATC 
TGGAGGCTTC 
TTTGTGACTC 
CTTCAAGACA 
CAGAACCTCA 
GCTACCAAGA 
GCTAATAATA 
GACTGGTATG 
GAGGGAAAAT 
TACGGATCAC 
GATGAGTATA 
AGGTGTTCAT 
AACTGTATTA 
AATGCAACTG 
GCAAGTACCC 
GCATGGGATG 
GAGCTTCCAC 
GTGCTGGATG 
GCAGAATTTT 
GACAGCAAAG 
TTGCTGGTTT 
GGGCTTAAGA 
ATGATATTAG 
AGCAGTGGTT 



41 

I 

CATATTGAAA 
TCTACAACAT 
TCCTGGTTGC 
ATGGGTATAA 
TCTCAAACAT 
GAAGAGTATT 
ACAGCAAAAT 
GGGCACATGG 
ACATTCATTT 
GAGGCCGAGT 
ACAATGACAA 
CTGACATCAC 
TTAGTAAGCT 
CATCAATAAT 
ACAACCAAGA 
TAATCACAGA 
CTCCTCCCAC 
TGTCCAGCAA 
ATTTGATGCA 
GAGAGATCAG 
CATATCTGCC 
AAGGATTTGA 
TGACCAGCGG 
CAACAATTCA 



51 
I 

ACCTGACACA 
GACCCAAAGG 
CTTAAGTTCA 
TGGATTGCTC 
TAAGGAAATG 
TTTCAGAAAT 
AAAACAAGAA 
AGATGATCCA 
CACACCTAAT 
GTTTGTCCAT 
ACCTTTCTAC 
AGGCATTTTT 
TTTTAAAGAA 
GTTCATGCAA 
AGCACCAAAC 
CTCTGCTGAC 
ATTCTCGCTT 
GATGGCAGAG 
GATTGTTGAA 
AGCCCAGCTA 
CACCACTGTA 
GGTGGTTGAA 
AGATGATAAG 
CTCCATTGCC 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



190 



WO 02/086443 



PCT/US02/12476 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 



CTGGGTTCAT 
TTCTTTGTTC 
TCTGGAACTG 
AAACCTCACC 
ATGTTTCTAG 
GGAOGAAAAT 
TGGATTCCAG 
TCTCTGCAAG 
GCCACTGTGG 
TATGCCAATG 
GAGCCAGAGA 
GTTATAAAAA 
TATAGCTTGA 
CCAGGGAGTC 
GCTCCAAGGA 
AGCTCAGGAG 
CCACCATGCA 
TGGACAGCAC 
AGTAAAAGTC 
AAGCGAAATC 
ACGAATGGAC 
GCAATACGAG 
CCTCTGTTTA 
GGAG.TTTTAA 
CATACTTTAA 
ATAAATATCC 
CATACTAACA 
ATACAGATAA 
CCTTACACTT 
GCAAAGGGAA 
AATAGCCCCA 
TCATTTAGTT 
TTTACATGAA 
CTTGCTATTT 
TTTCACTGTA 
TTTATGACAA 
TTTCTAAGTT 
TACCTAGGAA 



CTGCAGCCCC 
CAGATATATC 
GAGACATTTT 
ATCAATTGAA 
TTACGTGGCA 
ACTACACAAA 
GAACAGCTAA 
CCCTGAAAGT 
AAGCCTTTGT 
TGAAACAGGG 
CTGGAGATCC 
ATGATGGAAT 
AAGTGCATGT 
ATGCTATGTA 
AATCAGTAGG 
GCTCCTTTTC 
AAATTATTGA 
CTGGAGAAGA 
TACAGAATAT 
CTCAGCAAGC 
CTGAACATCA 
CAATGGATAG 
TTCCCCCCAA 
CAGCAATGGG 
GCAGGAAAAA 
AAAGTGTCTT 
AAGTCAAATT 
GATTTTTACA 
TGGCTATGAA 
GGGTAAAGTC 
AGCAGAGAAA 
ACTTTGATTA 
GATCATGCTA 
TGTTATATAT 
AGAGGTAACC 
AGGTCTATTG 
TATTGCCTTG 
A 



AAATCTGGAG 
AAACTCCAAT 
CCAGCAACAT 
AAACACAGTG 
GGCCAGTGGT 
TAATTTTATC 
GCCTGGGCAC 
GACAGTGACC 
GGAAAGAGAC 
ATTTTATCCC 
TGTTACGCTG 
TTACTCGAGG 
CAATCACTCT 
TGTACCAGGT 
CAGAAATGAG 
AGTGCTGGGA 
CCTGGAAGCT 
CTTTGATCAG 
CCAAGATGAC 
TGGCATCAGG 
GCCAAATGGA 
GAACTCCTTA 
TTCTGATCCT 
TTTGATAGGA 
GAGAGCAGAC 
CCTTCTTAGA 
AACATCAAAA 
TGGTAGATCA 
CAAATAATAA 
GGACCAGTGT 
AGGAGGGTAG 
ATTTTTCTTT 
TATTTTATAT 
ATTTCAGATG 
TTTAACAATA 
AATTTATTTG 
GGTTATTATG 



GAATTATCAC 
AGCATGATTG 
ATTCAGCTTG 
ACTGTGGATA 
CCTCCTGAGA 
ACCAATCTAA 
TGGACTTACA 
TCTCGCGCCT 
AGCCTCCATT 
ATTCTTAATG 
AGACTCCTTG 
TATTTTTTCT 
CCCAGCATAA 
TACACAGCAA 
GAGGAGCGAA 
GTTCCAGCTG 
GTAAAAGTAG 
GGCCAGGCTA 
TTTAACAATG 
GAGATATTTA 
GAAACACATG 
CAGTCTGCTG 
GTACCTGCCA 
ATCATTTGCC 
AAGAAAGAGA 
TATAAGACCC 
CTGTATTAAA 
ACAATTCTTT 
AAATTATTCT 
CAAGGAAAGT 
GTCTGCATTA 
TCTCCTTATC 
ATGTAGCCCC 
ACATCTCCCT 
TGGGTATTAC 
TNTGTAAGTT 
GAATGATAGT 



GTCTTACAGG 
ATGCTTTCAG 
AAAGTACAGG 
ATACTGTGGG 
TTATATTATT 
CTTTTCGGAC 
CCCTGAACAA 
CCAACTCAGC 
TTCCTCATCC 
CCACTGTCAC 
ATGATGGAGC 
CCTTTGCTGC 
GCACCCCAAC 
ACGGTAATAT 
AGTGGGGCTT 
GOCCCCACCC 
AAGAGGAATT 
CAAGCTATGA 
CTATTTTAGT 
OGTTCTCACC 
AAAGCCACAG 
TATCTAACAT 
GAGATTATCT 
TTATTATAGT 
ATGGAACAAA 
ATGGCCTTCG 
ATGCATTGAG 
TTGGGGGTAG 
TTAAAGTAAT 
TTGTTTTATT 
TAACTGTCTG 
TGTGCAGTAC 
TAATGCAAAG 
GCTAATGCTC 
CTTTGTCTCT 
TCTACTCCCA 
TATAGCCCCN 



AGGTTTAAAG 
TAGAATTTCC 
TGAAAATGTC 
CAACGACACT 
TGATCCTGAT 
AGCTAGTCTT 
TACCCATCAT 
TGTGCCCGCA 
TGTGATGATT 
TGCCACAGTT 
AGGTGCTGAT 
AAATGGTAGA 
CCACTCTATT 
TCAGATGAAT 
TAGCCGAGTC 
TGATGTGTTT 
GACCCTATCT 
AATAAGAATG 
AAATACATCA 
CCAGATTTCC 
AATTTATGTT 
TGCCCAGGCG 
TATATTGAAA 
TGTGACACAT 
ATTATTATAA 
ACTACAAAAA 
TTrTTGTACA 
ATTAGAAAAC 
GTCTTTAAAG 
GAGGTGGAAA 
TGTGAAGCAA 
AGGTTGCTTG 
CTCTTTACCT 
AGAGATCTTT 
TCATACCGGT 
TCAAAGCAGC 
TATAATGCCT 



Seq ID NO: 8 Protein sequence: 
Protein Accession #: NP_0 06527.1 



MTQRSIAGPI 
IKEMITEASP 
GDDPYTLQYR 
KPFYINGQNQ 
MFMQSLSSW 
TFSLVQAGDK 
RAQLHQINSN 
GDDKLLGNCL 
SRISSGTGDI 
FDPDGRKYYT 
AVPPATVEAP 
AGADVIKNDG 
IQMNAPRKSV 
LTLSWTAPGE 
PQISTNGPEH 
LILKGVLTAM 



11 
1 

CNLKFVTLLV 
YLFNATKRRV 
GCGKEGKYIH 
IKVTRCSSDI 
EFCNASTHNQ 
WCLVLDVSS 
DDRKLLVSYL 
PTVLSSGSTI 
FQQHIQLEST 
NNFITNLTFR 
VERDSLHFPH 
IYSRYFFSFA 
GRNEEERKWG 
DFDQGQATSY 



GLIGIICLII 



21 
I 

ALSSELPPLG 
FPRNIKILIP 
PTPNFLLNDN 
TGIFVCEKGP 
EAPNLQNQMC 
KMAEADRLLQ 
PTTVSAKTDI 
HSXA^GSSAA 
GENVKPHHQL 
TASLWIPGTA 
PVMIYANVKQ 
ANGRYSLKVH 
FSRVSSGGSP 
EIRMSKSLQN 
RIYVAIRAMD 
WTHHTLSRK 



31 

I 

AGVQLQDNGY 
ATWKANNNSK 
LTAGYGSRGR 
CPQENCIISK 
SLRSAWDVIT 
LQQAAEFYLM 
SICSGLKKGP 
PNLEELSRLT 
KNTVTVDNTV 
KPGHWTYTLN 
GFYPILNATV 
VNHSPSISTP 
SVLGVPAGPH 
IQDDFKNAIL 



KRADKKENGT 



41 
I 

NGLLIAINPQ 
IKQESYEKAN 
VFVHEWAHLR 
LFKEGCTFIY 
DSADPHHSFP 
QIVBIHTFVG 
EWEKLNGKA 
GGLKPFVPDI 
GNDTMPLVTW 
NTHHSLQAIiK 
TATVEPETGD 
AHSIPGSHAM 
PDVPPPCKII 
VNTSKRNPQQ 
IAQAPLFIPP 
KLL 



51 
I 

VPENQNTjISN 
VIVTDWYGAH 
WGVFDEYNND 
NSTQNATASI 
MNGTELPPPP 
IASFD5KGEI 
YGSVMILVTS 
SNSNSMIDAF 
QASGPPEIIZ. 
VTVTSRASNS 
PVTLRLLDDG 
YVPGYTANGN 
DLEAVKVEEE 
AGIREIFTPS 
NSDPVPARDY 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



65 
70 
75 
80 
85 



Seq ID NO i 9 DNA sequence 

Nucleic Acid Accession #s Eos sequence 

Coding sequence: 336-632 



CTCCCCTCAC 
CCTGGGTGGG 
GAGCTGGCAC 
CAGGGTTTGG 
CCAGTGGGGC 
GGGTCTGTCT 
CGCTGGCTGT 
AGCTGAGTAA 
AGAAAGTGGA 
AGCAGGTGGA 
ACTTCTTCCA 
TCTCTTGGGC 
TGTTGATAAT 
CTGGGAGATG 
TCTCCAAGGC 
ATTGGAAATC 
AAATACCA 



11 
I 

CCCGGTCCAG 
CTCAGGGGCT 
TCTCTGGGAG 
TGGGATCAGG 
CCACATATAA 
CTGCCACCTG 
GCTGGTCACT 
GGGGGAAATG 
TGAGGAGGGG 
CTTCCAGGAG 
GGGCTGCCCA 
CCAGGACTGT 
ATTTTAATTG 
AGGGCCTCCT 
CAGAGCTATG 
GAGATAGGTT 



21 

i 

GATGCCCAGT 
GCCCTTGACC 
GGAGGGGGCT 
TTGAGGCAGG 
ATCCTCACCC 
GTCTGCCACA 
ACCTTCCACA 
AAGGAACTTC 
CTGAAGAAGC 
TATGCTGTTT 
GACCGACCCT 
TGATGCCTTT 
CTCAGTGATG 
GGATCCTGCT 
CTTTAGGTCT 
GCTGACTTTT 



31 
I 

CCCCACGACA 
TGGCCTAGAG 
GGGAGGGAAT 
TTTGGTTTCC 
TGGGAGCCTG 
GATCCATGAT 
AGTACTCCTG 
TGCACAAGGA 
TGATGGGCAG 
TCCTGGCACT 
GAAGCAGAAC 
GAGTTTTGTA 
TTCCATAACC 
CCCTTCTGGG 
CAATTTTGGA 
ATTTTGTCAA 



41 
I 

CCTCCCACTT 
CCCTCCCCCA 
GAGTGGGAAT 
TTAAAATGCC 
GCTGCCTTGC 
GTGCAGTTCT 
CCAAGAGGGC 
GCTGCCCAGC 
CCTGGATGAG 
CATCACTGTC 
TCTTGACTTC 
TTCAATAAAC 
CGGCTGGCTC 
CTCTGACTCT 
ATTTCAAACA 
ATAAAGATAT 



51 
I 

CCCACTGTGG 
GCTGGTGGTG 
GGCAAGAGGC 
AAGTTGGGGG 
TCTCCTTCCT 
CTGGAGCAGG 
GACAAGTTCA 
TTTGTGGGGG 
AACAGTGACC 
ATGTGCAATG 
CTGCCATGGA 
TTTTTTTGTC 
AGCTGGAGTG 
CCTGGAAATC 
CCAGCAAAAA 
TAAAAAAGGC 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 



Seq ID NO: 10 Protein sequence: 
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Protein Accession #: NP_005969-1 

1 11 21 - 31 41 51 

I I I I I I 

J MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGEMKELLH KELPS FVGEK VDEEGLKKLM 60 

GSLDENSDQQ VDFQEYAVFL ALITVKCNDF FQGCPDRP : . 



Seq ID NO r 11 DNA sequence 
10 Nucleic Acid Accession fl : Eos sequence 
Coding sequence: 336-626 

1 11- 21 31 41 51 

i- I " I I I I I 

15 CTCCCCTCAC CCCGGTCCAG GATGCCCAGT CCCCACGACA CCTCCCACTT CCCACTGTGG 60 

CCTGGGTGGG CTCAGGGGCT GCCCTTGACC TGGCCTAGAG CCCTCCCCCA GCTGGTGGTG 120 

GAGCTGGCAC TCTCTGGGAG GGAGGGGGCT- GGGAGGGAAT GAGTGGGAAT GGCAAGAGGC 180 

CAGGGTTTGG TGGGATCAGG TTGAGGCAGG TTTGGTTTCC TTAAAATGCC AAGTTGGGGG 240 

CCAGTGGGGC CCACATATAA ATCCTCACCC TGGGAGCCTG GCTGCCTTGC TCTCCTTCCT 300 

20 GGGTCTGTCT CTGCCACCTG GTCTGCCACA GATCCATGAT GTGCAGTTCT CTGGAGCAGG 360 

CGCTGGCTGT GCTGGTCACT ACCTTCCACA AGTACTCCTG CCAAGAGGGC GACAAGTTCA 420 

AGCTGAGTAA GGGGGAAATG AAGGAACTTC TGCACAAGGA GCTGCCCAGC TTTGTGGGGC 480 

ATTCCAGAGA ACCATGTGCT GTGAGGGCCT TCCGAGTCCA TCTGTTTAAT CCTGTCATTG 540 

GAGACTTGAG AAACCAGAGC CCAGAAGGGA AAAGTGATTG TCCCAAGATC ACACAGCACT 600 

25 GGAGAAAGTG GATGAGGAGG GGCTGAAGAA GCTGATGGGC AGCCTGGATG AGAACAGTGA 660 

CCAGCAGGTG GACTTCCAGG AGTATGCTGT TTTCCTGGCA CTCATCACTG TCATGTGCAA 720 

TGACTTCTTC CAGGGCTGCC CAGACCGACC CTGAAGCAGA ACTCTTGACT T CCTGCCAT G 780 

GATCTCTTGG GCCCAGGACT GTTGATGCCT TTGAGTTTTG TATTCAATAA ACTTTTTTTG 840 

TCTGTTGATA ATATTTTAAT TGCTCAGTGA TGTTCCATAA CCCGGCTGGC TCAGCTGGAG 900 

30 TGCTGGGAGA TGAGGGCCTC CTGGATCCTG CTCCCTTCTG GGCTCTGACT CTCCTGGAAA 960 

TCTCTCCAAG GCCAGAGCTA TGCTTTAGGT CTCAATTTTG GAATTTCAAA CACCAGCAAA 1020 

AAATTGGAAA TCGAGATAGG TTGCTGACTT TTATTTTGTC AAATAAAGAT ATTAAAAAAG 1080 
GCAAATACCA 

35 Seq ID NO: 12 Protein sequence: 

Protein Accession ft: Eos sequence 

1 11 21 31 41 51 

ja I I I 1.1, I 

40 MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGEMKELLH KELPSFVGHS REPCAVRAFR 60 
VHLFNPVIGD LRNQSPEGKS DCPKITQHWR KWMRRG 



Seq ID NO: 13 DNA sequence 
45 Nucleic Acid Accession ft: Eos sequence 
Coding sequence: S8-354 

1 11 21 31 41 51 

cn I 1 I I 1 I 

50 GTGAGCTCAC CATGTGGGGG TGAGGCTGAG AGAAAACAAG TACACAGCCA CAGATCCATG 60 
ATGTGCAGTT CTCTGGAGCA GGCGCTGGCT GTGCTGGTCA CTACCTTCCA CAAGTACTCC 120 
TGCCAAGAGG GCGACAAGTT CAAGCTGAGT AAGGGGGAAA TGAAGGAACT TCTGCACAAG 180 
GAGCTGCCCA GCTTTGTGGG GGAGAAAGTG GATGAGGAGG GGCTGAAGAA GCTGATGGGC 240 
AGCCTGGATG AGAACAGTGA CCAGCAGGTG GACTTCCAGG AGTATGCTGT TTTCCTGGCA 300 

55 CTCATCACTG TCATGTGCAA TGACTTCTTC CAGGGCTGCC CAGACCGACC CTGA AGCAG A 360 
ACTCTTGACT TCCTGCCATG GATCTCTTGG GCCCAGGACT GTTGATGCCT TTGAGTTTTG 420 
TATTCAATAA ACTTTTTTTG T C TGT TG ATA ATATTTTAAT TGCTCAGTGA TGTTCCATAA 480 
CCCGGCTGGC TCAGCTGGAG TGCTGGGAGA TGAGGGCCTC CTGGATCCTG CTCC CTTCT G 540 
GGCTCTGACT CTCCTGGAAA TCTCTCCAAG GCCAGAGCTA TGCTTTAGGT CTCAATTTTG 600 

60 GAATTTCAAA CACCAGCAAA AAATTGGAAA TCGAGATAGG TTGCTGACTT TTATTTTGTC 660 
AAATAAAGAT ATTAAAAAAG GCAAATACCA 



Seq ID NO: 14 Protein sequence: 
Protein Accession 8: NP_005969.1 
65 1 11 21 31 41 51 

I | I I i i 

MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGEMKELLH KELPSFVGEK VDEEGLKKLM 60 
GSLDENSDQQ VDFQEYAVFL ALITVMCNDF FQGCPDRP 



70 



Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 62-358 



75 1 11 21 31 41 51 

I I • I I I I 

GGAGGGTGTG CCGCTGAGTC ACTGCCTGGG CATCTGGGCC TGGAACCTCG GCCACAGATC 60 

CATGATGTGC AGTTCTCTGG AGCAGGCGCT GGCTGTGCTG GTCACTACCT TCCACAAGTA 120 

CTCCTGCCAA GAGGGCGACA AGTTCAAGCT GAGTAAGGGG GAAATGAAGG AACTTCTGCA 180 

80 CAAGGAGCTG CCCAGCTTTG TGGGGGAGAA AGTGGATGAG GAGGGGCTGA AGAAGCTGAT 240 

GGGCAGCCTG GATGAGAACA GTGACCAGCA- GGTGGACTTC CAGGAGTATG CTGTTTTCCT 300 

GGCACTCATC ACTGTCATGT GCAATGACTT CTTCCAGGGC TGCCCAGACC GACCCTGAAG 360 

CAGAACTCTT GACTTCCTGC CATGGATCTC TTGGGCCCAG GACTGTTGAT GCCTTTGAGT 420 

TTTGTATTCA ATAAACTTTT TTTGTCTGTT GATAATATTT TAATTGCTCA GTGATGTTCC 480 

85 ATAACCCGGC TGGCTCAGCT GGAGTGCTGG GAGATGAGGG CCTCCTGGAT CCTGCTCCCT 540 

TCTGGGCTCT GACTCTCCTG GAAATCTCTC CAAGGCCAGA GCTATGCTTT AGGTCTCAAT. 600 

TTTGGAATTT CAAACACCAG CAAAAAATTG GAAATCGAGA TAGGTTGCTG ACTTTTATTT 660 
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TGTCAAATAA AGATATTAAA AAAGGCAAAT ACCA 



Seq ID NO: 16 Protein sequence: 
Protein Accession §: NP_005969.1 

1 11 21 31 41 SI 

I I I I I I 

MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGSMKELIiH KELPS FVGEK VDEEGLKKLM 60 
GSLDENSDQQ VDFQEYAVFL ALITVMCNDP FQGCPDRP 



Seq ID NO: 17 DNA sequence 

Nucleic Acid Accession 8 : Eos sequence 

Coding sequence: 939-2372 

1 11 21 31 41 51 

I I I 1 I I 

AAGACGGATT CTCAGACAAG GCTTGCAAAT GCCCCGCAGC CATCATTTAA CTGCACCCGC 60 

AGAATAGTTA CGGTTTGTCA CCCGACCCTC CCGGATCGCC TAATTTGTCC CTAGTGAGAC 120 

CCCGAGGCTC TGCCCGCGCC TGGCTTCTTC GTAGCTGGAT GCATATCGTG CTCCGGGCAG 180 

CGCGGGCGCA GGGCACGCGT TCGCGCACAC CCTAGCACAC ATGAACACGC GCAAGAGCTG 240 

AACCAAGCAC GGTTTCCATT TCAAAAAGGG AGACAGCCTC TACCGCGATT GTAGAAGAGA 300 

CTGTGGTGTG AATTAGGGAC CGGGAGGCGT CGAACGGAGG AACGGTTCAT CTTAGAGACT 360 

AATTTTCTGG AGTTTCTGCC CCTGCTCTGC GTCAGCCCTC ACGTCACTTC GCCAGCAGTA 420 

GCAGAGGCGG CGGCGGCGGC TCCCGGAATT GGGTTGGAGC AGGAGCCTCG CTGGCTGCTT 480 

CGCTCGCGCT CTACGCGCTC AGTCCCCGGC GGTAGCAGGA GCCTGGACCC AGGCGCOGCC 540 

GGCGGGCGTG AGGCGCCGGA GCCCGGCCTC GAGGTGCATA CCGGACCCCC ATTCGCATCT 600 

AACAAGGAAT CTGCGCCCCA GAGAGTCCCO GGAGCGCCGC CGGTCGGTGC CCGGCGCGCC 660 

GGGCCATGCA GCGACGGCCG CCGCGGAGCT CCGAGCAGCG GTAGCGCCCC CCTGTAAAGC 720 

GGTTGGCTAT GCCGGGGCCA CTGTGAACCC TGCCGCCTGC CGGAACACTC TTOGCTCCGG 780 

ACCAGCTCAG CCTCTGATAA GCTGGACTCG GCACGCCCGC AACAAGCACC GAGGAGTTAA 840 

GAGAGCCGCA AGCGCAGGGA AGGCCTCCCC GCACGGGTGG GGGAAAGCGG CCGGTGCAGC 900 

GCGGGGACAG GCACTCGGGC TGGCACTGGC TGCTAGGGAT GTCGTCCTGG ATAAGGTGGC 960 

ATGGACCCGC CATGGCGOGG CTCTGGGGCT TCTGCTGGCT GGTTGTGGGC TTCTGGAGGG 1020 

CCGCTTTCGC CTGTCCCACG TCCTGCAAAT GCAGTGCCTC TCGGATCTGG TGCAGCGACC 1080 

CTrCTCCTGG CATCGTGGCA TTTCCGAGAT TGGAGCCTAA CAGTGTAGAT CCTGAGAACA 1140 

TCACCGAAAT TTTCATCGCA AACCAGAAAA GGTTAGAAAT CATCAACGAA GATGATGTTG 1200 

AAGCTTATGT GGGACTGAGA AATCTGACAA TTGTGGATTC TGGATTAAAA TTTGTGGCTC 1260 

ATAAAGCATT TCTGAAAAAC AGCAACCTGC AGCACATCAA TTTTACCCGA AACAAACTGA 1320 

CGAGTTTGTC TAGGAAACAT TTCCGTCACC TTGACTTGTC TGAACTGATC CTGGTGGGCA 1380 

ATCCATTTAC ATGCTCCTGT GACATTATGT GGATCAAGAC TCTCCAAGAG GCTAAATCCA 1440 

GTCCAGACAC TCAGGATTTG TACTGOCTGA ATGAAAGCAG CAAGAATATT CCCCTGGCAA 1500 

ACCTGCAGAT ACCCAATTGT GGTTTGCCAT CTGCAAATCT GGCCGCACCT AACCTCACTG 1560 

TGGAGGAAGG AAAGTCTATC ACATTATCCT GTAGTGTGGC AGGTGATCCG GTTCCTAATA 1620 

TGTATTGGGA TGTTGGTAAC CTGGTTTCCA AACATATGAA TGAAACAAGC CACACACAGG 1680 

GCrCCTTAAG GATAACTAAC ATTTCATCCG ATGACAGTGG GAAGCAGATC TCTTGTGTGG 1740 

CGGAAAATCT TGTAGGAGAA GATCAAGATT CTGTCAACCT CACTGTGCAT TTTGCACCAA 1800 

CTATCACATT TCTOGAATCT CCAACCTCAG ACCACCACTG GTGCATTCCA TTCACTGTGA 1860 

AAGGCAACCC CAAACCAGCG CTTCAGTGGT TCTATAACGG GGCAATATTG AATGAGTCCA 1920 

AATACATCTG TACTAAAATA CATGTTACCA ATCACACGGA GTACCACGGC TGCCTCCAGC 1980 

TGGATAATCC CACTCACATG AACAATGGGG ACTACACTCT AATAGCCAAG AATGAGTATG 2040 

GGAAGGATGA GAAACAGATT TCTGCTCACT TCATGGGCTG GCCTGGAATT GACGATGGTG 2100 

CAAACCCAAA TTATCCTGAT GTAATTTATG AAGATTATGG AACTGCAGCG AATGACATCG 2160 

GGGACACCAC GAACAGAAGT AATGAAATCC CTTCCACAGA CGTCACTGAT AAAACCGGTC 2220 

GGGAACATCT CTCGGTCTAT GCTGTGGTGG TGATTGCGTC TGTGGTGGGA TTTTGCCTTT 2280 

TGGTAATGCT GTTTCTGCTT AAGTTGGCAA GACACTCCAA GTTTGGCATG AAAGGTTTTG 2340 

TTTTGTTTCA TAAGATCCCA CTGGATGGGT AGCTGAAATA AAGGAAAAGA CAGAGAAAGG 2400 

GGCTGTGGTG CTTGTTGGTT GATGCTGCCA TGTAAGCTGG ACTCCTGGGA CT GCTG TTGG 2460 

CTTATCCCGG GAAGTGCTGC TTATCTGGGG TTTTCTGGTA GATGTGGGCG GTGTTTGGAG 2520 

GCTGTACTAT ATGAAGCCTG CATATACTGT GAGCTGTGAT TGGGGAACAC CAATGCAGAG 2580 

GTAACTCTCA GGCAGCTAAG CAGCACCTCA AGAAAACATG TTAAATTAAT GCTTCTCTTC 2640 

TTACAGTAGT TCAAATACAA AACTGAAATG AAATCCCATT GGATTGTACT TCTCTTCTGA 2700 

AAAGTGTGCT TTTTGACCCT ACTGGACATT TATTGACTTA ATTGCTTCTG TTTATTAAAA 2760 

TTGACCTGCA AAGTTAAAAA AAAATTAAAG TTGAGAACAG GTATAAGTGC ACACTGAATA 2820 

GTCTAATCTA CATGTAACAC ATATTTTAGT GTGATTTTCT ATACTCTAAT CAGCACTGAA 2880 

TTCAGAGGGT TTGACTTTTT CATCTATAAC ACAGTGACTA AAAGAGTTAA GGGTATATAT 2940 

ACCATCACTT TGGGACTTGG TAGTATTATT AAAAGGTTAT TTCCTTCACT GTCAATAAAA 3000 

GTCCAAATGT TTAGCTTAGG TCTGAGAGTC AAACAATGTT AAGGATTGTC TTAAAGTTCC 3060 

TTAGCCAGCA AAACAAAACA AAACAAAACA AACAAATGAA AAACGTTTAA AAAGAAGAAG 3120 

AAGAAAAAAA ACAAGAACAA GCAGCAACAG CTGTTTTGTT GGGGCTATAG ATTTAAGTTA 3180 

GGCATAGTCA ATTTCAGAAT AACTAAGAGT GGAATATATG CATATGGTGA AATTATAACC 3240 

TTGCCCTTTT TTATTTGCCC TCTGCGATCC ACCTGCTTTT TAGAAGTCTG CCGAGTGAGA 3300 

AGGCCACAGT ATCTCATGCT GTTTGCATTA CAGAACTGCA GCTTTTCTAC TCTGAAAAGG 3360 

CCTGGGAGCA GAATGGCTGG CCTGCTGTGA GCAGGAGAGG AGATTCTAAG AAGGATAGTC 3420 

CCCCCTACAA CATACTGTCA TACTGCTGGG TTTTCATGGG TAGGAAAGCT TGTCCTGACC 3480 

CCAGCAGCAA AGAGGTGGCA GGTCGCTAAT GAATATATGC TTTATAATGT CCTTCTTCAT 3540 

TGCTGAGAGG GCAGCCTTAG AGCTGTGGAT TTCTGCATCC CCCCTGAGTC TGACCCATGG 3600 

ACACCTGTTT CATTCACTTT AGCATCACAG TGACCTTTGT ATGCTCTGTT CAGTCTGTGT 3660 

CAGGCAGTAT GCTTGTCCTG AAGAGAGGTT TGGCTATCCC CACCCCACCC CACCCCACCC 3720 

TGTTCCTTTT TTATCAGGAG GACTTCAGAG CCAGGCCTGC AGCATTTTGT TTGAAAACAC 3780 

AATCAGCTCT GACAGTTAGA CATGCACACA GACGCCATAG CTGGATTGGA AACATTGATG 3840 

TTTTAAAAAT TTATTTTTTT TGGAAATAGT TGCACAAATG CTGCAATTTA GCTTTAAGGT 3900 

TCTATAGATT TTTAACTAGT CCAACACAGT CAGAAACATT GTTTTGAATC CTCTGTAAAC 3960 

CAAGGCATTA ATCTTAATAA ACCAGGATCC ATTTAGGTAC CACTTGATAT AAAAAGGATA 4020 

TCCATAATGA ATATTTTATA CTGCATCCTT TACATTAGCC ACTAAATACG TTATTGCTTG 4080 

ATGAAGACCT TTCACAGAAT CCTATGGATT GCAGCATTTC ACTTGGCTAC TTCATACCCA 4140 
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TGCCTTAAAQ AGGGGCAGTT TCTCAAAAGC AGAAACATGC 
TCCTAACTCC ATTTGAATGT AAGGGCAGCT GGCCCCCAAT 
TCTGAATTCC CATTTTCTTG TTCGCGGCTA AATGACAGTT 
GATCTTTCCC AAAGGTGTTG ATTTACAAAG AGGCCAGCTA 
GAAAGAGAGA TGAAATTCAA GCTGTGAGCC AGGCAGGAGC 
GAGAATCAGC CATTTGGTAC AAAAAAGATT TTTAAAGCTT 
ATAGAAAGGC TATGGATTGT TTAAGAACTA TTTTAAAGTG 
AATAAAAAAA AAGGAATATT TGTACCCAAC AGCTAGAAGG 
TTTAAAATGG AGAGAAGTGG ACAGATAAGG CCATTTAATA 
CTCCTAGGGA ATGATGAAAA CAGCAGGCTA T 

Seq ID NO: IB Protein sequence: 
Protein Accession #: CAAS3571 



PCT/US02/12476 



MSSWIRWHGP 
NSVDPENITB 
NPTRNKLTSL 
SKNIPLANLQ 
NBTSHTQGSL 
WCIPFTVKGN 
LIAKNEYGKD 
DVTDKTGREH 



XI 
I 

AMARLWGFCW 
IPIANQKRLE 
SRKHFRHIiDL 
IPNCGLPSAN 
RITNISSDDS 
PKPALQWFYN 
EKQISAHFMG 
LSVYAWVIA 



21 
I 

LWGFWRAAF 
IINBDDVEAY 
SELILVGNPF 
LAAPNLTVEB 
GKQISCVAEN 
GAILNESKYI 
WPGIDDGANP 
SWGFCLLVM 



31 
I 

ACPTSCKCSA 
VGLRNLTIVD 
TCSCDIMWIK 
GKSITLSCSV 
LVGEDQDSVN 
CTKIHVTNHT 
NYPDVIYEDY 
LFLLKLARHS 



GGCCAGTTCT 
GTGGGGAGGT 
TCTGTCATTA 
ATAGCAGAAA 
TCAGTATGGC 
TTATGTTATA 
TTCCAGACCC 
ATTGCAAGGT 
TATCAAAGAT 



41 
I 

SRIWCSDPSP 
SGLKFVAHKA 
TLQEAKSSPD 
AGDPVPKMYW 
LTVHFAPTIT 
EYHGCLQLDN 
GTAANDIGDT 
KFGMKGFVLF 



CAAGTTTTCC 
CCGAACATTT 
CTTAGATTCC 
TCATGACCCT 
AAAGGTTCTT 
CCATGGAGCC 
AAAAAGGAAA 
AGATTTTTGT 
CAGTTGACAT 



51 
I 

GIVAFPRLKP 

FLKNSNLQHI . 

TQDLYCLNES 

DVGNLVSKKM 

FLESPTSDHH 

PTHMNNGDYT 

TNRSNEIPST 

HKIPLDG 



4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 19 DNA sequence 

Nucleic Acid Accession #: NM_00022B 

Coding sequence: 82-3600 

1 11 21 31 41 51 

1 I I ! I I 

GCTTTCAGGC GATCTGGAGA AAGAACGGCA GAACACACAG CAAGGAAAGG TCCTTTCTGG 60 

GGATCACCCC ATTGGCTGAA GATGAGACCA TTCTTCCTCT TGTGTTTTGC CCTGCCTGGC 120 

CTCCTGCATG CCCAACAAGC CTGCTCCCGT GGGGCCTGCT ATCCACCTGT TGGGGACCTG 180 

CTTGTTGGGA GGACCCGGTT TCTCCGAGCT TCATCTACCT GTGGACTGAC CAAGCCTGAG 240 

ACCTACTGCA CCCAGTATGG CGAGTGGCAG ATGAAATGCT GCAAGTGTGA CTCCAGGCAG 300 

CCTCACAACT ACTACAGTCA CCGAGTAGAG AATGTGGCTT CATCCTCCGG CCCCATGCGC 360 

TGGTGGCAGT CCCAGAATGA TGTGAACCCT GTCTCTCTGC AGCTGGACCT GGACAGGAGA 420 

TTCCAGCTTC AAGAAGTCAT GATGGAGTTC CAGGGGCCCA TGCCCGCCGG CATGCTGATT 480 

GAGCGCTCCT CAGACTTCGG TAAGACCTGG CGAGTGTACC AGTACCTGGC TGCCGACTGC 540 

ACCTCCACCT TCCCTCGGGT CCGCCAGGGT CGGCCTCAGA GCTGGCAGGA TGTTCGGTGC 600 

CAGTCCCTGC CTCAGAGGCC TAATGCACGC CTAAATGGGG GGAAGGTCCA ACTTAACCTT 660 

ATGGATTTAG TGTCTGGGAT TCCAGCAACT CAAAGTCAAA AAATTCAAGA GGTGGGGGAG 720 

ATCACAAACT TGAGAGTCAA TTTCACCAGG CTGGCCCCTG TGCCCCAAAG GGGCTACCAC 7 B0 

CCTCCCAGCG CCTACTATGC TGTGTCCCAG CTCCGTCTGC AGGGGAGCTG CTTCTGTCAC 840 

GGCCATGCTG ATCGCTGCGC ACCCAAGCCT GGGGCCTCTG CAGGCCCCTC CACCGCTGTG 900 

CAGGTCCACG ATGTCTGTGT CTGCCAGCAC AACACTGCCG GCCCAAA.TTG TGAGOGCTGT 960 

GCACCCTTCT ACAACAACCG GCCCTGGAGA CCGGCGGAGG GCCAGGACGC CCATGAATGC 1020 

CAAAGGTGCG ACTGCAATGG GCACTCAGAG ACATGTCACT TTGACCCCGC TGTGTTTGCC 1080 

GCCAGCCAGG GGGCATATGG AGGTGTGTGT GACAATTGCC GGGACCACAC CGAAGGCAAG 1140 

AACTGTGAGC GGTGTCAGCT GCACTATTTC CGGAACCGGC GCCCGGGAGC TTCCATTCAG 1200 

GAGACCTGCA TCTCCTGCGA GTGTGATCCG GATGGGGCAG TGCCAGGGGC TCCCTGTGAC 1260 

CCAGTGACCG GGCAGTGTGT GTGCAAGGAG CATGTGCAGG GAGAGCGCTG TGACCTATGC 1320 

AAGCCGGGCT TCACTGGACT CACCTACGCC AACCCGCAGG GCTGCCACCG CTGTGACTGC 1380 

AACATCCTGG GGTCCCGGAG GGACATGCCG TGTGACGAGG AGAGTGGGCG CTGCCTTTGT 1440 

CTGCCCAACG TGGTGGGTCC CAAATGTGAC CAGTGTGCTC CCTACCACTG GAAGCTGGCC 1500 

AGTGGCCAGG GCTGTGAACC GTGTGCCTGC GACCCGCACA ACTCCCCTCA GCCCACAGTG 1560 

CAACCAGTTC ACAGGGCAGT GCCCTGTCGG GAAGGCTTTG GTGGCCTGAT GTGCAGCGCT 1620 

GCAGCCATCC GCCAGTGTCC AGACCGGACC TATGGAGACG TGGCCACAGG ATGCCGAGCC 1680 

TGTGACTGTG ATTTCCGGGG AACAGAGGGC CCGGGCTGCG ACAAGGCATC AGGCCGCTGC 1740 

CTCTGCCGCC CTGGCTTGAC CGGGCCCCGC TGTGACCAGT GCCAGCGAGG CTACTGCAAT 1800 

CGCTACCCGG TGTGCGTGGC CTGCCACCCT TGCTTCCAGA CCTATGATGC GGACCTCCGG 1B60 

GAGCAGGCCC TGCGCTTTGG TAGACTCCGC AATGCCACCG CCAGCCTGTG GTCAGGGCCT 1920 

GGGCTGGAGG ACCGTGGCCT GGCCTCCCGG ATCCTAGATG CAAAGAGTAA GATTGAGCAG 1980 

ATCCGAGCAG TTCTCAGCAG CCCCGCAGTC ACAGAGCAGG AGGTGGCTCA GGTGGCCAGT 2040 

GCCATCCTCT CCCTCAGGCG AACTCTCCAG GGCCTGCAGC TGGATCTGCC CCTGGAGGAG 2100 

GAGACGTTGT CCCTTCCGAG AGACCTGGAG AGTCTTGACA GAAGCTTCAA TGGTCTCCTT 2160 

ACTATGTATC AGAGGAAGAG GGAGCAGTTT GAAAAAATAA GCAGTGCTGA TCCTTCAGGA 2220 

GCCTTCCGGA TGCTGAGCAC AGCCTACGAG CAGTCAGCCC AGGCTGCTCA GCAGGTCTCC 2280 

GACAGCTCGC GCCTTTTGGA CCAGCTCAGG GACAGCOGGA GAGAGGCAGA GAGGCTGGTG 2340 

CGGCAGGCGG GAGGAGGAGG AGGCACCGGC AGCCCCAAGC TTGTGGCCCT GAGGCTGGAG 2400 

ATGTCTTCGT TGCCTGACCT GACACCCACC TTCAACAAGC TCTGTGGCAA CTCCAGGCAG 2460 

ATGGCTTGCA CCCCAATATC ATGCCCTGGT GAGCTATGTC CCCAAGACAA TGGCACAGCC 2520 

TGTGGCTCCC GCTGCAGGGG TGTCCTTCCC AGGGCOGGTG GGGCCTTCTT GATGGCGGGG 2580 

CAGGTGGCTG AGCAGCTGCG GGGCTTCAAT GCCCAGCTCC AGCGGACCAG GCAGATGATT 2640 

AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG CCCAGCGCTT GGAGACCCAG 2700 

GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGTCAGAC GCACACGGCT CCTAATCCAG 2760 

CAGGTCCGGG ACTTCCTAAC AGACCCCGAC ACTGATGCAG CCACTATCCA GGAGGTCAGC 2820 

GAGGCCGTGC TGGCCCTGTG GCTGCCCACA GACTCAGCTA CTGTTCTGCA GAAGATGAAT 2880 

GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT TGGTGCTGTC CCAGACCAAG 2940 

CAGGACATTG CGCGTGCCCG CCGGTTGCAG GCTGAGGCTG AGGAAGCCAG GAGCCGAGCC 3000 

CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC TGCGGCAGGG GACAGTGGCA 3060 

CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT CCCTTCGGCT TATCCAGGAC 3120 

AGGGTTGCTG AGGTTCAGCA GGTACTGCGG CCAGCAGAAA AGCTGGTGAC AAGCATGACC 3180 

AAGCAGCTGG GTGACTTCTG GACACGGATG GAGGAGCTCC GCCACCAAGC CCGGCAGCAG 3240 

GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GCGGAAGGTG CCAGCGAGCA GGCATTGAGT 3300 

GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGTATGCTG AGTTGAAGGA CCGGTTGGGT 3360 
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CAGAGTTCCA TGCTGGGTGA GCAGGGTGCC CGGATCCAGA GTGTGAAGAC AGAGGCAGAG 3420 

GAGCTGTTTG GGGAGACCAT GGAGATGATG GACAGGATGA AAGACATGGA GTTGGAGCTG 3480 

CTGCGGGGCA GCCAGGCCAT CATGCTGCGC TCGGCGGACC TGACAGGACT GGAGAAGCGT 3540 

GTGGAGCAGA TCCGTGACCA CATCAATGGG CGCGTGCTCT ACTATGCCAC CTGCAAGTGA 3600 
TGCTACAGCT TCCAGCCCGT TGCCCCACTC ATCTGCCGCC TTTGCTTTTG GTTGGGGGCA '3660 

GATTGGGTTG GAATGCTTTC CATCTCCAGG AGACTTTCAT GCAGCCTAAA GTACAGCCTG 3720 

GACCACCCCT GGTGTGTAGC TAGTAAGATT ACCCTGAGCT GCAGCTGAGC CTGAGCCAAT 3780 

GGGACAGTTA CACTTGACAG ACAAAGATGG TGGAGATTGG CATGCCATTG AAACTAAGAG 3840 

CTCTCAAGTC AAGGAAGCTG GGCTGGGCAG TATCCCCCGC CTTTAGTTCT CCACTGGGGA 3900 

GGAATCCTGG ACCAAGCACA AAAACTTAAC AAAAGTGATG TAAAAATGAA AAGCCAAATA 3960 

aaaatcttto g 

Seq ID NO: 20 Protein sequence: 
Protein Accession #: NP_000219 

1 11 21 31 41 51 

| - | I I I 'I 

MRPFFLLCFA IiPGLLHAQQA CSRGACYPPV GDLLVGRTRF LRASSTCGLT KPETYCTQYG 60 

EWQMKCCKCD SRQPHNYYSH RVENVASSSG PMRWWQSQND VNPVSLQLDL DRRFQLQEVM 120 

MEFQGPMPAG MLIERSSDFG ICTWRVYQYLA ADCTSTFPRV RQGRPQSWQD VRCQSLPQRP 180 

NARLNGGKVQ LNLMDLVSGI PATQSQKIQE VGEITNLRVN FTRLAPVPQR GYHPPSAYYA 240 

VSQLRLQGSC FCHGHADRCA PKPGASAGPS TAVQVHDVCV CQHNTAGPNC ERCAPFYNNR 300 

PHRPAEGQDA HECQRCDCNG HSETCHFDPA VFAASQGAYG GVCDNCRDHT EGKNCERCQL 360 

HYFRNRRPGA SIQETCISCE CDPDGAVPGA PCDPVTGQCV CKEHVQGERC DLCKPGFTGL 420 

TYANPQGCHR CDCNILGSRR DMPCDEESGR CLCLPNWGP KCDQCAPYHW KLASGQGCEP 480 

CACDPHNSPQ PTVQPVHRAV PCREGFGGLM CSAAAIRQCP DRTYGDVATG CRACDCDFRG 540 

TEGPGCDKAS GRCLCRPGLT GPRCDQCQRG YCNRYPVCVA CHPCFQTYDA DI.REQALRFG 600 

RLRNATASLW SGPGIiEDRGL ASRILDAKSK IEQIRAVLSS PAVTEOEVAQ VASAILSLRR 660 

TLQGLQLDLP LEEETIjSLPR DLESLDRSFN GLLTMYQRKR EQFEKISSAD PSGAFRMLST . 720 

AYEQSAQAAQ QVSDSSRLLD QLRDSRREAE RLVRQAGGGG GTGSPKLVAL RLEMSSLPDIi 780 

TPTFNKLCGN SRQMACTPIS CPGELCPQDN GTACGSRCRG VLPRAGGAFL MAGQVAEQLR 840 

GFNAQLQRTR QMIRAAEESA SQIQSSAQRL ETQVSASRSQ MEEDVRRTRL LIQQVRDFLT 900 

DPDTDAATIQ EVSEAVLALW LPTDSATVLQ KMNEIQAIAA RLPNVDLVLS QTKQDIARAR 960 

RLQAEAEEAR SRAHAVEGQV EDWGNLRQG TVALQEAQDT MQGTSRSLRL IQDRVAEVQQ 1020 

VLRPAEKLVT SMTKQLGDFW TRMEELRHQA RQQGAEAVQA QQIxAEGASEQ ALSAQEGFER 1080 

I KQKYAELKD RLGQSSMLGE QGARIQSVKT EAEELFGETM EMMDRMKDME LELLRGSQAI 1140 
MLRSADLTGL EKRVEQIRDH INGRVLYYAT CK 

Seq ID NO: 21 DNA sequence 
Nucleic Acid Accession #: NMJ>03722 
Coding sequence: 145-1491 

1 11 21 31 41 51 

I I 1 1 I I 

TCGTTGATAT CAAAGACAGT TGAAGGAAAT GAATTTTGAA ACTTCACGGT GTGCCACCCT 60 

ACAGTACTGC CCTGACCCTT ACATCCAGCG TTTCGTAGAA ACCCAGCTCA TTTCTCTTGG 120 

AAAGAAAGTT ATTACCGATC CACCATGTCC CAGAGCACAC AGACAAATGA ATTCCTCAGT 180 

CCAGAGGTTT TCCAGCATAT CTGGGATTTT CTGGAACAGC CTATATGTTC AGTTCAGCCC 240 

ATTGACTTGA ACTTTGTGGA TGAACCATCA GAAGATGGTG CGACAAACAA GATTGAGATT 300 

AGCATGGACT GTATCCGCAT GCAGGACTCG GACCTGAGTG ACCCCATGTG GCCACAGTAC 360 

ACGAACCTGG GGCTCCTGAA CAGCATGGAC CAGCAGATTC AGAACGGCTC CTCGTCCACC 420 

AGTCCCTATA ACACAGACCA CGCGCAGAAC AGCGTCACGG CGCCCTCGCC CTACGCACAG 480 

CCCAGCTCCA CCTTCGATGC TCTCTCTCCA TCACCCGCCA TCCCCTCCAA CACCGACTAC 540 

CCAGGCCCGC ACAGTTTCGA CGTGTCCTTC CAGCAGTCGA GCACCGCCAA GTCGGCCACC 600 

TGGACGTATT CCACTCAACT GAAGAAACTC TACTGCCAAA TTGCAAAGAC ATGCCCCATC 660 

CAGATCAAGG TGATGACCCC ACCTCCTCAG GGAGCTGTTA TCCGCGCCAT GCCTGTCTAC 720 

AAAAAAGCTG AGCACGTCAC GGAGGTGGTG AAGCGGTGCC CCAACCATGA GCTGAGCCGT 780 

GAATTCAACG AGGGACAGAT TGCCCCTCCT AGTCATTTGA TTCGAGTAGA GGGGAACAGC 840 

CATGCCCAGT ATGTAGAAGA TCCCATCACA GGAAGACAGA GTGTGCTGGT ACCTTATGAG 900 

CCACCCCAGG TTGGCACTGA ATTCACGACA GTCTTGTACA ATTTCATGTG TAACAGCAGT 960 

TGTGTTGGAG GGATGAACCG CCGTCCAATT TTAATCATTG TTACTCTGGA AACCAGAGAT 1020 

GGGCAAGTCC TGGGCCGACG CTGCTTTGAG GCCCGGATCT GTGCTTGCCC AGGAAGAGAC 1080 

AGGAAGGCGG ATGAAGATAG CATCAGAAAG CAGCAAGTTT CGGACAGTAC AAAGAACGGT 1140 

GATGGTACGA AGCGCCCGTT TCGTCAGAAC ACACATGGTA TCCAGATGAC ATCCATCAAG 1200 

AAACGAAGAT CCCCAGATGA TGAACTGTTA TACTTACCAG TGAGGGGCCG TGAGACTTAT 1260 

GAAATGCTGT TGAAGATCAA AGAGTCCCTG GAACTCATGC AGTACCTTCC TCAGCACACA 1320 

ATTGAAACGT ACAGGCAACA GCAACAGCAG CAGCACCAGC ACTTACTTCA GAAACATCTC 1380 

CTTTCAGCCT GCTTCAGGAA TGAGCTTGTG GAGCCCCGGA GAGAAACTCC AAAACAATCT 1440 

GACGTCTTCT TTAGACATTC CAAGCCCCCA AACCGATCAG TGTACCCATA GAGCCCTATC 1500 

TCTATATTTT AAGTGTGTGT GTTGTATTTC CATGTGTATA TGTGAGTGTG TGTGTGTGTA 1560 

TGTGTGTGCG TGTGTATCTA GCCCTCATAA ACAGGACTTG AAGACACTTT GGCTCAGAGA 1620 

CCCAACTGCT CAAAGGCACA AAGCCACTAG TGAGAGAATC TTTTGAAGGG ACTCAAACCT 1680 

TTACAAGAAA GGATGTTTTC TGCAGATTTT GTATCCTTAG ACCGGCCATT GGTGGGTGAG 1740 

GAACCACTGT GTTTGTCTGT GAGCTTTCTG TTGTTTCCTG GGAGGGAGGG GTCAGGTGGG 1800 

GAAAGGGGCA TTAAGATGTT TATTGGAACC CTTTTCTGTC TTCTTCTGTT GTTTTTCTAA 1860 

AATTCACAGG GAAGCTTTTG AGCAGGTCTC AAACTTAAGA TGTCTTTTTA AGAAAAGGAG 1920 

AAAAAAGTTG TTATTGTCTG TGCATAAGTA AGTTGTAGGT GACTGAGAGA CTCAGTCAGA 1980 

CCCTTTTAAT GCTGGTCATG TAATAATATT GCAAGTAGTA AGAAACGAAG GTGTCAAGTG 2040 

TACTGCTGGG CAGCGAGGTG ATCATTACCA AAAGTAATCA ACTTTGTGGG TGGAGAGTTC 2100 

TTTGTGAGAA CTTGCATTAT TTGTGTCCTC CCCTCATGTG TAGGTAGAAC ATTTCTTAAT 2160 

GCTGTGTACC TGCCTCTGCC ACTGTATGTT GGCATCTGTT ATGCTAAAGT TTTTCTTGTA 2220 

CATGAAACCC TGGAAGACCT ACTACAAAAA AACTGTTGTT TGGCCCCCAT AGCAGGTGAA 2280 

CTCATTTTGT GCTTTTAATA GAAAGACAAA TCCACCCCAG TAATATTGCC CTTACGTAGT 2340 

TGTTTACCAT TATTCAAAGC TCAAAATAGA ATTTGAAGCC CTCTCACAAA ATCTGTGATT 2400 

AATTTGCTTA ATTAGAGCTT CTATCCCTCA AGCCTACCTA CCATAAAACC AGCCATATTA 2460 

CTGATACTGT TCAGTGCATT TAGCCAGGAG ACTTACGTTT TGAGTAAGTG AGATCCAAGC 2520 

AGACGTGTTA AAATCAGCAC TCCTGGACTG GAAATTAAAG ATTGAAAGGG TAGACTACTT 2580 
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TTCTTTTTTT TACTCAAAAG TTTAGAGAAT CTCTGTTTCT TTCCATTTTA AAAACATATT 2640 

TTAAGATAAT AGCATAAAGA CTTTAAAAAT GTTCCTCCCC TCCATCTTCC CACACCCAGT 2700 

CACCAGCACT. GTATTTTCTG TCACCAAGAC AATGATTTCT TGTTATTGAG GCTGTTGCTT 2760 
TTGTGGATGT GTGATTTTAA TTTTCAATAA ACTTTTGCAT CTTGGTTTAA, AAGAAA 

Seq ID NO: 22 Protein sequence: 
Protein Accession 8> NP_003713 

1 11 21 31 41 51 

I.I II I 1 

MSQSTQTNEP LSPEVFQHIW DFLEQPICSV QPIDUIFVDE PSEDGATNKI EISMDCIRMQ 60 
DSDLSDPMWP QYTNLGLLNS MDQQIQNGSS STSPYNTDHA QNSVTAPSPY AQPSSTFDAL 120 
SPSPAIPSNT DYPGPHSFDV SFQQSSTAKS ATWTYSTELK KLYCQIAKTC PIQIKVMTPP 180 
PQGAVIRAMP VYKKAEHVTE WKRCPNHEL SREFNEGQIA PPSHLIRVEG NSHAQYVEDP 240 
ITGRQSVLVP YEPPQVGTEF TTVLYNFMCN SSCVGGMNRR PILIIVTLET RDGQVLGRRC 300 
FEARICACPG RDRKADEDSI RKQQVSDSTK NGDGTKRPFR QNTHGIQMTS IKKRRSPDDE 360 
LLYLPVRGRS TYEMLLKIKE SLELMQYLPQ HTIETYRQQQ QQQHQHLLQK HLLSACFRNE 420 
LVEPRRETPK QSDVFFRHSK PPNRSVYP 



Seq ID NO: 23 DNA sequence 

Nucleic Acid Accession ft: NM_001944.1 

Coding sequence: 84-3083 

1 11 21 31 41 51 

I I.I i I I 

TTTTCTTAGA CATTAACTGC AGACGGCTGG CAGGATAGAA GCAGCGGCTC ACTTGGACTT 60 

TTTCACCAGG GAAATCAGAG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG 120 

CCATCTTCGT GGTGGTCATA TTGGTTCATG GAGAATTGCG AATAGAGACT AAAGGTCAAT 180 

ATGATGAAGA AGAGATGACT ATGCAACAAG CTAAAAGAAG GCAAAAACGT GAATGGGTGA 240 

AATTTGCCAA . ACCCTGCAGA GAAGGAGAAG ATAACTCAAA AAGAAACCCA ATTGCCAAGA 300 

TTACrrCAGA TTACCAAGCA ACCCAGAAAA TCACCTACCG AATCTCTGGA GTGGGAATCG 360 

ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 420 

CTATAGTCGA CCGGGAGGAA ACTCCAAGCT TCCTGATCAC ATGTCGGGCT CTAAATGCCC 480 

AAGGACTAGA TGTAGAGAAA CCACTTATAC TAACGGTTAA AATTTTGGAT ATTAATGATA 540 

ATCCTCCAGT ATTTTCACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCTCAA 600 

ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 660 

AAATTGCCTT CAAAATTGTC TCTCAGGAAC CAGCAGGCAC ACCCATGTTC CTCCTAAGCA 720 

GAAACACTGG GGAAGTCCGT ACTTTGACCA ATTCTCTTGA CCGAGAGCAA GCTAGCAGCT 780 

ATCGTCTGGT TGTGAGTGGT GCAGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 840 

GTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCCCAAT GTTTAGAGAC TCTCAGTATT 900 

CAGCACGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TCGATTTCAA GTAACAGATT 960 

TGGATGAAGA GTACACAGAT AATTGGCTTG CAGTATATTT CTTTACCTCT GGGAATGAAG 1020 

GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AGGCATCCTG AAAGTGGTGA 1080 

AGGCTCTAGA TTATGAACAA CTACAAAGCG TGAAACTTAG TATTGCTGTC AAAAACAAAG 1140 

CTGAATTTCA CCAATCAGTT ATCTCTCGAT ACCGAGTTCA GTCAACCCCA GTCACAATTC 1200 

AGGTAATAAA TGTAAGAGAA GGAATTGCAT TCCGTCCTGC TTCCAAGACA TTTACTGTGC 1260 

AAAAAGGCAT AAGTAGCAAA AAATTGGTGG ATTATATCCT GGGAACATAT CAAGCCATCG 1320 

ATGAGGACAC TAACAAAGCT GCCTCAAATG TCAAATATGT CATGGGACGT AACGATGGTG 1380 

GATACCTAAT GATTGATTCA AAAACTGCTG AAATCAAATT TGTCAAAAAT ATGAACCGAG 1440 

ATTCTACTTT CATAGTTAAC AAAACAATCA CAGCTGAGGT TCTGGCCATA GATGAATACA 1500 

CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC CGATTTCAAT GACAATTGTC 1560 

CAACAGCTGT CCTCGAAAAA GATGCAGTTT GCAGTTCTTC ACCTTCCGTG GTTGTCTCCG 1620 

CTAGAACACT GAATAATAGA TACACTGGCC CCTATACATT TGCACTGGAA GATCAACCTG 1680 

TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC TACCTCGGCC CTCCTCAGAG 1740 

CCCAGGAACA GATACCTCCT GGAGTATACC ACATCTCCCT GGTACTTACA GACAGTCAGA 1800 

ACAATCGGTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTGTCAGTGT GACAACAGGG 1860 

GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 

CAGGGAGGCT GGGGCCTGCC GCCATGGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040 

GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100 

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160 

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220 

TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340 

CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460 

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGAOGATGG CCAGGAAGCA AATGACTGCT 2520 

TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 2580 

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640 

TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2700 

CCTCTAAAGA CAG CGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCCAGCAGA 2760 

CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT TTGTCCGCCT 2820 

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AACTATTTAG 2880 

TAACGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2940 

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3000 

CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3 060 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 

ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 3180 

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240 

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300 
TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC 

Seq ID NO: 24 Protein sequence: 
Protein Accession 8: NP_001935.1 

1 11. 21 , 31 41 51 
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I : I 



VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRIiI 

Seq ID NO: 25 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 56-1642 



11 21 31 41 

11)1 



GCAAGGGATC 



CAAGTTGACT CGAGTGTTCC AAGGTTTCTT CACAGGCCGA 



WVKFAKPCRB 


60 


ITAIVDREET 


120 


ASNSLVMILN 


180 


SSYRLWSGA 


240 


TDLDEEYTDN 


300 


NKAEFHQSVI 


360 


AIDEDTNKAA 


420 


EYTGKTSTGT 


480 


QPVKLPAVWS 


540 


NRGICGTSYP 


600 


VTGGFIPVPD 


660 


TAVEGTSGME 


720 


STGGTNKDYA 


780 


VGCCSFIADD 


840 


QQTGFVKCQT 


900 


FDPLLTQNVI 


960 


51 




1 

CCGTCATGTC 


60 


TAGTTTCTCC 


120 


TGCTATCAGA 


180 


AGGACAGTAT 


240 


TGGAACGACA 


300 


AAGCACCCAA 


360 


ACAGGTTCAC 


420 


TAACTGTGAA 


480 


ATGGAGTCAC 


540 


TTCTCCCCCG 


600 


CTGATCTGAA 


660 


AGGAGGAAAT 


720 


CTTCCTTGAA 


780 


GTGGCATTGC 


840 


CAAGTCATCG 


900 


TCTCCATCTG 


960 


CGCCTAGCCA 


1020 


CCTATGTGAA 


1080 


TAAAAGTGGG 


1140 


GCAGTCACAG 


1200 


CCAAGATCAG 


1260 


AGAGTGGTGA 


1320 


GCTGTATTGC 


1380 


TCCGTGACAG 


1440 


GCATGATTGT 


1500 


AGTTCTCAGC 


1560 


CCATCCCTGC 


1620 


GGGGCTAAGG 


1680 


TATGGCAAAG 


1740 


CGACAGGAAA 


1800 


GAACAGATGC 


1860 


TTGGAGGAAA 


1920 


CAAGAAGAGA 


1980 


GCCAGACAAC 


2040 


CAAAGGTTGG 


2100 


CAGTGCAAAG 


2160 


GAACCACCAC 


2220 


CAGAAGAATA 


2280 


GCAGAGAGAG 


2340 


TGTGATGACA 


2400 


CTAGTGAAAC 


2460 


TTGAAACTCC 


2520 


CAACCAAACC 


2580 


CCAACCTGCC 


2640 


TCCCCTTTAC 


2700 


GAGCAGTCAT 


2760 


CTTTACCATA 


2620 


ACTTTTGTAT ' 


2880 


TGATTTCTAT 


2940 


AATTCCAAAT 


3000 



GTAGCAAAAT CATTAAAACA AATTATAAAA GGGACAGAAA AA 

Seq ID NO: 26 Protein sequence: 
Protein Accession ft: Eos sequence 

1 11 * 21 31 41 51 

| | I I I I 

HSQGIL.SPPA GLLSDDDWV SPMPESTAAD LGSWRKNLL SDCSWSTSL EDKQQVPSED 60 

SMEKVKVYLR VRPLLPSELB RQEOQGCVRI ENVETLVLCA PXDSPALKSN ERGIGQATHR 120 

FTPSQIFGPE VGQASFPNLT VKEMVKDVLK GQNWLIYTYG VTNSGKTHTI OGTIKDGGIL 180 

PRSliALIFNS LQGQLHPTPD LKPLLSNEVI WLDSKQIRQE EMKKLSLLNG GLQEEELSTS 240 

LKRSVYIESR IGTSTSFDSG IAGLSSISQC TSSSQLDETS KRWAQPDTAP LPVPANXRFS 300 
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IWISFFEIYN ELLYDUiEPP SQQRKROTLR LCEDQNGNPY VKDLNWIHVQ DAEEAWKLLK 
• VGRKNQSFAS ■ THljjQNSSRS HSIFSIRILH LQGEGDIVPK ISELSLCDLA GSERCKDQKS 
GERLKEAGNI NTSLHTLGRC IAAIJRQNQQN RSKQNLVPFR DSKLTRVFQ6 FFTGRGRSCM. 
IVNVNPCAST YDETLHVAKF SAIASQVTCA CPTYATGIPI PALVHQGT 

Seq ID NO: "27 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 13-1424 



PCT/US02/12476 



360 
420 
•480 



T AGAAGTTT A 
CTTCCCCTGA 
TTAGAAAAAT 
GGAAACTTAA 
GGGCAACTGG 
GTCCATCATT 
- AGAATCAATA 
GCTTTCCAAG 
GCTGACATTT 
AAAGGTGGAA 
TTCGATGAGG 
GTTCACGAGA 
TTCCCCACCT 
GGCATTCAGT 
TCAGAACCAG 
AAGATCTTTT 
AGTGTTAATT 
GAAATTGAAG 
AATTTAAGAC 
GTGAAAAAAA 
GATAACCAGT 
CTGATTACCA 
AACAAATACT 
CGTATCACCA 
TGGTTTTTGT 
GTGTACCACT 
TTAT ATAAAA 
CTCTACTATT 
CTCTGTAAGT 
TAAAATTAAG 



11* 
I 

CAATGAAGTT 
ACAGCTCTAC 
TTTATGGCCT 
TGAAGGAAAA 
ACACATCTAC 
TCAGGGAAAT 
ATTACACACC 
TATGGAGTAA 
TGGTGGTTTT 
TCCTAGCCCA 
ACGAATTCTG 
TTGGCCATTC 
ACAAATATGT 
CCCTGTATGG 
CTCTCTGTGA 
TCTTCAAAGA 
TAATTTCTTC 
CCAGAAATCA 
CAGAGCCAAA 
TTGATGCAGC 
ATTGGAGGTA 
AGAACTTCCA 
ACTATTTCTT 
AAACACTGAA 
TAGTTCACTT 
ACTTAGAGAT 
TACATAATAT 
AAGTTTGAAA 
TGCTTCCTAA 
TATATATATT 



21 
I 

TCTTCTAATA 
AAGCCTGGAA 
TGAGATAAAC 
AATCCAAGAA 
CCTGGAGATG 
GCCAGGGGGG 
TGACATGAAC 
TGTTACCCCC 
TGCCCGTGGA 
TGCTTTTGGA 
GACTACACAT 
CTTAGGTCTT 
TGACATCAAC 
AGACCCAAAA 
CCCCAATTTG 
CAGGTTCTTC 
CTTATGGCCA 
AGTTTTTCTT 
TTATCCCAAG 
TGTTTTTAAC 
TGATGAAAGG 
AGGAATCGGG 
CCAAGGATCT 
AAGCAATAGC 
CAGCTTAATA 
ATGTATCATA 
TTTTCAATTT 
ATAGTTACCT 
CATCCTTGGA 
TTGGCTCAAA 



31 
I 

CTGCTCCTGC 
AAAAATAATG 
AAACTTCCAG 
ATGCAGCACT 
ATGCACGCAC 
CCCGTATGGA 
CGTGAGGATG 
TTGAAATTCA 
GCTCATGGAG 
CCTGGATCTG 
TCAGGAGGCA 
GGCCATTCTA 
ACATTTCGCC 
GAGAACCAAC 
AGTTTTGATG 
TGGCTGAAGG 
ACCTTGCCAT 
TTTAAAGATG 
AGCATACATT 
CCACGTTTTT 
AGACAGATGA 
CCTAAAATTG 
AACCAATTTG 
TGGTTTGGTT 
AGTATTTATT 
AAAATAAAAT 
TGAAAACTCT 
TCAAAGCAAG 
CTGAGAAATT 
TAAAATTG 



41 . 
I 

AGGCCACTGC 
TGCTATTTGG 
TGACAAAAAT 
TCTTGGGTCT 
CTCGATGTGG. 
GGAAACATTA 
TTGACTACGC 
GCAAGATTAA 
ACTTCCATGC 
GCATTGGAGG 
CAAACTTGTT 
GTGATCCAAA 
TCTCTGCTGA 
GCTTGCCAAA 
CTGTCACTAC 
TTTCTGAGAG 
CTGGCATTGA 
ACAAATACTG 
CTTTTGGTTT 
ATAGGACCTA 
TGGACCCTGG 
ATGCAGTCTT 
AATATGACTT 
GTTGAAAATG 
GCATATTTGC 
CTGTAAACCA 
AATTGTCCAT 
ATAATTCTAT 
ATACTTACTT 



51 
I 

TTCTGGAGCT 
TGAAAGATAC 
GAAATATAGT 
GAAAGTGACC 
AGTCCCCGAT 
TATCACCTAC 
AATCCGGAAA 
CACAGGCATG 
TTTTGATGGC 
GGATGCACAT 
CCTCACTGCT 
GGCCGTAATG 
TGACATACGT 
TCCTGACAAT 
CGTGGGAAAT 
ACCAAAGACC 
AGCTGCTTAT 
GTTAATTAGC 
TCCTAACTTT 
CTTCTTTGTA 
TTATCCCAAA 
CTACTCTAAA 
CCTACTCCAA 
GTGTAATTAA 
TATGTCCTCA 
TAGGTAATGA 
TCTTGCTTGA 
TTGAAGCATG 
CTGGCATAAC 



Seq ID NO: 28 Protein sequence: 
Protein Accession #: Eos sequence 



MKFLLILLLQ 
KEKIQEMQHF 
YTPDMNREDV 
LAKAFGPGSG 
KYVDINTFRL 
PKDRPFWLKV 
EPNYPKSIHS 
NFQGIGPKID 



11 
I 

ATASGALPLN 
LGLKVTGQLD 
DYA1RKAFQV 
IGGDAHFDED 
SADDIRGIQS 
SERPKTSVNL 
FGFPNFVKKI 
AVFYSKNKYY 



21 

I . 

SSTSLEKNNV 
TSTLEMMHAP 
WSNVTPLKPS 
EFWTTHSGGT 
LYGDPKENQR 
ISSLWPTIiPS 
DAAVPNPRPY 
YFFQGSNQFB 



31 
I 

LFG ER YI*EKF 
RCGVPDVHHF 
KINTGMADIL 
NLFLTAVHEI 
LPNPDNSEPA 
GIEAAYEIEA 
RTYFFVDNQY 
YDFLLQRITK 



41 
I 

YGLEINKLPV 
REMPGGPVWR 
WFARGAHGD 
GHSLGLGHSS 
LCDPNLSFDA 
RNQVFLFKDD 
WRYDERRQMM 
TLKSNSWFGC 



Seq ID NO: 29 DNA sequence 

Nucleic Acid Accession #iNM_006115.1 

Coding sequence: 236.. 1765 



GCTTCAGGGT 
CGGGACACCC 
ACTCTCTGAG 
GAGACCTAGA 
ACGAAGGCGT 
CCCACGGAGA 
TGCCGCCCTG 
CGGGAGACAC 
TCTGGGAGTG 
TGGACTTGAT 
GGATTTACGG 
TCTGTACTCA 
TGGTTTGAGC 
CCTCAAGGAA 
GAAAAATGTA 
TATCAAGATG 
TACCTGGAAG 
GCGTAGACTC 
GCAGTATATC 
TGTGGACTCT 
CCCCTTGGAA 
GTCCCAGAGT 
CGATGTAAGT 
CCTGGrrCTTT 
GAGCCACTGC 



11- 
I 

ACAGCTCCCC 
CACCCGCTTC. 
GAAAAACCAT 
AATCCAAGCG' 
TTGTGGGGTT 
CTTGTGGAGC 
GAGTTGCTGC 
AGCCAGACCC 
CTGATGAAGG 
GTGCTCCTTG 
AAGAACTCTC 
TTTCCAGAGC 
ACAGAGGCAG 
GGTGCCTGTG 
CTAOGCCTGT 
ATCCTGAAAA 
CTACCCACCT 
CTCCTCTCCC 
GCCCAGTTCA 
TTATTTTTCC 
ACCCTCTCAA 
CCCAGCGTCA 
CCCGAGCCCC 
GATGAGTGTG 
TCCCAGCTTA 



21 

I. 

CGCAGCCAGA 
CCAGGCGTGA 
TTTGATTATT 
TTGGAGGTCC 
CCATTCAGAG 
TGGCAGGGCA 
CCAGGGAGCT 
TGAAGGCAAT 
GACAACATCT 
CCCAGGAGGT 
ATCAGGACTT 
CAGAAGCAGC 
AGCAGCCCTT 
ATGAATTGTT 
GCTGTAAGAA 
TGGTGCAGCT 
TGGCGAAATT 
ACATCCATGC 
CCTCTCAGTT 
TTAGAGGCCG 
TAACTAACTG 
GTCAGCTAAG 
TCCAAGCTCT 
GGATCACGGA 
CAACCTTAAG 



31 

I 

AGCCGGGCCT 
CCTGTCAACA 
ACTCTCAGAC 
TGAGGCCAGC 
CCGATACATC 
GAGCCTGCTG 
CTTCCCGCCA 
GGTGCAGGCC 
TCACCTGGAG 
TCGCCCCAGG 
CTGGACTGTA 
TCAGCCCATG 
CATTCCAGTA 
CTCCTACCTC 
GCTGAAGATT 
GGACTCTATT 
TTCTCCTTAC 
ATCTTCCTAC 
CCTCAGTCTG 
CCTGGATCAG 
CCGGCTTTCG 
TGTCCTGAGT 
GCTGGAGAGA 
TGATCAGCTC 
CTTCTACGGG 



41 
I 

GCAGCCCCTC 
GCAACTTCGC 
GTGCGTGGCA 
CTAAGTCGCT 
AGCATGAGTG 
AAGGATGAGG 
CTCTTCATGG 
TGGCCCTTCA 
ACCTTCAAAG 
AGGTGGAAAC 
TGGTCTGGAA 
ACAAAGAAGC 
GAGGTGCTCG 
ATTGAGAAAG 
TTTGCAATGC 
GAAGATTTGG 
CTGGGCCAGA 
ATTTCCCCGG 
CAGTGOCTGC 
TTGCTCAGGC 
GAAGGGGATG 
CTAAGTGGGG 
GCCTCTGCCA 
CTTGCCCTCC 
AATTCCATCT 



60 
120 
180 
240 
300 
360 
420 . 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080. 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



51 
I 

TKMKYSGNLM 
KHYITYRINN 
FHAFDGKGGI 
DPKAVMFPTY 
VTTVGNKIFF 
KYWLISNLRP 
DPGYPKLITK 



51 
I 

AGCACCGCTC 

GGTGTGGTGA 

ACAAGTGACT 

TCAAAATGGA 

TGTGGACAAG 

CCCTGGCCAT 

CAGCCTTTGA 

CCTGCCTCCC 

CTGTGCTTGA 

TTCAAGTGCT 

ACAGGGCCAG 

GAAAAGTAGA 

TAG ACCTGTT 

TGAAGCGAAA 

CCATGCAGGA 

AAGTGACTTG 

TGATTAATCT 

AGAAGGAAGA 

AGGCTCTCTA 

ACGTGATGAA 

TGATGCATCT ■ 

TCATGCTGAC 

CCCTCCAGGA 

TGCCTTCCCT 

CCATATCTGC 



60 
120 
180 
240 
300 
360 
420 



60. 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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CTTGCAGAGT CTCCTGCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACG TG CTGT ATCC 1560 

TGTCCCCCTG GAGAGTTATG AGGACATCCA TGGTACCCTC CACCTGGAGA . GGCTTGCCTA 1620 

TCTGCATGCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGG CGGCCCAGCA TGGTCTGGCT 1680 

TAGTGCCAAC CCCTGTCCTC ACTGTGGGGA CAGAACCTTC TATGACCCGG AGCCCATCCT 1740 

GTGCCCCTGT TTCATGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA TTCTGCATAC 1800 

TTGGACACTA AAGCCAGGAT GTGCATGCAT CTTGAAGCAA CAAAGCAGCC ACAGTTTCAG 1860 

ACAAATGTTC AGTGTGAGTG AGGAAAACAT GTTCAGTGAG GAAAAA ACAT TCAGACAAAT 1920 

GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTGACTTG AGGAGTTAAT 1980, 

GTGATCTTTG GGGAGATACA TCTTATAGAG TTAGAAATAG AATCTGAATT TCTAAAGGGA 2040 

GATTCTGGCT TGGGAAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100 
TGTTGAAAAT AAAGAGAAGC AATGTGAAGC AAAAAAAAAA AAAAAAAA 

Seq ID NO: 30 Protein sequence: 
Protein Accession #: NPJJ06106.1 

1 11' 21 31 41 51 

I 1 I • I I I " 

GCTTCAGGGT ACAGCTCCCC CGCAGCCAGA AGCCGGGCCT GCAGCGCCTC AGCACCGCTC 60 

CGGGACACCC CACCCGCTTC CCAGGCGTGA CCTGTCAACA GCAACTTCGC GGTGTGGTGA 120 

ACTCTCTGAG 'GAAAAACCAT TTTGATTATT ACTCTCAGAC GTGCGTGGCA ACAAGTGACT 180 

GAGACCTAGA AATCCAAGCG TTGGAGGTCC TGAGGCCAGC CTAAGTCGCT TCAAAATGGA 240 

ACGAAGGCGT TTGTGGGGTT CCATTCAGAG CCGATACATC AGCATGAGTG TGTGGACAAG 300 

CCCACGGAGA CTTGTGGAGC TGGCAGGGCA GAGCCTGCTG AAGGATGAGG CCCTGGCCAT 360 

TGCCGCCCTG GAGTTGCTGC CCAGGGAGCT CTTCCCGCCA CTCTTCATGG CAGCCTTTGA 420- 

CGGGAGACAC AGCCAGACCC TGAAGGCAAT GGTGCAGGCC TGGCCCTTCA CCTGCCTCCC 480 

TCTGGGAGTG CTGATGAAGG GACAACATCT TCACCTGGAG ACCTTCAAAG CTGTGCTTGA 540 

TGGACTTGAT GTGCTCCTTG CGCAGGAGGT TCGCCCCAGG AGGTGGAAAC TTCAAGTGCT 600 

GGATTTACGG AAGAACTCTC ATCAGGACTT CTGGACTGTA TGGTCTGGAA ACAGGGCCAG 660 

TCTGTACTCA TTTCCAGAGC CAGAAGCAGC TCAGCCCATG ACAAAGAAGC GAAAAGTAGA 720 

TGGTTTGAGC ACAGAGGCAG AGCAGCCCTT CATTCCAGTA GAGGTGCTCG TAGACCTGTT 780 

CCTCAAGGAA GGTGCCTGTG ATGAATTGTT CTCCTACCTC ATTGAGAAAG TGAAGOGAAA 840 

■GAAAAATGTA CTACGCCTGT GCTGTAAGAA GCTGAAGATT TTTGCAATGC CCATGCAGGA 900 

TATCAAGATG ATCCTGAAAA TGGTGCAGCT GGACTCTATT GAAGATTTGG AAGTGACTTG 960 

TACCTGGAAG CTACCCACCT TGGCGAAATT TTCTCCTTAC CTGGGCCAGA TGATTAATCT 1020 

GCGTAGACTC CTCCTCTCCC ACATCCATGC ATCTTCCTAC ATTTCCCCGG AGAAGGAAGA 1080 

GCAGTATATC GCCCAGTTCA CCTCTCAGTT CCTCAGTCTG CAGTGCCTGC AGGCTCTCTA 1140 

TGTGGACTCT TTATTTTTCC TTAGAGGCCG CCTGGATCAG TTGCTCAGGC ACGTGATGAA 1200 

CCCCTTGGAA ACCCTCTCAA TAACTAACTG CCGGCTTTCG GAAGGGGATG TGATGCATCT 1260 

GTCCCAGAGT CCCAGCGTCA GTCAGCTAAG TGTCCTGAGT CTAAGTGGGG TCATGCTGAC 1320 

CGATGTAAGT CCCGAGCCCC TCCAAGCTCT GCTGGAGAGA GCCTCTGCCA CCCTCCAGGA 1380 

CCTGGTCTTT GATGAGTGTG GGATCACGGA TGATCAGCTC CTTGCCCTCC TGCCTTCCCT 1440 

GAGCCACTGC TCCCAGCTTA CAACCTTAAG CTTCTACGGG AATTCCATCT CCATATCTGC 1500 

CTTGCAGAGT CTCCTGCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACG TGCTGTATCC 1560 

TGTCCCCCTG GAGAGTTATG AGGACATCCA TGGTACCCTC CACCTGGAGA GGCTTGCCTA 1620 

TCTGCATGCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGG CGGCCCAGCA TGGTCTGGCT 1680 

TAGTGCCAAC CCCTGTCCTC ACTGTGGGGA CAGAACCTTC TATGACCCGG AGCCCATCCT 1740 

GTGCCCCTGT TTCATGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA TTCTGCATAC 1800 

TTGGACACTA AAGCCAGGAT GTGCATGCAT CTTGAAGCAA CAAAGCAGCC ACAGTTTCAG 1860 

ACAAATGTTC AGTGTGAGTG AGGAAAACAT GTTCAGTGAG GAAAAAACAT TCAGACAAAT 1920 

GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTGACTTG AGGAGTTAAT 1980 

GTGATCTTTG GGGAGATACA TCTTATAGAG TTAGAAATAG AATCTGAATT TCTAAAGGGA 2040 

GATTCTGGCT TGGGAAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100 
TGTTGAAAAT AAAGAGAAGC AATGTGAAGC AAAAAAAAAA AAAAAAAA 



Seq ID NO: 31 DNA sequence 

Nucleic Acid Accession #: Bos sequence 

Coding sequence: 64-2754 

1 11 21 31 41 51 

I I I I I I 

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 
CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 
CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 
CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 
TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 
TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 
GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 
TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 
ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 
GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 
AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 
CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 
GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 
CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 
ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 
CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 
AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 
TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 
ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 
TATGAAGCAT TTGTAGAGGA AAATGCATTC - AATGTGGAAA TCTTACGAAT ACCTATAGAA 
GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 
GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 
GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 
GAAGCGCCAT TTGCTAGAGA - TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 
GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 
ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 
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AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG- TTGGATCACC ,1620 

ATTGATGAAA -TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT - 1680 

CCCAAAAATG AGTTGTATaA vTATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 

ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 

GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 

GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC - 1920 

AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 

GCAACAAAAT TATTGAGAGT- TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 

ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 

ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 

GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 

CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 

AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 

GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 

ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAAAAT TGCATCGATG TAATCAGAAT 2580 

GAAGACCGCA TGCCATCCCA AGATTATGTC CTCACTTATA ACTATGAGGG AAGA GGATC T 2640 

CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2700 

AATAATTTGG AACCCAAATT TATTACATTA GCAGAAGCAT GCACAAAGAG ATAATGTCAC 2760 

AGTGCTACAA TTAGGTCTTT GTCAGACATT CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2820 

TTCAATTTCA ACATGTATGT ATATGATGAT TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2880 

CCAATTTATA TTTTTAAAGC CAGTTGTTGC TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2940 

AACAGACAAC TGGTAAATCT CAAACTCCAG CACTGGAATT AAGGTCTCTA AAGCATCTGC 3000 

rC T TTTT TTT TTTTACGGAT ATTTTAGTAA TAAATATGCT GGATAAATAT TAGTCCAACA -3060 

ATAGCTAAGT TATGCTAATA TCACATTATT ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120 

ATAAACAAGA AATATTGAGT ATCACTATGT GAAGAAAGTT TTGGAAAAGA AACAATGAAG 3180 

ACTGAATTAA ATTAAAAATG TTGCAGCTCA TAAAGAATTG GGACTCACCC CTACTGCACT 3240 

ACCAAATTCA TTTGACTTTG GAGGCAAAAT GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3300 

TCTATAGGAA TATAGTTGGA AATAAATGTG TGTGTGTATA TTATTATTAA TCAATGCAAT 3360 

ATTTAAAATG AAATGAGAAC AAAGAGGAAA ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420 

rAGTTTGTCC TACAATAGAA AAAAGAGAGA' GCTTCCTAGG CCTGGGCTCT TAAATGCTGC 3480 

ATTATAACTG AGTCTATGAG GAAATAGTTC CTGTCCAATT TGTGTAATTT GTTTAAAATT 3540 

GTAAATAAAT TAAACTTTTC TGGTTTCTGT GGGAAGGAAA TAGGGAATCC AATGGAACAG 3600 

TAGCTTTGCT TTGCAGTCTG TTTCAAGATT TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3660 

GAATACTCGC TGCAGCTGGG GTTCCCTGCT TTTTGGTAGC AAGGGTCCAG AGATGAGGTG 3720 

TTTTTTTCGG GGAGCTAATA ACAAAAACAT TTTAAAACTT ACCTTTACTG AAGTTAAATC 3780 

CTCTATTGCT GTTTCTATTC TCTCTTATAG TGACCAACAT CTTTTTAATT TAGATCCAAA 3840 

TAACCATGTC CTCCTAGAGT TTAGAGGCTA GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3900 

GCACCCTGGG GAGATTGATT GTCCTTAAAC CTAAGCCCCA CAAACTTGAC ACCTGATCAG 3960 

GTCTGGGAGC TACAAAATTT CATTTTTCTC CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020 

CTGAATCAAG GAAAGCCAGG CCTTGTGGGC CCCCTTCTTT CGGCTTTCTG CTAAAGCAAC 4080 

ACCTCCAGCA GAGATTCCCT TAAGTGACTC CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140 

AATTTTTAAT CAGTTTGCTT TCTCCAGAGA AATTTTAAAA TAATAGAAGA AATA GAAATT 4200 

TTGAATGTAT AAAAGAAAAA GATCAAGTTG TCATTTTAGA ACAGAGGGAA CTTTGGGAGA 4260 

AAGCAGCCCA AGTAGGTTAT TTGTACAGTC AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 4320 

GGGCAAGGAG AGGCCACAAG GAATATGGGT GGGAGTAAAA GCAACATCGT CTGCTTCATA 4380 

CTTTTTCCTA GGCTTGGCAC TGCCTTTTCC TTTCTCAGGC CAATGGCAAC TGCCATTTGA 4440 

GTCCGGTGAG GGATCAGCCA ACCTCTTCTC TATGGCTCAC CTTATTTGGA GTGAGAAATC 4500 

AAGGAGACAG AGCTGACTGC ATGATGAGTC TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560 

GTTGTGCAGA ACAAACAAGG CATTCATGGG AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 4620 

CTGGGCACTA AGAAGGTCTA TGAATTAAAT GCCTATCTAA AATTCTGATT TATTCCTACA 4680 

TTTTCTGTTT TCTAATTTGA CCCTAAAATC TATGTGTTTT AGACTTAGAC TTTTTATTGC 4740 

CCCCCCCCCC TTTTTTTTTG AGACGGAGTC TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 4800 

GCTCCGATCT CTGCTCACTG AAAGCTCCGC CTCCCGGGTT CATGCCATTC TCCTGCCTCA 4860 

GCCTCCTGAG TAGCTGGGAC TACAGGCGCC CACCACCACG CCCGGCTAAT TTTTTGTATT 4920 

TTTAATAGAG ACGGGGTTTC ACTGTGTTAG CCAGGATGGT CTCGATCTCC TGACCTCGTG 4980 

ATCCGCCTGC CTCGGCCTCC CAAAGTGCTG GGATTACAGG CATGACCCAC CGCTCCCGGC 5040 

CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100 

TGATCATACG AATTGGATCA ATCTTGAAAT ACTCAACCAA AAGACAGTCG AGAAGCCAGG 5160 

GGGAGAAAGA ACTCAGGGCA CAAAATATTG GTCTGAGAAT GGAATTCTCT GTAAGCCTAG 5220 

TTGCTGAAAT TTCCTGCTGT AACCAGAAGC CAGTTTTATC TAACGGCTAC TGAAA CACC C 5280 

ACTGTGTTTT GCTCACTCCC TCACTCACOG ATCAAAACCT GCTACCTCCC CAAGACTTTA 5340 

CTAGTGCCGA TAAACTTTCT CAAAGAGCAA CCAGTATCAC TTCCCTGTTT ATAAAACCTC 5400 

TAACCATCTC TTTGTTCTTT GAACATGCTG AAAACCACCT GGTCTGCATG TATGCCCGAA 5460 

TTTGTAATTC TTTTCTCTCA AATGAAAATT TAATTTTAGG GATTCATTTC TATATTTTCA 5520 

CATATGTAGT ATTATTATTT- CCTTATATGT GTAAGGTGAA ATTTATGGTA TTTGAGTGTG 5 5 BO 

CAAGAAAATA TATTTTTAAA GCTTTCATTT TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640 

TGTAAATATA CAGAATGTTT TTTCTTACTT TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700 

GGGGTTTGTT TTGCAATGTT TTAAACAGAG TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760 

TGCTTTTAAA GAAACTTGGC TGCTTAAAAT AAGCAAAAAT TGGATGCATA AAGTAATATT 5820 

TACAGATGTG GGGAGATGTA ATAAAACAAT ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880 

TAGAGATTAA ATAATTCTAA GATGATCACT TTGCAAAATT ATGCTTATGG CTGGCATGGA 5940 

AAT AGAAATA • CTCAATTATG TCTTTGTTGT ATTAATCGGG AATATTTTGG ACAATGTTTC 6000 

. ATTATCAAAT TGTCGACATC ATTAAT AT AT ATTGTAATGT TGGGAAGAGA TCACTATTTT 6060 

GAAGCACAGC TTTACAGATG AGTATCTATG ATACATATGT ATAATAAATT TTGATCGGGT 6120 

ATTAAAAGTA TTAGAAGGTG GTTATAATTG CAGAGTATTC CATGAATAGT ACACTGACAC 6180 

AGGGGTTTTA CTTTGAGGAC CAGTGTAGTC AAGGGAAAAC ATGAGTTAAA AAGAAAAGCA 6240 

GGCAATATTG CAGTCTTGAT TCTGCCACTT ACAGGATAGA TAATGCCTGA ACTTTAATGA 6300 

CAAGATGATC CAACCATAAA GGTGCTCTGT GCTTCACAGT GAATCTTTTC CCCATGCAGG • 6360 

AGTGTGCTCC CCTACAAACG TTAAGACTGA TCATTTCAAA AATCTATTAG CTATATCAAA 6420 

AGCCTTACAT TTTAATATAG GTTGAACCAA AATTTCAATT CCAGTAACTT CTATTGTAAC 6480 

CATTATTTTT GTGTATGTCT TCAAGAATGT TCATTGGATT TTTGTTTGTA ATAGTAAAAT 6540 

ACCGGATACA TTTCACGTGT CCTTCAGTAT TGATTTGGTT GAATATTGGG TCATAATGGT 6600 

TGAGAAGCAT GGACACTAGA GCCAGAATGC TTGGATATGA ATCCTGGATC TGTCACTTAC 6660 

TTCTGTGTGA CCTTTGAAAG GCTACTTATT TCCTCTCTTA GCTTTCTCAT TAAAATCAAT 6720 

GAACAATGCC AGCCTCATGG GGTTGTTGAA TGATTAAATT AGTTAATATA CCTAAAGTAC 6780 
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. ATAGAACACT GCCTGCACAT AGTAAAAGAA TTATAAGTGT GAGGTAGTTG GTAAAATTAT 
• GTAGTTGGAT ATACTACOGA ACAATATCTA ATCTCTTTTT AGGGAAATAA AGTTTGTGCA 
. T AT AT A TAAT CCOGAAACAT. 6 



PCT/US02/12476 



6840. 
6900 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



Seq ID NO: 32 Protein sequence: 
Protein Accession H:. NP_001932.1 



MAAAGPRRSV 
ADLIRSSDPD 
KTRHTRETVL 
EPLNLFYIER 
PVFTEAIYNF 
TGVITTVSHY 
EAFVEENAFN 
KPLNYEENRQ 
KENLAVGSKI 
KNELYNITVL 
EPVHGAPFYP 
TKLLRVNLCE 
KRFPEDLAQQ 
. MMKGGNQTLE 
DRMPSQDYVL 



11 
I 

RGAVCLHLLL 
FRVLKDGSVY 
RRAKRRWAPI 
DTGNLFCTRP 
EVLESSRPGT 
LDREWDKYS 
VEILRIPIED 
VNLEIGVNNE 
NGYKAYDPEN 
AIDKEDRSCT 
SLPNTSPEIS 
CTHPTQCRAT 
NLIISNTEAP 
SCRGAGHHHT 
TYNYEGRGSP 



21 
I 

TLVIFSRDGE 
TARAVALSDK 
PCSMOENSLG 
VDREEYDVFD 
TVGWCATDR 
LIMKVQDMDG 
KDZiIKTANWR 
APFARDIPRV 
RNGNGTiRYKK 
GTLAVNIEDV 
RLWSLTKVND 
SRSTGVILGK 
GDDRVCSANG 
LDSCRGGHTE 
AGSVGCCSEK 



31 

I 

ACKKVILNVP 
KRSFTIWLSD 
PFPLFLQQVE 
LIAYASTADG 
DEPDTMHTRL 
QFFGLIGTST 
VNFTILKGNE 
TALNRALVTV 
LHDPKGWITI 
NDNPPEILQE 
TAARLSYQKN 
WAILAILLGI 
FMTQTTNNSS 
VDNCRYTYSE 
QEEDGLDPLN 



41 

I 

SKLEADKIIG 
KRKQTQKEVT 
SDAAQNYTVP 
YSADLPLPLP 
KYSILQQTPR 
CIITVTDSKD 
NGHFKISTDK 
HVRDLDEGPE 
DEISGSIITS 
YWICKPKMG 
AGFQEYTIPI 
ALLFSVLLTL 
QGFCGTMGSG 
WHSFTQPRLG 
NLEPKFITLA 



SI - 
I 

RVNLEECFRS 
VLLSHQKKVS 
YSISGRGVDK 
IRVEDENDNH 
SPGLFSVHPS 
NAPTFRQNAY 
ETNEGVLSVV 
CTPAAQYVRI 
KILDREVETP 
YTDILAVDPD 
TVXDRAGQAA 
VOGVFGATKG 
MKNGGQETIE 
EKLKRCNQNE 
EACTKR 



Seq ID NO: 33 DNA sequence 

Nucleic Acid Accession #s Eos sequence 

Coding sequence: 64-2583 



GGCAGGTCTC 
CCGATGGCCG 
CTGACCCTCG 
CCTTCTAAAC 
TCTGCAGACC 
TACACAGCCA 
GACAAAAGGA 
TCGAAGACAA 
ATTCCTTGCT 
GAATCTGATG 
AAAGAACCTT 
CCTGTGGATC 
GGATATTCAG 
CACCCTGTTT 
ACTACAGTGG 
CTGAAATACA 
AGCACAGGCG. 

. TCATTGATAA 
ACTTGTATCA* 
TATGAAGCAT 
GATAAGGATT 
GAAAATGGAC 
GTAAAGCCAC 
GAAGCGCCAT 
GTTCATGTGA 
ATTAAAGAAA 
AATAGAAATG 
ATTGATGAAA 
CCCAAAAATG 
ACTGGAACAC 
GAATATGTAG 
GATGAACCTG 
AGTAGACTGT 

. AATGCTGGAT 
GCAACAAAAT 
ACTTCAAGGA 
ATAGCACTGC 
GGGAAACGTT 
CCTGGAGACG 
AGCCAAGGTT 
GAAATGATGA 
ACGCTGGACT 
GAGTGGCACA 
TAAAAATTAA 
CCAAGATTAT 
CTGCTGCAGT 
ATTTATTACA 

TTTGTCAGAC 
TGTATATGAT 
AGCCAGTTGT 
TCTCAAACTC 
GATATTTTAG 
ATATCACATT 
AGTATCACTA 
ATGTTGCAGC 



11 
I 

GCTCTCGGCA 

CCGCTGGGCC 

TGATCTTCAG 

TAGAGGCAGA 

TCATCCGGTC 

GGGCTGTTGC 

AACAGACACA 

GACACACTAG 

CTATGCAAGA 

CAGCACAGAA 

TAAATTTGTT 

GTGAAGAATA 

CAGATCTGCC 

TCACAGAAGC 

GGGTGGTTTG 

GCATTTTGCA 

TAATCACCAC 

TGAAAGTACA 

TAACAGTAAC 

TTGTAGAGGA 

TAATTAACAC 

ATTTCAAAAT 

TGAATTATGA 

TTGCTAGAGA 

GGGATCTGGA 

ACTTAGCAGT 

GCAATGGTTT 

TTTCAGGGTC 

AGTTGTATAA 

TTGCTGTGAA 

TCATTTGCAA 

TCCATGGAGC 

GGAGCCTCAC 

TTCAAGAATA 

TATTGAGAGT 

GTACAGGAGT 

TCTTTTCTGT 

TTCCTGAAGA 

ATAGAGTGTG 

TTTGTGGTAC 

AAGGAGGAAA 

CCTGCAGGGG 

GTTTTACTCA 

ACATAAAAGA 

GTCCTCACTT 

GAAAAGCAGG 

TTAGCAGAAG 

ATTCTGGAGG 

GATTTTTTTC 

TGCTTATCTT 

CAGCACTGGA 

TAATAAATAT " 

ATTATGTATT 

TGTGAAGAAA 

TCATAAAGAA 



21 
I 

CCCTCCCGGC 
CCGGOGCTCC 
TCGTGATGGT 
CAAAATAATT 
AAGTGATCCT 
GCTGTCTGAT 
GAAAGAGGTT 
AGAAACTGTT 
GAATTCCTTG 
CTATACTGTC 
TTATATAGAA 
TGATGTTTTT 
CCTCCCACTA 
AATTTATAAT 
TGCCACAGAC 
GCAGACACCA 
AGTCTCTCAT 
AGACATGGAT 
AGATTCAAAT 
AAATGCATTC 
TGCCAATTGG 
CAGCACAGAC 
AGAAAACCGT 
TATTCCCAGA 
TGAGGGGCCT 
GGGGTCAAAG 
AAGGTACAAA 
AATCATAACT 
TATTACAGTC 
CATTGAAGAT 
ACCAAAAATG 
TCCATTTTAT 
CAAAGTTAAT 
TACCATTCCT 
TAATCTGTGT 
AATACTTGGA 
ATTGCTAACT 
TTTAGCACAG 
CTCTGCCAAT 
TATGGGATCA 
CCAGACCTTG 
AGGACACACG 
ACCCCGTCTC 
AATTGCATCG 
ATAACTATGA 
AAGAAGATGG 
CATGCACAAA 
TTTCCAAAAA 
TCAATTTTGA 
TTCCAAAAAG 
ATTAAGGTCT 
GCTGGATAAA 
CACTTTAAGT 
GTTTTGGAAA 
TTGGGACTCA 



31 
I 

GCCOGCGTTC 
GTGCGCGGAG 
GAAGCCTGCA 
GGCAGAGTTA 
GATTTCAGAG 
AAGAAAAGAT 
ACTGTGCTGC 
CTCAGGCGTG 
GGCCCTTTCC 
TTCTACTCAA 
AGAGACACTG 
GATTTGATTG 
CCCATCAGGG 
TTTGAAGTTT 
AGAGATGAAC 
AGGTCACCTG 
TATTTGGACA 
GGCCAGTTTT 
GATAATGCAC 
AATGTGGAAA 
AGAGTCAATT 
AAAGAAACTA 
CAAGTGAACC 
GTGACAGCCT 
GAATGCACTC 
ATCAACGGCT 
AAATTGCATG 
TCCAAAATCC 
CTGGCAATAG 
GTAAATGATA 
GGGTATACCG 
TTCAGTTTGC 
GATACAGCTG 
ATTACTGTAA 
GAATGTACTC 
AAATGGGCAA 
TTAGTATGTG 
CAAAACTTAA 
GGATTTATGA 
GGAATGAAAA 
GAATCCTGCC 
GAGGTGGACA 
GGTGAAGAAT 
ATGTAATCAG 
GGGAAGAGGA 
CCTTGACTTT 
GAGATAATGT 
TAATATTGTA 
ATTATGCTAC 
TGAAAAATGT 
CTAAAGCATC 
TATTAGTCCA 
GATAGTTTAA 
AGAAACAATG 
CCCCTACTGC 



41 

I 

TCCTGGCCCT 
CCGTCTGCCT 
AAAAGGTGAT 
ATTTGGAAGA 
TTCTAAATGA 
CATTTACCAT 
TAGAACATCA 
CCAAGAGGAG 
CATTGTTTCT 
TAAGTGGACG 
GAAATCTATT 
CTTATGCGTC 
TAGAGGATGA 
TGGAAAGTAG 
CGGACACAAT 
GGCTCTTTTC 
GAGAGGTTGT 
TTGGATTGAT 
CCACTTTCAG 
TCTTACGAAT 
TTACCATTTT 
ATGAAGGTGT 
TGGAAATTGG 
TGAACAGAGC 
CTGCAGCCCA 
ATAAGGCATA 
ATCCTAAAGG 
TGGATAGGGA 
ACAAAGATGA 
ATCCACCAGA 
ACATTTTAGC 
CCAATACTTC 
CCCGTCTTTC 
AAGACAGGGC 
ATCCAACTCA 
TCCTTGCAAT 
GAGTTTTTGG 
TTATATCAAA 
CCCAAACTAC 
ATGGAGGGCA 
GGGGGGCTGG 
ACTGCAGATA 
CCATTAGAGG 
AATGAAGACC 
TCTCCAGCTG 
TTAAATAATT 
CACAGTGCTA 
AAGTTCAATT 
TCACCAATTT 
TAAAACAGAC 
TGCTCTTTTT 
ACAATAGCTA 
AAAATAAACA 
AAGACTGAAT 
ACTACCAAAT 



51 
I 

GCCCGGCATC 
GCATCTGCTG 
ACTTAATGTA 
GTGCTTCAGG 
TGGGTCAGTG 
ATGGCTTTCT 
GAAGAAGGTA 
ATGGGCACCT 
TCAACAAGTT 
TGGAGTTGAT 
TTGCACTCGG 
AACTGCAGAT 
AAATGACAAC 
TAGACCTGGT 
GCATACGCGC 
TGTGCATCCC 
AGACAAGTAC 
AGGCACATCA 
ACAAAATGCT 
ACCTATAGAA 
AAAGGGAAAT 
TCTTTCTGTT 
AGTAAACAAT 
CTTGGTTACA 
ATATGTGCGG 
TGACCCCGAA 
TTGGATCACC 
GGTTGAAACT 
TAGATCATGT 
AATACTTCAA 
TGTTGATCCT 
TCCAGAAATC 
ATATCAGAAA 
CGGCCAAGCT 
GTGTCGTGCG 
ATTACTGGGT 
TGCAACTAAA 
CACAGAAGCA 
CAACAACTCT 
GGAAACCATT 
GCATCATCAT 
CACTTACTCG 
ACACACTGGT 
GCATGCCATC 
GTTCTGTGGG 
TGGAACCCAA 
CAATTAGGTC 
TCAACATGTA 
ATATTTTTAA 
AACTGGTAAA 
TTTTTTTACG 
AGTTATGCTA 
AGAAATATTG 
TAAATTAAAA 
TCATTTGACT 



60 
120 
190 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 



60 
120 
180 
240 
300 
360 
. 420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3i80 
3240 
3300 
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TTGGAGGCAA AATGTGTTGA AGTGCCCTAT GAAGTAGCAA TTTTCTATAG GAATATAGTT .3360 
GGAAATAAAT GTGTGTGTGT * ATATTATT AT . TAATCAATGC AATATTTAAA ATGAAATGAG. 3420 
AACAAAGAGG AAAATGGTAA AAACTTGAAA TGAGGCTGGG GTATAGTTTG TCCTACAATA . 3480 

GAAAAAAGAG AGAGCTTCCT A6GCCTGGGC TCTTAAATGC TGCATTATAA CTGAGTCTAT 3540 

GAGGAAATAG TTCCTGTCCA ATTTGTGTAA TTTGTTTAAA ATTGTA AATA AATTAAACTT 3600 

TTCTGGTTTC TGTGGGAAGG AAATAGGGAA rCCAATGGAA CAGTAGCTTT GCTTTGCAGT 3660 

CTGTTTCAAG ATTTCTGCAT CCACAAGTTA GTAGCAAACT GG GGAATACT CGCTGCAGCT 3720 
GGGGTTCCCT GCTTTTTGGT AGCAAGGGTC CAGAGATGAG GTGTTTTTTT CGGGGAGCTA. 3780 

ATAACAAAAA CATTTTAAAA CTTACCTTTA CTGAAGTTAA ATCCTCTATT GCTGTTTCTA 3840 

TTCTCTCTTA TAGTGACCAA CATCTTTTTA ATTTAGATCC AAATAACCAT GTCCTCCTAG 3900 

AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GATCTTACTG AAAGCACCCT GGGGAGATTG 3960 

ATTGTCCTTA AACCTAAGCC CCACAAACTT GACACCTGAT CAGGTCTGGG AGCTACAAAA 4020 

TTTCATTTTT CTCCTCACTG CC CT TC TT CT GAGTGGCATT GGCCTGAATC AAGGAAAGCC 4080 

AGGCCTTGTG GGCCCCCTTC TTTCGGCTTT CTGCTAAAGC AACACCTCCA GCAGAGATTC 4140 

CCTTAAGTGA CTCCAGGTTT TCCACCATCC TTCAGOGTGA ATTAATTTTT AATCAGTTTG 4200 

CTTTCTCCAG AGAAATTTTA AAATAATAGA AGAAATAGAA ATTTTGAATG TATAAAAGAA 4260 

AAAGATCAAG TTGTCATTTT AGAACAGAGG GAACTTTGGG AGAAAGCAGC CCAAGTAGGT 4320 

TATTTGTACA GTCAGAGGGC AACAGGAAGA TGCAGGCCTT CAAGGGCAAG GAGAGGCCAC 4380 

AAGGAATATG GGTGGGAGTA AAAGCAACAT CGTCTGCTTC ATACTTTTTC CTAGGCTTGG 4440 

CACTGCCTTT TCCTTTCTCA GGCCAATGGC AACTGCCATT TGAGTCCGGT GAGGGATCAG 4500 

CCAACCTCTT CTCTATGGCT CACCTTATTT GGAGTGAGAA ATCAAGGAGA CAGAGCTGAC 4560 

TGCATGATGA GTCTGAAGGC ATTTGCAGGA TGAGCCTGAA CTGGTTGTGC AGAACAAACA 4 620 

AGGCATTCAT GGGAATTGTT GTATTCCTTC TGCAGCCCTC CTTCTGGGCA CTAAGAAGGT 4680 

CTATGAATTA AATGCCTATC TAAAATTCTG ATTTATTCCT ACATTTTCTG TTTTCTAATT 4740 

TGACCCTAAA ATCTATGTGT TTTAGACTTA GACTTTTTAT TGCCCCCCCC CCCTTTTTTT 4800 

TTGAGACGGA GTCTCGCTCT GACGCACAGG CTGGAGTGCA GTGGCTCCGA TCTCTGCTCA 4B60 

CTGAAAGCTC CGCCTCCCGG GTTCATGCCA TTCTCCTGCC TCAGCCTCCT GAGTAGCTGG 4920 

GACTACAGGC GCCCACCACC ACGCCCGGCT AATTTTTTGT ATTTTTAATA GAGACGGGGT 4980 

TTCACTGTGT TAGCCAGGAT GGTCTCGATC TCCTGACCTC GTGATCCGCC TGCCTCGGCC 5040 

TCCCAAAGTG CTGGGATTAC AGGCATGACC CACCGCTCCC GGCCTTGTTT TCCGTTTAAA 5100 

GTCGTCTTCT TTTAATGTAA TCATTTTGAA CATGTGTGAA AGTTGATCAT ACGAATTGGA 5160 

TCAATCTTGA AATACTCAAC CAAAAGACAG TCGAGAAGCC AGGGGGAGAA AGAACTCAGG 5220 

GCACAAAATA TTGGTCTGAG AATGGAATTC TCTGTAAGCC TAGTTGCTGA AATTTCCTGC 5280 

TGTAACCAGA AGCCAGTTTT ATCTAACGGC TACTGAAACA CCCACTGTGT TTTGCTCACT 5340 

CCCACTCACC GATCAAAACC TGCTACCTCC CCAAGACTTT ACTAGTGCCG ATAAACTTTC 5400 

TCAAAGAGCA ACCAGTATCA CTTCCCTGTT TATAAAACCT CTAACCATCT CTTTGTTCTT 5460 

TGAACATGCT GAAAACCACC TGGTCTGCAT GTATGCCCGA ATTTGTAATT CTTTTCTCTC 5520 

AAATGAAAAT TTAATTTTAG GGATTCATTT CTATATTTTC ACATATGTAG TATTATTATT 5580 

TCCTTATATG TGTAAGGTGA AATTTATGGT ATTTGAGTGT GCAAGAAAAT ATATTTTTAA 5640 

AGCTTTCATT TTTCCCCCAG TGAATGATTT AGAATTTTTT ATGTAAATAT ACAGAATGTT 5700 

TTTTCTTACT TTTATAAGGA AGCAGCTGTC TAAAATGCAG TGGGGTTTGT TTTGCAATGT 5760 

TTTAAACAGA GTTTTAGTAT TGCTATTAAA AGAAGTTACT TTGCTTTTAA AGAAACTTGG 5820 

CTGCTTAAAA TAAGCAAAAA TTGGATGCAT AAAGTAATAT TTACAGATGT GGGGAGATGT 5880 

AATAAAACAA TATTAACTTG GCTGCTTAAA ATAAGCAAAA ATTGGATGCA TAAAGTAATA 5940 

TTTACAGATG TGGGGAGATG TAATAAAACA ATATTAACTT GGTTTCTTGT TTTTGCTGTA 6000 

TTTAGAGATT AAATAATTCT AAGATGATCA CTTTGCAAAA TTATGCTTAT GGCTGGCATG 5060 

GAAATAGAAA TACTCAATTA TGTCTTTGTT GTATTAATGG GGAATATTTT GGACAATGTT 6120 

TCATTATCAA ATTGTCGACA TCATTAATAT ATATTGTAAT GTTGGGAAGA GATCACTATT 6180 

TTGAAGCACA GCTTTACAGA TGAGTATCTA TGATACATAT GTATAATAAA TTTTGATCGG 6240 

GTATTAAAAG TATTAGAAGG TGGTTATAAT TGCAGAGTAT TCCATGAATA GTACACTGAC 6300 

ACAGGGGTTT TACTTTGAGG ACCAGTGTAG TCAAGGGAAA ACATGAGTTA AAAAGAAAAG 6360 

CAGGCAATAT TGCAGTCTTG ATTCTGCCAC TTACAGGATA GATAATGCCT GAACTTTAAT 6420 

GACAAGATGA TCCAACCATA AAGGTGCTCT GTGCTTCACA GTGAATCTTT TCCCCATGCA ,6480 

GGAGTGTGCT CCCCTACAAA CGTTAAGACT GATCATTTCA AAAATCTATT AGCTATATCA S540 

AAAGCCTTAC ATTTTAATAT AGGTTGAACC AAAATTTCAA TTCCAGTAAC. TTCTATTGTA 6600 

ACCATTATTT TTGTGTATGT CTTCAAGAAT GTTCATTGGA TTTTTGTTTG TAATAGTAAA 6660 

ATACCGGATA CATTTCACGT GTCCTTCAGT ATTGATTTGG TTGAATATTG GGTCATAATG 6720 

GTTGAGAAGC ATGGACACTA GAGCCAGAAT GCTTGGATAT GAATCCTGGA TCTGTCACTT 6780 

ACTTCTGTGT GACCTTTGAA AGGCTACTTA TTTCCTCTCT TAGCTTTCTC ATTAAAATCA 6840 

ATGAACAATG CCAGCCTCAT GGGGTTGTTG AATGATTAAA TTAGTTAATA TACCTAAAGT 6900 

ACATAGAACA CTGCCTGCAC ATAGTAAAAG AATTATAAGT GTGAGGTAGT TGGTAAAATT 6960 

• ATGTAGTTGG ATATACTACC GAACAATATC TAATCTCTTT TTAGGGAAAT AAAGTTTGTG 7020 
CATATATATA ATCCCGAAAC ATG 

Seq ID NO: 34 Protein sequence j 
Protein Accession #: NP_077741.1 

1 - 11 21 31 41 51 

I I I I I I 

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGB ACKKVILNVP SKLEADKIIG RVNLEECFRS ■ 60 

ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLIjEHQKKVS 120 

KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 180 

EPIiNLFYIER DTGNLFCTRP VDREBVDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 240 

PVFTEAIYNF EVLESSRPGT TVGWCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHP3 300 

TGVITTVSHY LDREWDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 360 

EAFVEEKAFN VEILRIPIED KDLINTANWR VNPTILKGNE NGHFKISTDK ETNEGVLSW 420 

KPLNYBENRQ VNLEIGVKNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480 

KENLAVG9KI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEI SGS I ITS KILDREVETP . 540 

KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YVVICKPKMG YTDILAVDPD 600 

EPyHGAPFYP SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660 

TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720 

KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKKGGQETIE . 780 
MMKGGNQTLB SCRGAGHHHT LDSCRGGHTE VDMCRYTYSE WHSFTQPRLG EESIRGHTG 

Seq ID NO: 35 DNA sequence 

Nucleic Acid Accession # : Eos sequence 

Coding sequence: 146-1273- 
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1 II * ■/ -21 .31 41 51 

I I : •-..-! • I I I , 

GGGAGTGGGC GTGGCGGTGC TGCCCAGGTG " AGCCACCGCT GCTTCTGCCC AGACACGGTC 60 

GCCTCCACAT CCAGGTCTTT GTGCTCCTCQ CTTGCCTGTT CCTTTTCCAC GCATTTTCCA ,120 

GGATAACTGT GACTCCAGGC CCGCAATGGA TGCCCTGCAA CTAGCAAATT CGGCTTTTGC ISO 

CGTTGATCTG TTCAAACAAC TATGTGAAAA GGAGCCACTG GGCAATGTCC TCTTCTCTCC 240 

AATCTGTCTC TCCACCTCTC TGTCACTTGC . TCAAGTGGGT GCTAAAGGTG ACACTGCAAA 300 

TGAAATTGGA CAGGTTCTTC ATTTTGAAAA TGTCAAAGAT ATACCCTTTG GATTTCAAAC 360 

AGTAACATCG GATGTAAACA AACTTAGTTC CTTTTACTCA CTGAAACTAA TCAAGCGGCT 420 

CTACGTAGAC AAATCTCTGA ATCTTTCTAC AGAGTTCATC AGCTCTACGA AGAGACCCTA 480 

TGCAAAGGAA TTGGAAACTG * TTGACTTCAA AGATAAATTG GAAGAAACGA AAGGTCAGAT 540 

CAACAACTCA ATTAAGGATC TCACAGATGG CCACTTTGAG AACATTTTAG CTGACAACAG 600 

TGTGAACGAC CAGACCAAAA TCCTTGTGGT TAATGCTGCC TACTTTGTTG GCAAGTGGAT 660 

GAAGAAATTT CCTGAATCAG AAACAAAAGA ATGTCCTTTC AGACTCAACA AGACAGACAC 720 

CAAACCAGTG CAGATGATGA ACATGGAGGC CACGTTCTGT ATGGGAAACA TTGACAGTAT 780 

CAATTGTAAG ATCATAGAGC , TTCCTTTTCA AAATAAGCAT CTCAGCATGT TCATCCTACT ' B40 

ACCCAAGGAT GTGGAGGATG AGTCCACAGG CTTGGAGAAG ATTGAAAAAC AACTCAACTC 900 

AGAGTCACTG TCACAGTGGA. CTAATCCCAG CACCATGGCC AATGCCAAGG TCAAACTCTC 960 

CATTCCAAAA TTTAAGGTGG AAAAGATGAT TGATCCCAAG GCTTGTCTGG AAAATCTAGG 1020 

GCTGAAACAT ATCTTCAGTG AAGACACATC TGATTTCTCT GGAATGTCAG AGACCAAGGG 1080 

AGTGGCCCTA TCAAATGTTA TCCACAAAGT GTGCTTAGAA ATAACTGAAG ATGGTGGGGA 1140 

TTCCATAGAG . GTGCCAGGAG CACGGATCCT GCAGCACAAG GATGAATTGA ATGCTGACCA 1200 

TCCCTTTATT TACATCATCA GGCACAACAA AACTCGAAAC ATCATTTTCT TTGGCAAATT 1260 

CTGTTCTCCT TAAGTGGCAT AGCCCATGTT AAGTCCTCCC TGACTTTTCT GTGGATGCCG 1320 

ATTTCTGTAA ACTCTGCATC CAGAGATTCA TTTTCTAGAT ACAATAAATT GCTAATGTTG 1380 

CTGGATCAGG AAGCCGCCAG TACTTGTCAT ATGTAGCCTT CACACAGATA GACCTTTTTT 1440 

TTTTTCCAAT TCTATCTTTT GTTTCCTTTT TTCCCATAAG ACAATGACAT ACGCTTTTAA 1500 

TGAAAAGGAA TCACGTTAGA GGAAAAATAT TTATTCATTA TTTGTCAAAT TGTCCGGGGT 1560 

AGTTGGCAGA AATACAGTCT TCCACAAAGA AAATTCCTAT AAGGAAGATT TGGAAGCTCT 1620 

TCTTCCCAGC ACTATGCTTT CCTTCTTTCG GATAGAGAAT GTTCCAGACA TTCTCGCTTC 1680 

CCTGAAAGAC TGAAGAAAGT GTAGTGCATG GGACCCACGA AACTGCCCTG GCTCCAGTGA 1740 

AACTTGGGCA CATGCTCAGG CTACTATAGG TCCAGAAGTC CTTATGTTAA GCCCTGGCAG 1800 

GCAGGTGTTT ATTAAAATTC TGAATTTTGG GGATTTTCAA AAGATAATAT TTTACATACA 1860 

CTGTATGTTA TAGAACTTCA TGGATCAGAT CTGGGGCAGC, AACCTATAAA TCAACACCTT. 1920 

AATATGCTGC AACAAAATGT AGAATATTCA GACAAAATOG ATACATAAAG ACTAAGTAGC 1980 

CCATAAGGGG TCAAAATTTG CTGCCAAATG CGTATGCCAC CAACTTACAA AAACACTTCG 2040 

TTCGCAGAGC TTTTCAGATT GTGGAATGTT GGATAAGGAA TTATAGACCT CTAGTAGCTG 2100 

AAATGCAAGA CCCCAAGAGG- AAGTTCAGAT CTTAATATAA ATTCACTTTC ATTTTTGATA 2160 

GCTGTCCCAT CTGGTCATGT GGTTGGCACT AGACTGGTGG CAGGGGCTTC TAGCTGACTC 2220 

GCACAGGGAT TCTCACAATA GCCGATATCA GAATTTGTGT TGAAGGAACT TGTCTCTTCA 2280 

TCTAATATGA TAGCGGGAAA AGGAGAGGAA ACTACTGCCT TTAGAAAATA TAAGTAAAGT 2340 

GATTAAAGTG CTCACGTTAC CTTGACACAT AGTTTTTCAG TCTATGGGTT TAGTTACTTT 2400 

AGATGGCAAG CATGTAACTT ATATTAATAG TAATTTGTAA AGTTGGGTGG ATAAGCTATC 2460 

CCTGTTGCCG GTTCATGGAT TACTTCTCTA TAAAAAATAT ATATTTACCA AAAAATTTTG 2520 

TGACATTCCT TCTCCCATCT CTTCCTTGAC ATGCATTGTA AATAGGTTCT TCTTGTTCTG 2580 
AGATTCAAT A TTGAATTTCT. CCTATGCTAT TGACAATAAA ATATTATTGA ACTACC 

Seq ID NO: 36 Protein sequence : 
Protein Accession #: NP_002 630.1 

1 11 21 31 41 51 

'I I -I I I I 

MDALQLANSA FAVDLPKQLC EKEPLGNVLF SPICLSTSLS LAQVGAKGDT AKEIGQVLHF 60 

ENVKDIPFGF QTVTSDVNKL SSFYSLKLIK RLYVDKSLNL STEFISSTKR PYAKELETVD 120 

FKDKLEETKG QINNSIKDLT DGHFENILAD NSVNDQTKZL WNAAYFVGK WMKKFPESET 180 

KECPFRLNKT DTKPVQMMNM EATFCMGNID SINCKIIELP FQNKHLSMFI LLPKDVEDES 240 

TGLEKIEKQL NSESLSQWTN PSTMANAKVK LSIPKFKVEK MIDPKACLEN LGLKHIFSED 300 

TSDFSGMSET KGVALSNVIH KVCLEITEDG GDSIEVPGAR ILQHKDELNA DHPFIYIIRH 360 
NKTRNIIFFG KFCSP . 



Seq ID NO: 37 DNA sequence 

Nucleic Acid Accession #: NM_016B583 

Coding sequence: 72-842 

1 11 21 31 41 51 

III I I I 

GGAGTGGGGG AGAGAGAGGA GACCAGGACA GCTGCTGAGA CCTCTAAGAA GTCCAGATAC 60 

TAAGAGCAAA GATGTTTCAA ACTGGGGGCC TCATTGTCTT CTACGGGCTG TTAGCCCAGA 120 

CCATGGCCCA GTTTGGAGGC CTGCCCGTGC CCCTGGACCA GACCCTGCCC TTGAATGTGA 180 

ATCCAGCCCT GCCCTTGAGT CCCACAGGTC TTGCAGGAAG CTTGACAAAT GCCCTCAGCA 240 

ATGGCCTGCT GTCTGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTGGACATCC 300 

TGAAGCCTGG AGGAGGTACT TCTGGTGGCC TCCTTGGGGG ACTGCTTGGA AAAGTGACGT 360 

CAGTGATTCC TGGCCTGAAC AACATCATTG ACATAAAGGT CACTGACCCC CAGCTGCTGG . 420 

AACTTGGCCT TGTGCAGAGC CCTGATGGCC ACCGTCTCTA TGTCACCATC CCTCTCGGCA 480 

TAAAGCTCCA AGTGAATACG CCCCTGGTCG GTGCAAGTCT GTTGAGGCTG GCTGTGAAGC 540 

TGGAGATCAC TGCAGAAATC TTAGCTGTGA GAGATAAGCA GGAGAGGATC CACCTGGTCC 600 

TTGGTGACTG CACCCATTCC CCTGGAAGCC TGCAAATTTC TCTGCTTGAT GGACTTGGCC 660 

CCCTCCCCAT TCAAGGTCTT CTGGACAGCC TCACAGGGAT CTTGAATAAA GTCCTGCCTG 720 

AGTTGGTTCA GGGCAACGTG TGCCCTCTGG TCAATGAGGT TCTCAGAGGC TTGGACATCA 780 

CCCTGGTGCA TGACATTGTT AACATGCTGA TCCACGGACT ACAGTTTGTC ATCAAGGTCT 840 

AAGCCTTCCA GGAAGGGGCT .GGCCTCTGCT GAGCTGCTTC CCAGTGCTCA CAGATGGCTG 900 

GCCCATGTGC TGGAAGATGA CACAGTTGCC TTCTCTCCGA GGAACCTGCC CCCTCTCCTT 960 

TCCCACCAGG CGTGTGTAAC ATCCCATGTG CCTCACCTAA TAAAATGGCT CTTCTTCTGC 1020 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA . . 
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Seq ID" NO: 38 Protein sequence: 
Protein Accession ft: NP_057667 

1 11 21 31 41 51 

MFQTGGLIVP YGLLAQTMAQ FGGLPVPLDQ TLPLNVNPAL PLSPTGLAGS LTNALSNGLL 60 

SGGLLGILEN LPLLDILKPG GGTSGGLLGG LLGKVTSVIP GLNNIIDIKV TDPQLLELGL 120 

VQSPDGHRLY VTIPLGIKLQ VNTPLVGASL LRLAVKLDIT AEILAVRDKQ ERIHLVLGDC 180 

THSPGSLQIS LLDGLGPLPI QGLLDSLTGI LNKVLPELVQ GNVCPLVNEV LRGLDITLVH 240 
DIVNMLIHGI* QFVIKV 

Seq ID- NO t 39 DNA sequence 

Nucleic Acid Accession's: NM_004363.1 

Coding sequence: 115*2223 

1 11 21 31 . 41 51 

CTCAGGGCAG AGGGAGGAAG GACAGCAGAC CAGACAGTCA CAGCAGCCTT GACAAAACGT 60 

TCCTGGAACT CAAGCTCTTC TCCACAGAGG AGGACAGAGC AGACAGCAGA GACCATGGAG 120 

TCTCCCTCGG CCCCTCCCCA CAG ATGGTG C ATCCCCTGGC AGAGGCTCCT GCTCACAGCC 180 

TCACTTCTAA CCTTCTGGAA CCCGCCCACC ACTGCCAAGC TCACTATTGA ATCCACGCCG 240 

TTCAATGTCG CAGAGGGGAA GGAGGTGCTT CTACTTGTCC ACAATCTGCC CCAGCATCTT 300 

TTTGGCTACA GCTGGTACAA AGGTGAAAGA GTGGATGGCA ACCGTCAAAT TATAGGATAT 360 

GTAATAGGAA CTCAACAAGC TACCCCAGGG CCCGCATACA GTGGTCGAGA GATAATATAC 420 

CCCAATGCAT CCCTGCTGAT CCAGAACATC ATCCAGAATG ACACAGGATT CTACACCCTA 480 

CACGTCATAA AGTCAGATCT TGTGAATGAA GAAGCAACTG GCCAGTTCCG GGTATACCCG 540 

GAGCTGCCCA AGCCCTCCAT CTCCAGCAAC AACTCCAAAC CCGTGGAGGA CAAGGATGCT 600 

GTGGCCTTCA* CCTGTGAACC TGAGACTCAG GACGCAACCT ACCTGTGGTG GGTAAACAAT 660 

CAGAGCCTCC CGGTCAGTCC CAGGCTGCAG CTGTCCAATG GCAACAGGAC CCTCACTCTA 720 

TTCAATGTCA CAAGAAATGA CACAGCAAGC TACAAATGTG AAACCCAGAA CCCAGTGAGT 780 

GCCAGGCGCA GTGATTCAGT CATCCTGAAT GTCCTCTATG GCCCGGATGC CCCCACCATT 840 

TCCCCTCTAA ACACATCTTA CAGATCAGGG GAAAATCTGA ACCTCTCCTG CCACGCAGCC 900 

TCTAACCCAC CTGCACAGTA CTCTTGGTTT GTCAATGGGA CTTTCCAGCA ATCCACCCAA 960 

GAGCTCTTTA TCCCCAACAT CACTGTGAAT AATAGTGGAT CCTATACGTG CCAAGCCCAT 1020 

AACTCAGACA CTGGCCTCAA TAGGACCACA GTCACGACGA TCACAGTCTA TGCAGAGCCA 1080 

CCCAAACCCT TCATCACCAG CAACAACTCC AACCCCGTGG AGGATGAGGA TGCTGTAGCC 1140 

TTAACCTGTG AACCTGAGAT TCAGAACACA ACCTACCTGT GGTGGGTAAA TAATCAGAGC 1200 

CTCCCGGTCA GTCCCAGGCT GCAGCTGTCC AATGACAACA GGACCCTCAC TCTACTCAGT 1260 

GTCACAAGGA ATGATGTAGG ACCCTATGAG TGTGGAATCC AGAACGAATT AAGTGTTGAC 1320 

CACAGCGACC CAGTCATCCT GAATGTCCTC TATGGCCCAG AOGACCCCAC CATTTCCCCC 1380 

TCATACACCT ATTACCGTCC AGGGGTGAAC CTCAGCCTCT CCTGCCATGC AGCCTCTAAC 1440 

CCACCTGCAC AGTATTCTTG GCTGATTGAT GGGAACATCC AGCAACACAC ACAAGAGCTC 1500 

TTTATCTCCA ACATCACTGA GAAGAACAGC GGACTCTATA CCTGCCAGGC CAATAACTCA 1560 

GCCAGTGGCC ACAGCAGGAC TACAGTCAAG ACAATCACAG TCTCTGCGGA GCTGCCCAAG 1620 

CCCTCCATCT CCAGCAACAA CTCCAAACCC GTGGAGGACA AGGATGCTGT GGCCTTCACC 1680 

TGTGAACCTG AGGCTCAGAA CACAACCTAC CTGTGGTGGG TAAATGGTCA GAGCCTCCCA 1740 

GTCAGTCCCA GGCTGCAGCT GTCCAATGGC AACAGGACCC TCACTCTATT CAATGTCACA 1800 

AGAAATGACG CAAGAGCCTA TGTATGTGGA ATCCAGAACT CAGTGAGTGC AAACCGCAGT 1860 

GACCCAGTCA CCCTGGATGT CCTCTATGGG CCGGACACCC CCATCATTTC CCCCCCAGAC 1920 

TCGTCTTACC TTTCGGGAGC GAACCTCAAC CTCTCCTGCC ACTCGGCCTC TAACCCATCC 19 B0 

CCGCAGTATT CTTGGCGTAT CAATGGGATA CCGCAGCAAC ACACACAAGT TCTCTTTATC 2040 

GCCAAAATCA CGCCAAATAA TAACGGGACC TATGCCTGTT TTGTCTCTAA CTTGGCTACT 2100 

GGCCGCAATA ATTCCATAGT CAAGAGCATC ACAGTCTCTG CATCTGGAAC TTCTCCTGGT 2160 

CTCTCAGCTG GGGCCACTGT CGGCATCATG ATTGGAGTGC TGGTTGGGGT TGCTCTGATA 2220 

TAGCAGCCCT GGTGTAGTTT CTTCATTTCA GGAAGACTGA CAGTTGTTTT GCTTCTTCCT 2280 

TAAAGCATTT GCAACAGCTA CAGTCTAAAA TTGCTTCTTT ACCAAGGATA TTTACAGAAA 2340 

AGACTCTGAC CAGAGATCGA GACCATCCTA GCCAACATCG TGAAACCCCA TCTCTACTAA 2400 

AAATACAAAA ATGAGCTGGG CTTGGTGGCG CGCACCTGTA GTCCCAGTTA CTCGGGAGGC 2460 

TGAGGCAGGA GAATCGCTTG AACCCGGGAG GTGGAGATTG CAGTGAGCCC AGATCGCACC 2520 

ACTGCACTCC AGTCTGGCAA CAGAGCAAGA CTCCATCTCA AAAAGAAAAG AAAAGAAGAC 2580 

TCTGACCTGT ACTCTTGAAT ACAAGTTTCT GATACCACTG CACTGTCTGA GAATTTCCAA 2640 

AACTTTAATG AACTAACTGA CAGCTTCATG AAACTGTCCA CCAAGATCAA GCAGAGAAAA 2700 

TAATTAATTT CATGGGACTA AATGAACTAA TGAGGATTGC TGATTCTTTA AATGTCTTGT 2760 

TTCCCAGATT TCAGGAAACT TTTTTTCTTT TAAGCTATCC ACTCTTACAG CAATTTGATA 2820 

AAATATACTT TTGTGAACAA AAATTGAGAC ATTTACATTT TCTCCCTATG TGGTCGCTCC 2B80 

AGACTTGGGA AACTATTCAT GAATATTTAT ATTGTATGGT AATATAGTTA TTGCACAAGT 2940 
TCAATAAAAA TCTGCTCTTT GTATAACAGA AAAA 

Seq ID NO: 40 Protein sequence; 
Protein Accession #: NP_004354.1 

1 11 
1 I 



21 


31 


41 


51 
1 




1 

TASLLTFWNP 


1 

PTTAKLTIES 


1 

TPFNVAEGKE 


VLLLVHNLPQ 


60 


GYVIGTQQAT 


PGPAYSGREI 


IYPNASLLIQ 


NIIQNDTGFY 


120 


YPEI*PKPSIS 


SNNSKPVEDK 


DAVAPTCEPE 


TQDATYLWWV 


180 


TLFNVTRNDT 


ASYKCETQNP 


VSARRSDSVI 


LNVLYGPDAP 


240 


AASNPPAQYS 


WPVNGTFQQS 


TQELPIPNIT 


VNNSGSYTCQ 


300 


EPPKPPITSN 


NSNPVEDEDA 


VALTCEPEIQ 


NTTYLWWVNN 


360 


LSVTRNDVGP 


YECGIQNELS 


VDHSDPVILN 


VLYGPDDPTI 


420 


SNPPAQYSWL 


IDGNIQQHTQ 


ELFISNITEK 


NSGLYTCQAN 


480 


PFCPSISSNNS 


KPVEDKDAVA 


PTCEPEAQNT 


TYLWWVNGQS ■ 


540 


VTRNDARAYV 


CGIQNSVSAN 


RSDPVTLDVL 


YGPDTPIISP 


600 


PSPQYSWRIN 


GIPQQHTQVL 


FIAKITPNNN 


GTYACFVSNL 


660 


PGLSAGATVG 


IMIGVLVGVA 


LI 
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Seq ID NO: 41 DNA sequence 

Nucleic Acid Accession #: NMJ>06952.1 

Coding sequence: 11-793 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 . 

80 

85 



i 
I 

AATCCCGACA 
TGGAAATGTG 
ATCTGACCAA 
GGCTGCCTGG 
TGTAGGCATC 
AGTATATGCC 
ACCCAACCTC 
TGATGACCAG 
CAATTGCTGT 
TGAGAATAAT 
AGAACCTCTC 
CTGCTATGAA 
ATTTGCCATT 
AATTGAATAT 



11 
I 

ATGGCGAAAG 
ATTATTGGTT 
CACAGCCTCT 
ATCGGCATAT 
ATGAAGTCCA 
TTTGAAGTGG 
TTCCTGAAGC 
TGGAAAAACA 
GGCGTAAATG 
GATGCTGACT 
AACCTGGAGG 
CTGATCTCTG 
CTCTGCTGGA 
TAAGAA 



21 
I 

ACAACTCAAC 
GTTGCGGCAT 
ACCCACTGCT 
TTGTGGGCAT 
GCAGGAAAAT 
CATCTTGTAT 
AGATGCTAGA 
ATGGAGTCAC 
GTCCATCAGA 
ATCCCTGGCC 
CTTGTAAACT 
GTCCAATGAA 



31 , 
I 

TGTTCGTTGC 
TGCCCTGACT 
TGAAGCCACC 
CTGCCTCTTC 
TCTTCTGGCG 
CACAGCAGCA 
GAGGTACCAA 
CAAAACCTGG 
CTGGCAAAAA 
TCGTCAATGC 
AGGCGTGCCT 
CCGACACGCC 
TCTCCTGGGT 



41 
I 

TTCCAGGGCC 
GCGGAGTGCA 
GACAACGATG 
TGCCTGTCTG 
TATTTCATTC 
ACACAACGAG 
AACAACAGCC 
GACAGGCTCA 
TACACATCTG 
TGTGTTATGA 
GGTTTTTATC 
TGGGGGGTTG 
ACCATGTTCT 



51 
I 

TGCTGATTTT 
TCTTCTTTGT 
ACATCTATGG 
TTCTAGGCAT 
TGATGTTTAT 
ACTTTTTCAC 
CTCCAAACAA 
TGCTCCAGGA 
CCTTCCGGAC 
ACAATCTTAA 
ACAATCAGGG 
CCTGGTTTGG 
ACTGGAGCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 



Seq ID NO: 42 Protein sequence: 
Protein Accession #: NP_008883.1 

1 11 21 31 41 51 

I 1 I I I I 

MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAW 
IGIFVGICLF CLSVLGIVGI MKSSRKILLA YPILMPIVYA PEVASCITAA TQRDFFTPNL 
PLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 
DADYPWPRQC CVMNNLKKPL NLEACKLGVP GFYHNQGCYB LISGPMNRHA WGVAHPGFAI 
LCWTFWVLLG TMFYWSRIEY 



Seq ID NO; 43 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: B3-2605 

1 11 21 31 41 51 

I I I I I I 

GCCGGACAGA TCTGCGCGTA TCCTGGAGCC GGCCCAGTTG TGAACTAGGA GAGCTTTGGG 60 

ACCTCTGTCC CAAGCAAGAG AGATGAATGG AGAGTATAGA GGCAGAGGAT TTGGACGAGG 120 

AAGATTTCAA AGCTGGAAAA GGGGAAGAGG TGGTGGGAAC TTCTCAGGAA AATGGAGAGA 180 

AAGAGAACAC AGACCTGATC TGAGTAAAAC CACAGGAAAA CGTACTTCTG AACAAACCCC -240 

ACAGTTTTTG CTTTCAACAA AGACCCCACA GTCAATGCAG TCAACATTGG ATCGATTCAT 300 

ACCATATAAA GGCTGGAAGC TTTATTTCTC TGAAGTTTAC AGCGATAGCT CTCCTTTGAT 360 

TGAGAAGATT CAAGCATTTG AAAAATTTTT CACAAGGCAT ATTGATTTGT ATGACAAGGA 420 

TGAAATAGAA AGAAAGGGAA GTATTTTGGT AGATTTTAAA GAACTGACAG AAGGTGGTGA 480 

AGTAACTAAC TTGATACCAG ATATAGCAAC TGAACTAAGA GATGCACCTG AGAAAACCTT 540 

GGCTTGCATG GGTTTGGCAA TACATCAGGT GTTAACTAAG GACCTTGAAA GGCATGCAGC 600 

TGAGTTACAA GCCCAGGAAG GATTGTCTAA TGATGGAGAA ACAATGGTAA ATGTGCCACA 660 

TATTCATGCA AGGGTGTACA ACTATGAGCC TTTGACACAG CTCAAGAATG TCAGAGCAAA 720 

TTACTATGGA AAATACATTG CTCTAAGAGG GACAGTGGTT CGTGTCAGTA ATATAAAGCC 780 

TCTTTGCACC AAGATGGCTT TTCTTTGTGC TGCATGTGGA GAAATTCAGA GCTTTCCTCT 840 

TCCAGATGGA AAATACAGTC TTCCCACAAA GTGTCCTGTG CCTGTGTGTC GAGGCAGGTC 900 

ATTTACTGCT CTCCGCAGCT CTCCTCTCAC AGTTACGATG GACTGGCAGT CAATCAAAAT 960 

CCAGGAATTG ATGTCTGATG ATCAGAGAGA AGCAGGTCGG ATTCCACGAA CAATAGAATG 1020 

TGAGCTTGTT CATGATCTTG TGGATAGCTG TGTCCCGGGA GACACAGTGA CTATTACTGG 1080 

AATTGTCAAA GTCTCAAATG CGGAAGAAGG TTCTCGAAAT AAGAATGACA AGTGTATGTT 1140 

CCTTTTGTAT ATTGAAGCAA .ATTCTATTAG. TAATAGCAAA GGACAGAAAA CAAAGAGTTC 1200 

TGAGGATGGG TGTAAGCATG GAATGTTGAT GGAGTTCTCA CTTAAAGACC TTTATGCCAT 1260 

CCAAGAGATT CAAGCTGAAG AAAACCTGTT. TAAACTCATT GTCAACTCGC TTTGCCCTGT 1320 

CATTTTTGGT CATGAACTTG TTAAAGCAGG TTTGGCATTA GCACTCTTTG GAGGAAGCCA 1380 

GAAATACGCA GATGACAAAA. ACAGAATTCC ■ AATTCGGGGA GACCCCCACA TCCTTGTTGT 1440 

7GGAGATCCA GGCCTAGGAA AAAGTCAAAT GCTACAGGCA GCGTGCAATG TTGCCCCACG 1500 

TGGCGTGTAT GTTTGTGGTA ACACCACGAC CACCTCTGGT CTGACGGTAA CTCTTTCAAA 1560 

AGATAGTTCC TCTGGAGATT TTGCTTTGGA. AGCTGGTGCC CTGGTACTTG GTGATCAAGG 1620 

TATTTGTGGA ATCGATGAAT ' TTGATAAGAT GGGGAATCAA CATCAAGCCT TGTTGGAAGC 1680 

CATGGAGCAG CAAAGTATTA GTCTTGCTAA GGCTGGTGTG GTTTGTAGCC TTCCTGCAAG 1740 

AACTTCCATT ATTGCTGCTG CAAATCCAGT TGGAGGACAT TACAATAAAG CCAAAACAGT 1800 

TTCTGAGAAT TTAAAAATGG GGAGTGCACT ACTATCCAGA TTTGATTTGG TCTTTATCCT 1860 

GTTAGATACT CCAAATGAGC ATCATGATCA CTTACTCTCT GAACATGTGA TTGCAATAAG 1920 

AGCTGGAAAG CAGAGAACCA TTAGCAGTGC CACAGTAGCT CGTATGAATA GTCAAGATTC 1980 

AAATACTTCC GTACTTGAAG TAGTTTCTGA GAAGCCATTA TCAGAAAGAC TAAAGGTGGT 2040 

TCCTGGAGAA ACAATAGATC CCATTCCCCA . CCAGCTATTG AGAAAGTACA TTGGCTATGC 2100 

TCGGCAGTAT GTGT ACCCAA GGCTATCCAC ' AGAAGCTGCT CGAGTTCTTC AAGATTTTTA 2160 

CCTTGAGCTC CGGAAACAGA GCCAGAGGTT AAATAGCTCA CCAATCACTA CCAGGCAGCT 2220 

GGAATCTTTG ATTCGTCTGA CAGAGGCACG . AGCAAGGTTG GAATTGAGAG AGGAAGCAAC 2280 

CAAAGAAGAC GCTGAGGATA TAGTGGAAAT, TATGAAATAT AGCATGCTAG GAACTTACTC 2340 

TGATGAATTT GGGAACCTAG ATTTTGAGCG ATCCCAGCAT GGTTCTGGAA TGAGCAACAG 2400 

GTCAACAGCG AAAAGATTTA TTTCTGCTCT * CAACAACGTT GCTGAAAGAA CTTATAATAA 2460 

TATATTTCAA TTTCATCAAC TTCGGCAGAT TGCCAAAGAA CTAAACATTC AGGTTGCTGA 2520 

TTTTGAAAAT TTTATTGGAT CACTAAATGA CCAGGGTTAC CTCTTGAAAA AAGGCCCAAA 2580 

AGTTTACCAG CTTCAAACTA TGTAAAAGGA CTTCACCAAG TTAGGGCCTC CTGGGTTTAT 2640 ' 

TGCAGATTAA AGCCATCTCA GTGAAGATAT GCGTGCACGC ACAGACAGAC AGACACACAC 2700 

ACACACACAC ACACACACAC ACACACACAC ACACACAGTC AAATACTGTT CTCTGAAAAA 2760 

TGATGTCCCA AAAGTATTAT AATAGGAAAA AAGCATTAAA TATAATAAAC TAATTTAAGA 2820 
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WO 02/086443 : 

AGTGATAAAG TCTCCAGATG CAGTAGCTCA 
GGTGAGAGGA TTCCTTGAGG CCAGGGTTCG 
CATTTCTTAA AAAAAAAAAA AAAAAATTTA 
TAGTCTCAGC TACTTGTGAG GCTGAGGCAG 
TACAGTGAGC CACAATCACA CCAATCACTG 
GACTCAAAAA AATAAAAAAA ATTGTAGTGG 
CCAAAGGGCT AAAAGTAAAT TACTTATAAA 

* TTATATGTAT GAATATTTCA TAGTTTTGCA 
AAACCAATGA ATATATTACA • TATTCTGTGT 
ATTTGAATTT CATAAAATTT TCCCATGTCA 
GCTATTTAAT AATAGGTCTC ATTTATTCCA 
AATAGAAACA GACTGATTAA GCAGGAGAAG 
AATTATTAGA AGGCAGGTGA ACCAGGAGGG 
GCCTTAGAAT TGGACTAAGG AAGAAGCTGC 

' AGAAAGTGCT GCTGCCTCCC TGCCCCACCT 
GAATGCCCCC ■ ACCCGCACCG GAACAGCAAC 
ATTGCTGAAT TCAAAAAAGA AGTTGCATAC 

• TATGCCCCTT TCATAGGCTG CTAGGGAGTT 
AATAAGACCA GAATTTCTCA TATGTTGTGA 
AAACTATCAA TCATGTATAA ATCCAACAAA 
GTGAACCATT GTTGGAGAAT CTACTAAAAT 
AATGTAAATA AAAAGAACTG GCAGTGTATA 
GATGTGGAGA CTATTGCCAT AGACCACAAT 
GGAATCAAAA GGGGCCAGGT GCAGTGGCTC 
GAGGCAGGAG GATCACTTGA AGCCAGTTTT 
TATCTCTACA AAAAATAGAT TAGCTGGGCA 
GTGGAGGCTG AAGTAGGAAA TCACTTGAGC 
TTATACCACT GCACTCCAGC CTGGGCAAGA 

Seq ID NO: 44 Protein sequence: 
Protein Accession #: CAB55276.2 



PCTAJS02/12476 



CACTGTAATC 
AGACCAACCT 
AACTTAGCTG 
GAGGATTCTT 
CACTCCAGCC 
TAGCCATGTG 
TTTTTTATAG 
TATCAGATGT 
TCCAATAAAA 
AGAATACAAA 
CAGGCTGTAG 
TTTTTTGAAA 
TAAGCTTCCA 
TGACACTCCA 
TTGCCACTTC 
AAAAGGATTC 
AAAGACATCT 
TTCCTGGTTC 
GAGGATTCAA 
CACTTTGTAA 
ACGGCTTCCC 
TCAGATGTTT 
GTAAATTTTT 
ACATCTATAA 
GAGACCAGCC 
CGGTGGTGCA 
CCGAGAGTTT 
GAGCAAGACC 



ACAGTGACTC 
TGGGCAACAT 
GGTATGGTGG 
TGAGCCCAGG 
TGGGCAATAA 
TTAATTGTTA 
TTGTATTTTT 
AGGCATACAG 
' CTTT ATTT AT 
ATACTTGAGT 
TTTGTAGTCT 
GAATTTTGTT 
GCAGCAATTT 
CTGCCACACA 
TGCAGCAGGA 
TGCATGAGAT 
GATTGAAAAA 
TACTTTCAGG 
ATGTTACAGG 
CATACAAGAA 
GCAAACGAAG 
AACTATAGGA 
AAGTGAGGAA 
TCCCAGAGCT 
TATGCAACAC 
TGCCTATTGT 
GAGGTTACAG 
TTGTCTCTT 



AGGAGGCTGA 
AGCAAGACCC 
CACATGCCTA 
AGTTTGAGGT 
AGTAACTCTT 
AATAAATTCT 
GACCTGCCTT 
ACAAATACAT 
GGACACTAAA 
TTTGTTTTTA 
TGCTTGAAAC 
TGGCTCACGG 
GTAAAACCAT 
GGGCACTGGA 
ATAGGTAGAA 
GCCTCCCTAA 
GGGTATGTTA 
TGGTGGGATC 
GTTGCCAGCC 
CTCAGGAAAT 
ATGAATGGAA 
CCAGAACTAA 
GGAAAAATCA 
TTGGGAGTTC 
ATTGAGACCC 
CCTACCTACT 
TGAGCTATGA 



MNGEYRGRGF 
TPQSMQSTLD 
ILVDFKELTE 
LSNDGETMVN 
LCAACGSIQS 
QREAGRIPRT 
SISNSKGQKT 
KAGIiALAItFG 
TTTTSGLTVT 
LAKAGWCSL 
HDHLLSEHVI 
IPHQliliRKVI 
EARARLELRE 
SALNNVAERT 



11 
I 

GRGRFQSWKR 
RFIPYKGWKL 
GGEVTNLI PD 
VPHIHARVYN 
FPLPDGKYSL 
IECBLVHDLV 
KSSEDGCKHG 
GSQKYADDXN 
LSKDSSSGDF 
PARTS I I AAA 
AIRAGKQRTI 
GYARQYVYPR 
EATKEDAEDI 
YNNIFQFHQL 



21 . 
I 

GRGGGNFSGK 
YFSEVYSDSS 
IATELRDAPE 
YBPLTQLKNV 
PTKCPVPVCR 
DSCVPGDTVT 
MLMEFSLKDL 
RIPIRGDPHI 
ALEAGALVLG 
NFVGGHYNKA 
SSATVARMNS 
LSTEAARVLQ 
VEIMKYSMLG 
HQIAKELNIQ 



31 
I 

WREREHRPDL 
PLIEKIQAFE 
KTLACMGIAI 
RANYYGKYIA 
GRSFTALRSS 
ITGIVKVSNA 
YAIQEIQAEE 
LWGDPGLGK 
DQGICGIDEF 
KTVSENLKMG 
QDSNTSVLEV 
DFYLELRKQS 
TYSDEFGNLD 
VADFENFIGS 



41 
I 

SKTTGKRTSE 
KFFTRHIDLY 
HQVLTKDLER 
LRGTWRVSN 
PLTVTMDWQS 
EEGSRNKNDK 
NIiFKLIVNSL 
SQMLQAACNV 
DKMGNQHQAL 
SALLSRFDLV 
VSEKPLSERL 
QRLNSSPITT 
FERSQHGSGM 
IiNDQGYLLKK 



Seq ID NO: 45 DNA sequence 

Nucleic Acid Accession #: NM_005416.l 

Coding sequence: 149.. 658 



ACCAGATCCC 
CTGAAGACCA 
AAAGAGTGTG 
CCCACCACGT 
AATATTTGTT 
AAAGATTCCA 
GCCAGGCTGT 
CAAGGTCCCT 
ACCAGGCAGC 
CAAAGTTCCT 
GCCATGTCCT 
TGGTGCACAG 
TGTTTCTGTG 
AGTCTCTCTC 
CTGAAGAATC 
GGCTGCTCAG 
CTCATTAAAT 



11 
I 

AGAGGCTGAA 
GAAAAGCCAC 
TCCACGATCC 
CAGCTTCAAG 
CCCACAACCA 
GAGCCAGGCT 
ACCAAGGTCC 
GAGCCAGGTT 
ATCAAGGTCC 
GAGCAAGGAT 
TCAACGGTCA 
ACAAGCCCTT 
TCTTAATTGT 
TTATTTGTAT 
CTGTAAGCCC 
GGTTCATCTG 
TGCTTTTAAT 



21 
I 

CACCTCGACC 
TAAGACTTTC 
TTTGAAGCAT 
AGCAGCAGGT 
AGGAGCCATG 
GTACCAAGGT 
CTGAGCCAGG 
GTACCAAGGT 
CTGACCAAGG 
ACACCAAAGT 
CTCCAGGCCC 
GAGAAGCCAA 
CTGTAGACCT 
CCTAAAAATA 
CTGAATTAAG 
AAGATTCGAA 
TCCA 



31 
I 

TTCTCTGCAC 
TGCTTAATTC 
GAGTTCTTAC 
GAAACAACCC 
CCACTCAAAG 
CCCTGAGCCA 
TTGTACCAAG 
CCCTGAGCCA 
CTTCATCAAG 
TCCTGTGCCA 
AGCTCAGCAG 
CCACCAGATG 
TGTAATCAGC 
CGTACTATAA 
CAGAAAGTCT 
TGAAAAGAAA 



41 

I 

AGCAGATGAT 
AGGAGCTTAG 
CAGCAGAAGC 
AGCCAGCCTC 
GTTCCACAAC 
GGCTGTACCA 
GTCCCTGAGC 
GGCTACACCA 
TTTCCTGAGC 
GGCTACACAA 
AAGACCAAGC 
CTGGACACCC 
ACATTGTCAC 
AGCTTTTGTT 
TCATGGCTTT 
TGCATGTTTC 



51 
I 

CCCTGAGCAG 
AGGATTCTTC 
AGACCTTTAC 
CACCTCAGGA 
CTGGAAACAC 
AGGTCCCTGA 
CAGGCTGTAC 
AGGTCCCTGA 
CAGGTGCCAT 
AGCTACCAGA 
AGAAGTAATT 
TCTTCCCATC 
CCCAAGCCAT 
CACACACACT 
TCTGGTCTTC 
CTGCTCTTCC 



Seq ID NO: 46 Protein sequence: 
Protein Accession #: NP_005407.1 

I ' 11 21 . 31 41 51 

i i i : I I i 

MSSYQQKOTF TPPPQK2QQQ VKQPSQPPPQ EIFVPTTKEP CHSKVPQPGN TKIPEPGCTK 
VPEPGCTKVP EPGCTKVPEP GCTKVPEPGC TKVPEPGCTK VPEPGYTKVP EPGSIKVPDQ 
GFIKFPEPGA IKVPEQGYTK VPVPGYTKLP EPCPSTVTPG PAQOKTKQK 

• ■ u * 

Seq ID NO: 47 DNA sequence; 

Nucleic Acid Accession 8: Eos sequence 



2880 
2940 
"3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 



51 
I 

QTPQFLLSTK 
DKDEIERKGS 
HAAELQAQEG 
IKPLCTKMAF 
IKIQELMSDD 
CMFLLYIEAN 
CPVIFGHELV 
APRGVYVCGN 
LEAMEQQSIS 
FILLDTPNEH 
KWPGETIDP 
RQLESLIRLT 
SNRSTAKRFI 
GPKVYQLQTM 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



60 
120 



206 



5 

10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 V : . . 

1 11 21 • "31 

I I I I 

GCGTCGTGTG CAGGCGTCCC CGGGCTGTGG ATAATTAGAC 

AAGGCTCGTT AGAATTCGCC CTAGAGCTGT ATCATGTATT 

TTGCAATTAA GCTTAGGGAA CCAGCAACAA AAGCAAACTT 

GAAAATGGAT TAGAGAAACT TCTTCCCCGA TTTAAGGGGA 

TTTGGGGAAA GTGCCCCGAC CGCAGAGGCG ACGACAGGGG 

AGTCGGCGTT GGCGGCAGCG GTGGCCTTCC TCA TCTGG GC 

TAAGGATAAC ATCCTGGAAA TGACTTCTGT AOGGTTTGAG 

TTGGAGCTGC CCTGTGGAGT TACAGTTTAC CAAACACATT 

CTAAAAACTT TGTGAGAATT TTCTTTTACT AAAATTTTTT 

Seq ID NO: 48 .DMA sequence: 

Nucleic Acid Accession #: CAT cluster 



PCIYUS02/12476 



41 

I 

ACGTTCTTCC 
TTCTTTCAAA 
GGCCCGAGGT 
AAGATTCCTG 
AGCAGGAAGC 
GATGTGGGCT 
CCCAACTGCA 
CATGAACATA 
CTTATTACAA 



X 

I . ' 

TTCCAAATTT 
TTTTAGTAAA 
CTCCAAGTCA 
TCCTTACTCT 
CCGACTACCG 
CCCAAAGCGC 
ATTTTCGCGG 
TTGCAAGCAA 
AGCCTTGGGC 
CGACGCT 



11 
I 

TTTTTTTTGT 
TGAGATTATG 
TGAGTGTGCA 
TCTCGGAGCC 
TGAGCAGCTT 
TGGCCGCAGG 
TGAACGACCT 
AGTTAATTTG 
AATGAGGGAA 



21 
I 

AATAAGAAAA 
TTCATGAATG 
GTTGGGCTCA' 
CACATCGCCC 
CCTGCTCCCC 
AATCTTTCCC 
CGGGCCAAGT 
AAAGAAAATA 
GAACGTGTCT 



31 . 
I 

AATTTTAGTA 
TGTTTGGTAA 
AACCGTACAG 
AGATGAGGAA 
TGTCGTCGCC 
CTTAAATCGG 
TTGCTTTTGT 
CATGATACAG 
AGTTATCCAC 



Seq ID NO; 4 9 DNA sequence - 
Nucleic Acid Accession #; CAT cluster 



TCTTTCTTCT 
CCTGCCGACC 
ACCGTAGACC 
GACACATGCA 
CCCAACCAAA 
TTCATTTAAA 
CTTTCTCTGA 
TCGCTGTAGC 
TAAAGAAACT 
GATTGAACCA 
TGCTGGTATC 
GACGTGGGGA 
CCGGGTCTCT 
GAGACAGCAT 
ACTG 



11 
) 

GCTGCTCGTT 
TCTGTTGTCT 
CGAAACCATT 
GACCAGTTTT 
GTGTTTAAAA 
AAACTCTAAT 
TCTGTGTCTT 
CATGGGAATC 
GACACAGGAG 
GTGCACTCCA 
GTCCTGCAGC 
GAGCTGGTCT 
CCTGGCCCCG 
CATTTATGAG 



21 
I 

TGTCTCTCCT 
CTTCTCTGAT 
GGGTGTCACA 
CCTGGAACNG 
CTTTTTAGGG 
ATTTATATTA 
TTTTCTTTGA 
CGTTTCATTA 
AATCACTTGA 
GCCTTGGCAG 
CCCATCCTCG 
ATATATCCGG 
GGGACCTAGT 
CCTGCAGCAT 



31 
I 

GTGCTCTTCT 
GGCGGGGGGC 
AGCCGGTCGC 
CATGACCATG 
CACCCCCAAA 
AATACAAAGA 
CAGCATCTCC 
TTATGGTAGC 
ACTTGGGAGG 
CGGAGCAAGA 
GTTCCATTGC 
GTGAAGCTCA 
ATTTTTGCCA 
CCACCCTACT 



. Seq ID NO: 50 DNA sequence 
Nucleic Acid Accession #; L05187 
Coding sequence; 1991.. 2260 



CTGCAGGGAG 
TCAGAAAGGA 
CAGAAGAAGG 
TGAAGGAAAG 
AGAGTCATAA 
GGAAATGGAT 
ATTTCTAGCT 
CCCCTCCCTT 
ACAACCATCT 
CCAGGGTTAA 
CCCTGCACCT 
CAGCCAGCTA 
GATGAGGATG 
AGCTTCTATT 
TCACACCAAA 
ATTGCAACAA 
ATATGTGTAA 
TATTTTAAGT 
CCTCAGTAGA 
AGTTCATAGC 
TGACAAGATA 
ATTTAAGGCA 
AACATAAAAC 
AGTAATTGGC 
AGGAGACCTC 
AGATGGGAAG 
GAGGCTTAGA 
GAGGAAAGTG 
GAAGCCAGCT 
GAGCCAAGAA 
TTCAAAGGGC 



11 

1 

GCAGGTAGAA 
GGAAAAGGCC 
ATTAGCCCCT 
CAGGTTTTCC 
GTAAATTATT 
GGAAGGTCTT 
TCCACCTTCA 
TCCCACCTAT 
CAATGACAAG 
CTCATGAAAC 
GGGTCTGAGG 
GTGCCAAAAA 
GTAGTGTGAG 
TCCTTGAGGC 
CCCAAGGGAC 
ACTGGCAATT 
GCAGGTTAAT 
TAAATTACAG 
TAGTCATTGA 
AGAACTAGAA 
TTTATAGAAA 
GTATGCTAGG 
CTAGCAGGAA 
ATGACGGAGA 
TAGGGTGTCA 
AAAAGCATTT 
TGAATATAAA 
GTCTGATGCC 
TTAGTAGGGC 
GAGAACTCCA 
CTGAAAATTA 



21 
1 

AAGGCTTTTG 
AGGGCAGATG 
GAAAGTCCCT 
CAGATTAGCA 
CTGAATGTGT 
GGACTCTGAG 
CCAAGGCAGA 
TCATGTGTGC 
GACAGCAGGT 
CCTCCATGAA 
ATGAGGGTGG 
ATATCAGGTG 
TCATGTGTGA 
AGGGCTCATT 
CACACAGCCC 
CTAGTGTACT 
CCAGGGTTTC 
TCTGGATTTG 
ACTGGGAGTC 
CTCAGGCCAG 
TTTTAATTTA 
CACTTTGGAC 
GGTAATACAT 
TGGGCAGAGA 
AGTGATGTGA 
GGAAGGGACT 
GCCATCCTAT 
ATTTTCCAAA 
ATTTTTCCAG 
ATAAAATGGA 
TCCAAGCTTA 



31 
I 

GGTTTTCAGG. 
TCTGGGTGGA 
GAAGTAGGAG 
ACCAGTCAGG 
GTAGTTTAAT 
ACAAGGGGTC 
CAAGGAGGGC 
AAGAGTGCCC 
GGCAAGGCTC 
GCCTGCTGCT 
CAGTGAAAAT 
GTGTTCATCA 
CAGGTGAGGA 
CATCTTATAA 
ATTCTGCTCC 
TTTTCATTAT 
AATGGGAGAT 
AAAGGACCTT 
CTGGAGAAGA 
AGCACTCTCA 
TTAGATGGAT 
AAATCAATGC 
ATATATAAAT 
AGGGCTGTGC 
GCTATGATGG 
GTGTAAGCAC 
AAGTCACAGG 
AGACCTAATA 
AACAGATATA 
GCAGAAGAAA 
TTTCATTTTT 



41 
I 

AAAGAAAATT 
ACTGTAACTC 
AAGTCATTTC 
GGCCACCGCT 
TCTGCGGTCG 
GGAAGAAGTT 
TGCTGGTTCC 
CTCTAGGGCG 
AGCCCGGGGA 



41 

I 

TCTTTCTTTC 
GGGAGAAGCT 
CGGCTTTTTT 
TTATTACTAT 
ATTTTTTTTT 
TACCCAAACC 
ATTTTTTTTC 
AATATGGAGT 
CAGAGTTTGC 
TTCTGTCACA 
GCTGCCAGGC 
GCTGTGGCAC 
CGAGTGTACA 
GCTGTATCCA 



41 
I 

TGGGGGGCAG 
GTGAAGGGAA 
AAGGGTAAAG 
GGGAGGAAGG 
GGAATTGGGA 
TATAATCAGT 
CCACCTCAGC 
TGTCCCACAG 
AACAGGACTC 
CACCCCTCCC 
TAGGCCAGTG 
AATAAGCCGA 
ATGAAAACAG 
AAGCCAGCTG 
GTATACCAGG 
TAGAAATTAG 
AGAGAATAGT. 
AGAGATGGTT 
TTGTTCAAAT 
GTAACACTGC 
CTCTACTGAG 
CCTAACGTAC 
AAATGAAATG 
ACTTTTGGGA 
AGGGGTATTT 
AGACCAGAAG 
CTTTCTACAT 
TGCGGACCTC 
AGGTGCCTTG 
TTGCCTTTTA 
AAATGTAATG 



SI 




1 

CTCATTGCCC 


60 


TTAACTTTGC 


120 


CGTTCACCGC 


180 


CGGCCAGCGC 


- 240 


TGCTCACGGT 


300 




360 


CACTCATGAC 


420 


ATCTCATTTA . 


480 


A 




51 




1 

CTCACAAAGT 


60 






CAGGATGTTA 


180 


GCCGCCAACG 


240 


GGGCACTTTC 


300 


TCTCTAATCC 


360 


CTAAGCTTAA 


420 


AATTCTAACG 


480 


CGCCTGCACA 


540 


51 




1 

CCTCGCCGCT 


60 


GACCGGTGAG 


120 


GGGAGAACCC 


180 


GGGCCGCCTC 


240 




300 


CTTTATGCTT 


360 


TGCTGCTTCA 


420 


GCTGTATTCC 


480 


AGTGAGCCGA 


540 


GTTCCTGAAG 


600 


AGGGTGCTGG 


660 


ACCTTGGATG 


720 


CCAAACAAAG 


780 


GTTTCCATTG 


840 


51 




| 

TCTAGCCTGA 


60 


AAAGTGATCC 


120 


GTGTGGTTGG 


180 


TGAGAGTGGG 


240 


AAAAGATGGG 


300 


CCATTTCATT 


360 


TCCTCTGCTC 


420 


AACACGGGGA 


480 


AGATGTCCCC 


540 


TCAAGGCAAG 


600 


ACATCATTTT 


660 


GCCAACCGGT 


720 


AGTGCCCGAG 


780 


GCCATTGCCT 


840 


TAAGTCTCTG 


900 


CTAAAGGCAA 


960 


GGAATATCTT 


1020 


AGGGCTCCCA 


1080 


GCCCATGGGA 


1140 


AATTTCCCCC 


1200 


CATTTATTCC 


1260 


TTACTTAACA 


1320 


CAAAGTAGAT 


1380 


GACTTGCTCA 


1440 


GGACAAGCAG 


1500 


CAAAACCATA 


1560 


GGTACTAGGA 


1620 


ATGTCCCTCA 


1680 


GGTAGGAAGG 


1740 


GCTCCTCCTC 


1800 


GGGGAGCTAA 


1860 



207 



; WO 02/086443 

GGGAGATGAA AGGCTTTCTC TTCTAAAGGG TCCPGAAATA AAATCTGTTT GGCATTGAAT 1920 

TTGTATCCAT CTTTCTTTAA TTGAATCACT GTGTCAGCTT TCTGTCTCTA GAAAAAAACA * 19 BO 

CATTTGAAGC ATGAATTCTC AGCAGCAGAA GCAGCCTTGC ACCCCACCCC CTCAGCCTCA .2040 

GCAGCAGCAG GTGAAACAAC CTTGCCAGCC TCCACCCCAG GAACCATGCA TCCCCAAAAC 2100 

CAAGGAGCCC TGCCAACCCA AGGTGCCTGA GCCCTGCCAC CCCAAAGTGC CTGAGCCCTG 2160 

CCAGCCCAAG . ATTCCAGAGC CCTGCCAGCC CAAGGTGCCT GAGCCCTGCC CTTCAACGGT 2220 

CACTCCAGCA CCAGCCCAGC AGAAGACCAA GCAGAAGTAA TGTGGTCCAC AGCCATGCCC 2280 

TTGAGGAGCT GGCCACTGGA TACTGAACAC CCTACTCCAT TCTGCTTATG AATCCCATTT 2340 

GCCTATTGAC CCTGCAGTTA GCATGCTGTC ACCCTGAATC ATAATCGCTC CTTTGCACCT 2400 

CTAAAAAGAT GTCCCTTACC CTCATTCTGG AGGCTCCTGA GCCTCTGCGT AAGGCTGAAC 2460 

GTCTCACTGA CTGAGCTAGT CTTCTTGTTG CTCGGGTGCA TTTGAGGATG GATTTGGGGA 2520 
AGGTCAAGTG ACCATCCCTA G 

Seq ID NO: 51 Protein sequence: 
Protein Accession #:AAC26838 

1 11 21 31 41 51 

I II I t 1 

MNSOQQKQPC TPPPQPQQQQ VKQPCQPPPQ EPCIPKTKBP CQPKVPEPCH PKVPEPCQPK . 60 
IPEPCQPKVP EPCPSTVTPA PAQQKTKQK 



Seq ID NO: 52 DNA sequence 

Nucleic Acid Accession #: NM_002638.1 

Coding sequence: 120-473 

1 11 21 31 41 51 

I I I I I I 

CAATACAGCT AAGGAATTAT CCCTTGTAAA TACCACAGAC CCGCCCTGGA GCCAGGCCAA 60 

GCTGGACTGC ATAAAGATTG GTATGGCCTT AGCTCTTAGC CAAACACCTT CCTGACACCA 120 

TGAGGGCCAG CAGCTTCTTG ATCGTGGTGG TGTTCCTCAT CGCTGGGACG CTGGTTCTAG 180 

AGGCAGCTGT CACGGGAGTT CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTGTTCCAT 240 

TCAATGGACA AGATCCCGTT AAAGGACAAG TTTCAGTTAA AGGTCAAGAT AAAGTCAAAG 300 

CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC ATTATCTTGA 360 

TCCGGTGCGC CATGTTGAAT CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 420 

TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA TGGCCTGTTT CGTTCCCCAG TGAAGGGAGC 480 

CGGTCCTTGC TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT CCTAAGTCCC 540 

TGCTGCCCTT CCCCTTCCCA CACTGTCCAT TCTTCCTCCC ATTCAGGATG CCCACGGCTG 600 
GAGCTGCCTC TCTCATCCAC TTTCCAATAA A . „ . 

Seq ID NO: 53 Protein sequence: 
Protein Accession #: NP_002629.1 

1 11 21 31 41 51 

I I . I I I 1 

MRASSPLIW VFLIAGTLVL EAAVTGVFVK GQDTVKGRVP FNGQDPVKGQ VSVKGODKVK 60 
AQEPVKGPVS TKPGSCPIII, IRCAMIiNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ 

Seq ID NO: 54 DNA sequence 
Nucleic Acid Accession #: NM_019618 
Coding sequence: 75-5B4 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGCC ACGATTCAGT CCCCTGGACT GTAGATAAAG ACCCTTTCTT GCCAGGTGCT 60 

GAGACAACCA CACTATGAGA GGCACTCCAG GAGACGCTGA TGGTGGAGGA AGGGCCGTCT 120 

ATCAATCAAT GTGTAAACCT ATTACTGGGA CTATTAATGA TTTGAATCAG CAAGTGTGGA 180 

CCCTTCAGGG TCAGAACCTT GTGGCAGTTC CACGAAGTGA CAGTGTGACC CCAGTCACTG 240 

TTGCTGTTAT CACATGCAAG TATCCAGAGG CTCTTGAGCA AGGCAGAGGG GATCCCATTT 300 

ATTTGGGAAT CCAGAATCCA GAAATGTGTT TGTATTGTGA GAAGGTTGGA GAACAGCCCA 360 

CATTGCAGCT AAAAGAGCAG AAGATCATGG ATCTGTATGG CCAACCCGAG CCCGTGAAAC 420 

CCTTCCTTTT CTACCGTGCC AAGACTGGTA GGACCTCCAC CCTTGAGTCT GTGGCC TTCC 480 

CGGACTGGTT. CATTGCCTCC TCCAAGAGAG ACCAGCCCAT CATTCTGACT TCAGAACTTG 540 

GGAAGTCATA CAACACTGCC TTTGAATTAA ATATAAATGA CTGAACTCAG CCTAGAGGTG 600 

GCAGCTTGGT CTTTGTCTTA AAGTTTCTGG TTCCCAATGT GTTTTCGTCT ACATT TTCT T 660 

AGTGTCATTT TCACGCTGGT GCTGAGACAG GGGCAAGGCT GCTGTTATCA TCTCATTTTA 720 

TAATGAAGAA GAAGCAATTA. CTTCATAGCA ACTGAAGAAC AGGATGTGGC CTCAGAAGCA 7 B0 

GGAGAGCTGG GTGGTATAAG GCTGTCCTCT CAAGCTGGTG CTGTGTAGGC CACAAGGCAT 840 

CTGCATGAGT GACTTTAAGA CTCAAAGACC AAACACTGAG CTTTCTTCTA GGGGTGGGTA 900 

TGAAGATGCT TCAGAGCTCA TGCGCGTTAC CCACGATGGC ATGACTAGCA .CAGAGCTGAT 960 

CTCTGTTTCT GTTTTGCTTT ATTCCCTCTT GGGATGATAT CATCCAGTCT TTATATGTTG 1020 

CCAATATACC. TCATTGTGTG TAATAGAACC TTCTTAGCAT TAAGACCTTG TAAACAAAAA 1080 

TAATTCTTGT GTTAAGTTAA ATCATTTTTG TCCTAATTGT AATGTGTAAT CTTAAAGTTA 1140 
AATAAACTTT GTGTATTTAT ATAATAAAAA AAAAAAAAAA AAA 

Seq ID NO: 55 Protein sequence: 
Protein Accession #: NP_062564 

1 11 21 . 31 41 51 

I I I I I 1 

MRGTPGDADG GGRAVYQSMC KPITGTINDL NQQVHTLQGQ NLVAVPRSDS VTPVTVAVIT 60 
CKYPEALEQG RGDPIYLGIQ NPEMCLYCEK VGEQPTLQLK EQKIMDLYGQ PEPVKPFI»FY 120 
RAKTGRTSTL ESVAFPDWFI ASSKRDQPII LTSELGKSYN TAFELNIND . 

Seq ID NO: 56 DNA ■ sequence 
Nucleic Acid Accession #: NM_003125 
Coding sequence: 65-334 



208 



WO 02/086443 



PCT/US02/12476 



1 . 11 21 31 .41 SI 

I. I I I I I 

AGCAGTTCTA AGGGACCATA CAGAGTATTC- CTCTCTTCAC ACCAGGACCA GCCACTGTTG SO. 

5 CAGCATGAGT TCCCAGCAGC AGAAGCAGCC CTGCATCCCA CCCCCTCAGC TTCAGCAGCA 120 

GCAGGTGAAA CAGCCTTGCC AGCCTCCACC TCAGGAACCA TGCATCCCCA AAACCAAGGA 1B0 
GCCCTGCCAC CCCAAGGTGC CTGAGCCCTG CCACCCCAAA GTGCCTGAGC CCTGCCAGCC ■ 240 

CAAGCTTCCA GAGCCATGCC ACCCCAAGGT GCCTGAGCCC TGCCCTTCAA TAGTCACTCC 300 

AGCACCAGCC CAGCAGAAGA CCAAGCAGAA GTAATGTGGT CCACAGCCAT GCCCTTGAGG 360 

10 AGCOGGCCAC CAGATGCTGA ATCCCCTATC CCATTCTGTG TATGAGTCCC ATTTGCCTTG 420 

CAATTAGCAT TCTGTCTCCC CCAAAAAAGA ATGTGCTATG AAGCTTTCTT TCCTACACAC 480 

TCTGAGTCTC TGAATGAAGC TGAAGGTCTT AGTACCAGAG CTAGTTTTCA GCTGCTCAGA 540 

ATTCATCTGA AGAGAGACTT AAGATGAAAG CAAATGATTC AGCTCCCTTA TACCCCCATT 600 
AAATTCACTT TCAATTCCA 



15 
20 
25 



Seq ID NO: 57 Protein sequence: 
Protein Accession #:NP_0 03116 

1 11 21 31 41 51 

1 I I 1 I 1 

MSSQQQKQPC IPPPQLQQQQ VKQPCQPPPQ EPCIPKTKEP CHPKVPEPCH PKVPEPCQPK 60 
LPEPCHPKVP EPCPSIVTPA PAQQKTKQK 

Seq ID NO: 58 DNA sequence 

Nucleic Acid Accession #: NM_001793.2 

Coding sequence: 71-2560 



30 1 11 21 31 41 51 

I I i I I I 

AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 60 

CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 120 

CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC ISO 

35 CTTGGAGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 240 

CTGCCCTGGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA . 300 

TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 360 

ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420 

TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480 

40 AGACACCAAG ATTTTCTACA GCATCACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 540 

CTTCGCTGTA GAGAAGGAGA CAGGCTGG7T GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600 

GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 660 

CCCCATGAAC ATCTCCATCA TCGTGACCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 720 

GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 780 

45 GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840 

CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 900 

CACCATCAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 960 

CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 1020 

GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 1080 

50 GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 1140 

CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGACGACG GGGACCATTT 1200 

TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 1260 

TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 1320 

GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATGAGGC 1380 

55 ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 1440 

GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 1500 

CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 1560 

TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620 

GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680 

60 ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 1740 

CCAAAGCCCT GTGCGCCAGG TGCTGAACAT CACGGACAAG GACCTGTCTC CCCACACCTC 1800 

CCCTTTCCAG GCCCAGCTCA CAGATGACTC AGACATCTAC TGGACGGCAG AGGTCAACGA 1860 

GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATATGACGT 1920 

GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980 

65 GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 2040 

CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 2100 

GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160 

CGTCTTCTAC TATGGOGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 2220 

GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 2280 

70 . CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TCGGCAACTT 2340 

TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACGACACCCT 2400 

CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCCGCG TCCCTGAGCT CCCTCACCTC 2460 

CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520 

GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GOGGCCTGCC TGCAGGGCTG 2580 

75 GGGACCAAAC GTCAGGCCAC AGAGCATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2640 

GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 2700 

ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 2760 

AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 2820 

TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 2880 

80 rACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2940 

TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 3000 

GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 3060 

TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 3120 

GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180 

85 TAAAGAAACT TTTCCCAGAA AAAAA 

Seq ID NO: 59 Protein sequence: 



209 



WO 02/086443 

Protein Accession 8: 



PCT/US02/12476 



NP 001784.2 



11. 



.10. 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



MGLPRGPLAS 
QEPALFSTDN 
KGPFPQRLNQ 
YELFGHAVSE 
DEDDAIYTYN 
TDMDGDGSTT 
AWRATYLIMG 
PTSTATIWH 
DPAGWIAMDP 
VNDHGPVPBP 
TWLSLKKFL 
GAVLALLFLL 
GLEARPEWL 
DYEGSGSDAA 



LUJjQVCWLQ 
DDPTVRNGET 
LKSNKDRDTK 
NGASVEDPMN 
GWAYSIKSQ 
TAVAWEILD 
GDDGDHFTIT 
VEDVNEAPVP 
DSGQVTAVGT 
RQITICNQSP 
KQDTYDVHLS 
LVLIiLLVRKK 
RNDVAPTIIP 
SLSSLTSSAS 



21 . 

i • : 

CAASEPCRAV 
VQERRSLKER 
IFYSITGPGA 
ISIIVTDQND 
EPKDPHDLMF. 
ANDNAPKFDP 
THPESNQGIL 
VPPSKWEVQ 
LDREDEQFVR 
VRQVLNITDK 
LSDHGNKEQL 
RKIKEPLLLP 
TPMYRPRPAN 
DQDQDYDYLN 



31 
I 

FREAEVTLEA 
NPLKIFPSKR 
DSPPEGVFAV 
HKPKFTQDTF 
TIHRSTGTIS 
QKYEAHVPEN 
TTRKGLDFEA 
EGIPTGEPVC 
NNIYEVMVLA 
DLSPHTSPFQ 
TVIRATVCDC 
EDDTRDNVFY 
PDEIGNFIIE 
EWGSRFKKLA 



Seq ID NO: 60 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: . 162-420 



GCGTTCCGTT 
CATACGGACC 
AGCACCTCGG 
TCTCCCAGAG 
AGCGAAAGAA 
GTTTACTGTT 
GTAGAGTCAT 
GAGGTTAGAA 
AGATCATAAA 



11 
I 

GGCGGOGGAT 
GGATTGTTTT 
AAGCTGAGGC 
GAAGCAGATA 
GCCTCAACTT 
TGTTCATCGA 
TAACAAGGAG 
GTCAAAGAAC 
GACATTTTTT 



21 

! 

TCGAACGTTC 
CGCTGGCCCA 
AGCTGGTACT 
AAGCGGAAGG 
CGTCTGGAGA 
TTAGCAGAAG 
CATGTACTGG 
ATATTCTTGA 
ACACATCAGT 



31 
I 

GGACTGAGGT 
GTGTCCCCGG 
TGACAGAGAG 
CTCCCCGTGG 
AAAGTGGTGA 
AGTCCAGGAC 
CCGCAGCAAA 
AAGTTATGAT 
TAATATGGGA 



41 


51 




| - 

GGAEQEPGQA 


1 • 

LGKVFMGCPG 


60 


ILRRHKRDWV 


VAPISVPENG 


120 


EKETGWLIiLN 


KPLDREEIAK 


180 


RGSVLEGVLP 


GTSVMQVTAT 


240 


ifT pevT r^Ti utr 
VISSGIiDREK 




300 


AVGHEVQRLT 


VTDLDAPNSP 


360 


KNQHTLYVEV 


TNEAPFVLKL 


420 


VYTAEDPDKE 


NQKISYRILR 


480 


MDNGSPPTTG 


Tv5TLLujTL»i. U 


540 


AQLTDDSDIY 


WTAEVNEEGD 


600 


HGHVETCPGP 


WKGGFILPVL 


660 


YGEEGGGEED 


QDYDITQLHR 


720 


NLKAANTDPT 


APPYDTLLVF 


780 


DMYGGGEDD 






41 


51 




1 

TTTTCTGCCT 


1 

GAAGAAGCGT r 


60 


AGCTTGTGTG 


CGATACAGAG ' 


120 


GATGGCGCTG 


TCGACCATAG 


180 


CTTTCTAAAG 


CGAGTCTTCA 


240 


CTTATTGGTC 


CATCTGAACT 


300 


AAACGCTTGT 


GGGAGTAAAT 


360 


GGTAATTCTA 


AAGAAGAGCA 


420 


GCATTCTTTT 


GGGTGGTAAC 


480 


TTATTAAATA 


TTGG 





Seq ID NO: 61 Protein sequence: 
Protein Accession #: Eos sequence 



1 11 21 31 41 SI 

I I I I I I 

MALSTIVSQR KQIKRKAPRG FLKRVFKRKK PQLRLEKSGD LLVHLNCLLF VHRLAEESRT 

NACASKCRVI NKEHVLAAAK VILKKSRG 



Seq ID NO: 62 DNA sequence 

Nucleic Acid Accession #: NM_000094.2 

Coding sequence: 99-8933 



GGGCTGGAGG 
GAGGCGGGGG 
CCGCGCTCTG 
GAGTGACCTG 
CCATTGGCCG 
TCTCTGGAGC 
CACGGACAGA 
GTGAGCTTAG 
ACCATGTCTT 
CAGACGGGAA. 
TCAAGCTATT 
CACAGCCAAC. 
TGCCCCTCGT 
CGGATGACTC 
TGAGAGTACA 
CTCTGACGGG 
GTGAGACCAG 
TTGCCCTCTA 
TAGAAGGGCC 
GGAGTGTGCC 
CACAGCAGCA 
GCACGGACTA 
CCCTGATGGC 
CCACATCCAT 
GGCGGCGTGA 
GCTACCAGTT 
TGGAGGGCCA 
TGAGCCCTGT 
GGAGCCCAGT 
AGOGGACCCT 
GGCTTAGCTA 
TCCTCACTGT 
TGTCAGATGC 
GGATTAGCTG 
CTGCCACAGA 
TGCGAGGCAG 
CAGTGAGGAC 
GGGTTCCTGG 



11 . 
I 

GGCGCTGGGC 
TCCTAGCTGA 
CGCCGGGATC 
CACGOGCCTT 
CAGCAATTTC 
AGCCAGTGCA 
GTTCGGCCTG 
CTACAAGGGG 
CCTGCCCCAG 
GTCCCAGGAC 
TGCTGTGGGG 
CTCCGACTTC. 
TTCCCGGAGA 
GACCTCTGCT 
GTGGACAGCG 
GCTGGGACAG 
TGTGCGGCTG 
CGCCAACAGC 
GGAACTGACC 
AGGTGCCACT. 
GGAGCTGGGC 
TGAGGTGACC 
TCGCACTGAC 
CCTCtTTTCC 
GACTGGCTTG 
GGATGGGCTG 
CGAGGTGGCC 
AACAGACCTG 
CCCTGGTGCC 
GGTGCTTCCT 
CACTGTGCGG 
CCGCCGGGAG 
AACGCGAGTG 
GAGCACAGGC: 
CATCACAGGG 
AGAGGAGGGC 
GGTCCATGTG 
CGCCACAGGA 



21 . 
I 

TCGGACCTGC 
CGGCTTTTAC 
CTGGCAGAGG 
TACGCCGCTG 
CGCGAGGTCC 
CAGGGTGTGC 
GATGCACTTG 
GGCAACACTC 
CTGGCCCGAC 
CTGGTGGACA 
ATCAAGAATG 
TTCTTCTTCG 
GTGTGCACGA 
CCACGAGACC 
GCCAGTGGCC 
CCACTGCCGA 
CGGGGTCTCC 
ATCGGGGAGG 
ATCCAGAATA 
GGCTACCGTG 
CCTGGGCAGG 
GTGAGCACCC 
GCTTCTGTTG 
TGGAACTTGG 
GAGCCACCGC 
CAGCCGGGCA 
ACCCCTGCAA 
CAAGCCACCG 
ACCCAGTACC 
GGGAGTCAGA. 
GTGTCTGCTC 
CCGGAAACTC 
AGGGTGGCCT 
AGTGGTCCGG 
CTGCAGCCTG 
CCTGCTGCAG 
ACTCAG GCCA 
TACAGGGTTT 



31 
I 

CAAGGCCACC 
TGCCTAGGAT 
CGCCCCGAGT 
ACATTGTGTT 
GCAGCTTTCT 
GCTTTGCCAC 
GCTCTGGGGG 
GCACAGGGGC 
CTGGTGTCCC 
CAGCTGCCCA 
CTGACCCTGA 
TCAATGACTT 
CTGCTGGTGG 



41 
I 

GCAGGGGGGA 
GACGCTGCGG 
GCGAGCCCAG 
CTTACTGGAT 
CGAAGGGCTG 
AGTGCAGTAC 
TGATGTGATC 
TGCAATTCTC 
CAAGGTCTGC 
AAGGCTGAAG 
GGAGCTGAAG 
CAGCATCTTG 
CGTGCCTGTG 
TGAGCCAAGC 
CTACAAGGTC 
GGAGGTGAAC 
CGAGTACCAA 
GACAGCTCGG 
CAGCCTCCTG 
GGTCCTCAGT 
GCTGCGTGAC 
CAGTGTGGGG 
GCGCCCGGTC 
CCGTGGCTAC 
ACTGCCCTCT 
CCTCACACTC 
CACTGGACCA 
GCAGCGGGTG 
GCGCAGCACC 
CTTGGATGAC 
CCGTGAGGGC 
TCCAGGGCTG 
CCCTGGAGCC 
GACACTGCCC 
CCAGGTGGCT 
TCGAACGGAC 
TGTCACCATT 
AGCCCACGGC 



51 
I 

GCAAGGGACA 
CTTCTGGTGG 
CACAGGGAGA 
GGCTCCTCAT 
GTGCTGCCTT 
AGCGATGACC 
CGCGCCATCC 
CATGTGGCTG 
ATCCTGATCA 
GGGCAGGGGG 
CGAGTTGCCT 
AGGACACTAC 
ACCCGACCTC 
AGCCAATCCT 
CAGTACACTC 
GTCCCAGCTG 
GTGACTGTGA 
ACCACTGCCC 
GTGGCCTGGC 
GGTGGGCCCA 
TTGGAGCCTG 
CCCGCCACTT 
ATCCTGGGCC 
CGGTTGGAAT 
GATGTGACCC 
TACACTCTGC 
GAGCTGCCTG 
CGAGTGTCCT 
CAGGGGGTTG 
GTTCAGGCTG 
AGTGCCAGTG 
CGGGTTGTGG 
AGTGGATTTC 
CCAGACTCTA 
GTGTCGGTAC 
CCACTGGGCC 
ACCTGGACCA 
CCAGAGAAAT 
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CCCAGTTGGT TTCTGGGGAG GCCACGGTGG 
AGTATACGGT GCATGTGAGG GCCCATGTGG 
. TTGTGAGGAC TGCCCCTGAG CCTGTGGGTC 
CCAGCGACGT TCTACGGATC ACCTGGGTAG 
5 • CCTGGGGCCG GAGTGAAGGC GGCCCCATGA 
CTGCAGAGAT CCGGGGTCTC GAAGGTGGAG 
TOGGGGACCG CGAGGGCACA CCTGTCTCCA 
* CAGCCCTGGG GACGCTTCAC GTGGTGCAGC 

AGCCGGTGCC CAGAGCGCAG GGCTTCCTTC 

10 AGTCCCGGGT CCTGGGGCCC GAGCTCAGCA 
CACAGTACCG CGTGAGGCTG AGTGTCCTAG 
TGACTGCGCG CACTGAGTCA CCTCGTGTTC 
CGATCGACTC GGTGACTTTG GCCTGGACTC 
CCTGGCGGCC ACTCAGAGGC CCTGGCCAGG 

1 5 GGATCTCAAG CTCCCAGCGG GTGACAGGGC 
TGACGCCTGT* CCTGGATGGT GTGCGGGGTC 
GCCCCCGTGG CCTGGCGGAT GTGGTGTTCC 
GTGOGGAGGC TAOGAGGAGG GTCCTGGAGC 
CACAGGCAGT TCAGGTTGGC CTGCTGTCTT 

20 TGAATGGCTC CCATGACCTT GGCATTATCT 
ACCCAAGTGG GAACAACCTG GGCACAGCGG 
CAGATGCTCC TGGGOGCCGC CAGCACGTAC 
CCTTGAGAGG TGACATATTC AGCCCCATCC 
TGATGTTGGG AATGGCTGGA GOGGACCCAG 

25 ACTCTGTCCA GACCTTCTTC GCCGTGGATG 
GTCTGGOCAC AGCCCTGTGT CAGGCATCCT 
CAGTGTATTG TCCAAAGGGC CAGAAGGGGG 
TTGGGCCTCC TGGCGACCCT GGCCTCCCGG 
CCCCTGGAAG TGCCACTGCC AAGGGCGAGA 

30 GCAGCCCTGG CCGCGCCGGG AATCCTGGGA 
CACGGTTGCC TGGCCCTCGT GGGGACCCGG 
AGCCGGGGGC TCCCGGACAA GTCATCGGAG 
GGGACCCTGG ACCATCGGGC CCCCCTGGAC 
GTGGCCCCCC AGGGCTTCCT GGAACAGCCA 

35 GGGGTCCCCC TGGACCAGGT GAAGGTGGCA 
TTCCCGGAAG CCCTGGACCC CAAGGCCCCG 
GTGACTCTGA GGATGGAGCT CCAGGCCTCC 
GCCCACGGGG ACCTCCTGGA GCTATTGGCC 
. TGGGTGAGGC TGGAGAGAAG GGCGAACGTG 

40 TGCCAGGGGT TGCTGGACGT CCTGGAGCCA 
GCCGCCAAGG AGAGAAGGGG GAGCCTGGTC 
CTGTTGCTGG ACCCAAAGGA GAAAAGGGAG 
CCGGAGTCCA AGGGGAACGG GGCCCACCCG 
AGGGAGACCC TGGAGACCGG GGTCCCATTG 

45 ACTCAGGGCC TCCTGGAGAG AAGGGAGACC 
GCCCCCGAGG ACGAGATGGT GAAGTTGGAG 
CGGGTTTGCC TGGAAAAGCA GGCGAGCGTG 
CTGTGGGTGA AAAGGGAGAC CAGGGAGATC 
GATCATCTGG ACCCAAGGGT GACCGTGGGG 

50 TGGTAGACAC AGGACCTGGA GCCAGAGAGA 
GTCCTCGAGG GCCCAAGGGT GATCCTGGCC 
AAGGGTTTCG GGGACCCCCA GGCCCACAGG 
AAAAGGGTGA CCGGGGTCCC CCTGGGCTGG 
GAGCCGCTGG GCCCTCTGGG CCGAATGGTG 

55 ACGGGCTTCC AGGCCTCCGT GGAGAACAAG 
TACCGGGAAA GCCAGGCGAG GATGGGAAAC 
GGGACCCTGG AGAAGACGGG AGGAAGGGAG 
AAGGTCGTGA TGGCCCCAAG GGTGAGCGTG 
CTCCAGGCCT CCCAGGGCCA GTGGGCCCTC 

60 GCACGGGCCC CAAGGGTGAC CGTGGGGAGA 
GAGAGCGTGG CCTGCGAGGA GAGCCTGGAA 
CTGCTGGCAT CAAGGCATCT GCCCTGCGGG 
GTAGCTTCCT GCCTGTGCCC GAACGGCGTC 
GCCCCCCAGG CAAGGAGGGC CCCATCGGCT 

65 GTGGAGACCC TGGCCCTCAG GGGCCACCTG 
GGCCTTCCGG CCTTGCCGGG GAGCCTGGAA 
CTGGGGGTGT GGGAGAGGCA GGAAGGCCAG 
AACGTGGAGA ACAGGGCAGA GATGGCCCTC 
GACCCCCTGG CCCCAAGGTG TCTGTGGATG 

70 GACCCCCTGG ACTCAAGGGT GCTAAGGGGG 
AAGGAGACAG GGGTGTGCCA GGCATCAAAG 
AGGACGGCAA CCCGGGTCTA CCAGGAGAGC 
GTCTGCAGGG TCCAAGAGGC CCCCCTGGCC 
CTGGTGCCCC GGGTCTTGCT GGCCCTGCAG 

75 AGCCTGGAGA GACAGGACCT CCAGGACGGG 
TTCCTGGACC CCCCGGCCCT TCAGGCCTTG 
GACAAGTGGG GGAGACAGGG AAGCCGGGAG 
ATGGAGACAG AGGGAGCCCT GGTGTGCCAG 
CTAAAGGAGA- ACCTGGCCCC ACGGGGGCCC 

80 CAAAGGGAGA GAAGGGAGCC CCTGGAGGCC 
CCAAAGGTGA- CCGAGGACTG CCAGGGCCGC 
GGGAGCCCGG . AGACCCTGGG GAAGATGGTC 
AGGGTGACCC AGGAGTCGGG GTCCCGGGCT 
AGGGAGATCT GGGCCTCCCT- GGCCTGCCCG 

85 AGACAGGCCC TCGAGGAGAG ATGGGTCAGC 
-GCCCCCCAGG GAGAGAAGGA ATCCCAGGAC 
TGGGACCACC TGGGGCCTCT GGACTCAAAG 



CTGAGCTGGA TGGACTGGAG CCAGATACTG 2340 

CTGGCGTGGA TGGGCCCCCT GCCTCTGTGG 2400 

GTGTGTCGAG GCTGCAGATC CTCAATGCTT 2460 

GGGTCACTGG AGCCACAGCT TACAGACTGG 2520 

GGCACCAGAT ACTCCCAGGA AACACAGACT 2580 ' 

TCAGCTACTC AGTGCGAGTG ACTGCACTTG 2640 

TTGTTGTCAC- TACGCCGCCT GAGGCTCCGC 2700 

GCGGGGAGCA CTCGCTGAGG CTGCGCTGGG 2760 

TGCACTGGCA ACCTGAGGGT GGCCAGGAAC 2820 

GCTATCACCT GGACGGGCTG GAGCCAGCGA 2880 

GGCCGGCTGG AGAAGGGCCC TCTGCAGAGG 2940 

CAAGCATTGA ACTACGTGTG GTGGACACCT 3000 

CAGTGTCCAG GGCATCCAGC TACATCCTAT 3060 

AAGTGCCTGG GTCCCCGCAG ACACTTCCAG 3120 

TAGAGCCTGG CGTCTCTTAC ATCTTCTCCC 3180 

CTGAGGCATC TGTCACACAG ACGCCAGTGT 3240 

TACCACATGC CACTCAAGAC AATGCTCACC 330O 

GTCTGGTGTT GGCACTTGGG CCTCTTGGGC 3360 

ACAGTCATCG GCCCTCCCCA CTGTTCCCAC 3420 

TGCAAAGGAT CCGTGACATG CCCTACATGG 3480 

TGGTCACAGC TCACAGATAC ATGTTGGCAC 3540 

CAGGGGTGAT GGTTCTGCTA GTGGATGAAC 3600 

GTGAGGCCCA GGCTTCTGGG CTTAATGTGG 3660 

AGCAGCTGCG TCGCTTGGCG CCGGGTATGG 3720 

ATGGGCCAAG CCTGGACCAG GCAGTCAGTG 3780 

TCACTACTCA GCCCCGGCCA GAGCCCTGCC 3840 

AACCTGGAGA . GATGGGCCTG AGAGGACAAG 3900 

GCAGGACCGG TGCTCCCGGC CCCCAGGGGC 3960 

GGGGCTTCCC TGGAGCAGAT GGGCGTCCAG 40 20 

CCCCTGGAGC CCCTGGCCTA AAGGGCTCTC 4080 

GAGAGCGAGG ACCTCGAGGC CCAAAGGGGG 4140 

GTGAAGGACC TGGGCTTCCT GGGCGGAAAG 4200 

CTCGTGGACC ACTGGGGGAC CCAGGACCCC 4260 

TGAAGGGTGA CAAAGGCGAT CGTGGGGAGC 4320 

TTGCTCCTGG GGAGCCTGGG CTGCCGGGTC 4380 

TTGGCCCCCC TGGAAAGAAA GGAGAAAAAG 4440 

CAGGACAACC TGGGTCTCCG GGTGAGCAGG 4500 

CCAAAGGTGA CCGGGGCTTT CCAGGGCCCC 4560 

GACCCCCAGG CCCAGCGGGA TCCCGGGGGC 4620 

AGGGTCCTGA AGGGCCACCA GGACCCACTG 4680 

GCCCTGGGGA CCCTGCAGTG GTGGGACCTG 4740 

ATGTGGGGCC CGCTGGGCCC AGAGGAGCTA 4800 

GCTTGGTTCT TCCTGGAGAC CCTGGCCCCA 4860 

GCCTTACTGG CAGAGCAGGA CCCCCAGGTG 4920 

CTGGGCGGCC TGGCCCCCCA GGACCTGTTG 4980 

AGAAAGGTGA CGAGGGTCCT CCGGGTGACC 5040 

GCCTTCGGGG GGCACCTGGA GTTCGGGGGC 5100 

CTGGAGAGGA TGGACGAAAT GGCAGCCCTG 5160 

AGCCGGGTCC CCCAGGACCC CCGGGACGGC 5220 

AGGGAGAGCC TGGGGACCGC GGACAAGAGG 5280 

TCCCTGGAGC CCCTGGGGAA AGGGGCATTG 5340 

GGGACCCAGG TGTCCGAGGC CCAGCAGGAG 5400 

ATGGCCGGAG CGGACTGGAT GGGAAACCAG 5460 

CTGCAGGCAA AGCTGGGGAC CCAGGGAGAG 5520 

GCCTCCCTGG CCCCTCTGGT CCCCCTGGAT 5580 

CTGGCCTGAA TGGAAAAAAC GGAGAACCTG 5640 

AGAAAGGAGA TTCAGGCGCC TCTGGGAGAG 5700 

GAGCTCCTGG TATCCTTGGA CCCCAGGGGC 5760 

CTGGCCAGGG TTTTCCTGGT GTCCCAGGAG 5820 

CTGGATCCAA AGGGGAGCAG GGCCTCCCTG . 5880 

GTGTGCCGAA TGTGGATCGG TTGCTGGAAA 5940 

AGATCGTGGA GACCTGGGAT GAGAGCTCTG 6000 

GAGGCCCCAA GGGGGACTCA GGCGAACAGG 6060 

TTCCTGGAGA ACGCGGGCTG AAGGGCGACC 6120 

GTCTGGCCCT TGGGGAGAGG GGCCCCCCCG 6180 

AGCCTGGTAT TCCCGGGCTC CCAGGCAGGG 6240 

GAGAGAGGGG AGAACGGGGA GAGAAAGGAG 6300 

CTGGACTCCC TGGAACCCCT GGGCCCCCCG 6360 

AGCCAGGTCC TGGACTCTCT GGAGAACAGG 6420 

AGCCGGGCAG CAATGGTGAC CAAGGTCCCA 6480 

GAGACCGGGG AGAGCCTGGA CCGAGGGGTC 6540 

GTGGTATGGC TGGGCCTGAA GGGAAGCCGG 6600 

CAGTGGGTGG TCATGGAGAC CCTGGACCAC 6660 

GACCCCAAGG ACCTTCTGGC CTGAAGGGGG 6720 

GCCTGACTGG ACCTACTGGA GCTGTGGGAC 6780 

TGGGTCCACA GGGGTCTCCA GGTTTGCCTG ; 6840 

CCCCAGGTCG AGATGGTGCC AGTGGAAAAG 6900 

GGTCACCAGG TCTGCCTGGC CCTGTCGGAC 6960 

CTGGACAGGC TGTGGTCGGG" CTCCCTGGAG 7020 

TTGCTGGAGA CCTGGTGGGT GAGCCGGGAG 7080 

GAGGCGAGAA GGGTGAAGCT GGCCGTGCAG 7140 

AGAAAGGGGC TCCAGGACCC AAAGGTTTCA 7200 

CCCCTGGGCC TCCTGGCCCT CCAGGTGTGA 7260 

GTGCTCCTGG TGTTGTTGGG TTCCCGGGTC 7320 

CAGGCCCTAG TGGAGAGCGG GGTCTGGCAG 7380 

CCCTGGGGCC ACCTGGACCA CCGGGGTCAG 7440 

GAGACAAGGG AGACCCTGGA GTAGGGCTGC 7500 
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CTGGGCCCCG AGGCGAGCGT GGGGAGCCAG GCATCCGGGG TGAAGATGGC CGCCCOGGCC 7560 

AGGAGGGACC COGAGGACTC ACGGGGCCCC CTGGCAGCAG GGGAGAGCGT GGGGAGAAGG 7620' 

GTGATGTTGG GAGTGCAGGA CTAAAGGGTG ACAAGGGAGA CTCAGCTGTG ATCCTGGGGC 7680 

CTOCAGGCCC . ACGGGGTGCC AAGGGGGACA TGGGTGAA CG AGGGCCTCGG GGCTTGGATG 7740 

GTGACAAAGG ACCTCGGGGA GACAATGGGG ACCCTGGTGA CAAGGGCAGC AAGGGAGAGC 7800 

CTGGTGACAA GGGCTCAGCC GGGTTGC CA G GACTGCGTGG ACTCCTGGGA CCCCAGGGTC 7860 

AACCTGGTGC AGCAGGGATC CCTGGTGACC CGGGATCCCC AGGAAAGGAT GGAGTGCCTG 7920 

GTATCCGAGG AGAAAAAGGA GATGTTGGCT TCATGGGTCC CCGGGGCCTC AAGGGTGAAC 7980 

GGGGAGTGAA GGGAGCCTGT GGCCTTGATG GAGAGAAGGG AGACAAGGGA GAAGCTGGTC 8040 

ccccaggccg ccccgggctg' GCAGGACACA AAGGAGAGAT GGGGGAGCCT GGTGTGCCGG 8100 

* GCCAGTCGGG GGCCCCTGGC AAGGAGGGCC TGATOGGTCC CAAGGGTGAC CGAGGCTTTG 8160 

ACGGGCAGCC AGGCCCCAAG GGTGACCAGG GOGAGAAAGG GGAGCGGGGA ACCCCAGGAA 8220 

TTGGGGGCTT CCCAGGCCCC AGTGGAAATG ATGGCTCTGC TGGTCCCCCA GGGCCACCTG 8280 

GCAGTGTTGG TCCCAGAGGC CCOGAAGGAC TTCAGGGCCA GAAGGGTGAG CGAGGTCCCC 8340 

CCGGAGAGAG AGTGGTGGGG GCTCCTGGGG TCCCTGGAGC TCCTGGCGAG AGAGGGGAGC 8400 

AGGGGCGGCC AGGGCCTGCC GGTCCTCGAG GCGAGAAGGG AGAAGCTGCA CTGACGGAGG 8460 

ATGACATCCG GGGCTTTGTG CGCCAAGAGA TGAGTCAGCA CTGTGCCTGC CAGGGCCAGT 8520 

TCATCGCATC TGGATCACGA CCCCTCCCTA GTTATGCTGC AGACACTGCC GGCTCCCAGC 8580 

TCCATGCTGT GCCTGTGCTC CGCGTCTCTC ATGCAGAGGA GGAAGAGCGG GTACCCCCTG 8640 

AGGATGATGA GTACTCTGAA TACTCCGAGT ATTCTGTGGA GGAGTACCAG GACCCTGAAG 8700 . 

CTCCTTGGGA TAGTGATGAC CCCTGTTCCC TGCCACTGGA TGAGGGCTCC TGCACTGCCT 8760 

ACACCCTGCG CTGGTACCAT CGGGCTGTGA CAGGCAGCAC AGAGGCCTGT CACCCTTTTG 8820 

TCTATGGTGG CTGTGGAGGG AATGCCAACC GTTTTGGGAC CCGTGAGGCC TGCGAGCGCC 8880 

GCTGCCCACC CCGGGTGGTC CAGAGCCAGG GGACAGGTAC TGCCCAGGAC TGAGGCCCAG 8940 

ATAATGAGCT GAGATTCAGC ATCCCCTGGA GGAGTCGGGG TCTCAGCAGA ACCCCACTGT 9000 

CCCTCCCCTT GGTGCTAGAG GCTTGTGTGC ACGTGAGCGT GCGAGTGCAC GTCCGTTATT 9060 

TCAGTGACTT GGTCCCGTGG GTCTAGCCTT CCCCCCTGTG GACAAACCCC CATTGTGGCT 9120 

CCTGCCACCC TGGCAGATGA CTCACTGTGG GGGGGTGGCT GTGGGCAGTG AGCGGATGTG 9180 

ACTGGCGTCT GACCCGCCCC TTGACCCAAG CCTGTGATGA CATGGTGCTG ATTCTGGGGG 9240 
GCATTAAAGC TGCTGTTTTA AAAGGCAAAA AA 

Seq ID NO: 63 Protein sequence: 
Protein Accesslon#: NP 000085.1 
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1 11 21 31 41 51 

I i I I I I 

MTLRLLVAAL CAGILAEAPR VRAQHRERVT CTRLYAADIV FLLDGSSSIG RSNFREVRSF 
LEGLVLPFSG AASAQGVRFA TVQYSDDPRT EFGLDALGSG GDVTRAIREL SYKGGNTRTG 
AAILHVADHV FLPQLARPGV PKVCILITDG KSQDLVDTAA QRLKGQGVKL FAVGIKNADP 
EELKRVA5QP TSDFFFFVND FSILRTLLPL VSRRVCTTAG GVPVTRPPDD STSAPRDLVL 
SEPSSQSfcRV QWTAASGPVT GYKVQYTPLT GLGQPLPSER QEVNVPAGET SVRLRGLRPL 
TEYQVTVIAL YANSIGEAVS GTARTTALEG PELTIQNTTA HSLLVAWRSV PGATGYRVTW 
RVLSGGPTQQ QELGPGQGSV LLRDLEPGTD* YEVTVSTLFG RSVGPATSLM ARTDASVEQT 
LRPVILGPTS ILLSWNLVPE ARGYRLEWRR ETGIiEPPQKV VLPSDVTRYQ LDGLQPGTEY 
RLTLYTLLEG HEVATPATW PTGPELPVSP VTDLQATELP GQRVRVSWSP VPGATQYR 1 1 
VRSTQGVERT LVLPGSQTAF DLDDVQAGLS YTVRVSARVG PREGSASVLT VRREPETPLA 
VPGLRVWSD ATRVRVAWGP VPGASGFRIS WSTGSGPESS QTLPPDSTAT DITGLQPGTT 
YQVAVSVLRG REEGPAAVIV ARTDPLGPVR TVHVTQASSS SVTITWTRVP GATGYRVSWH 
SAHGPEKSQXi VSGEATVAEI* DGLEPDTEYT VHVRAHVAGV DGPPASWVR TAPEPVGRVS 
RLQILNASSD VLRITWVGVT GATAYRIAWG RSEGGPMRHQ ILPGNTDSAE IRGLEGGVSY 
SVRVTALVGD REGTPVSIW TTPPEAPPAL GTLHWQRGE HSLRLRWEPV PRAQGFLLHW 
QPEGGQEQSR VLGPELSSYH LDGLEPATQY RVRLSVLGPA GEGPSAEVTA RTESPRVPSI . 
ELRWDTSID SVTLAWTPVS RASSYILSWR PLRGPGQEVP GSPQTIiPGIS SSQRVTGLEP 
GVSYIFSLTP VLDGVRGPEA SVTQTPVCPR GLADWFLPH ATQDNAHRAE ATRRVLERLV 
LALGPLGPQA VQVGLLSYSH RPSPLFPLNG SHDLGIILQR IRDMPYMDPS GNNLGTAWT 
AHRYMLAPDA PGRRQHVPGV MVLLVDEPLR GDIFSPIREA QASGLNWML GMAGADPEQL 
RRLAPGMDSV QTFFAVDDGP SLDQAVSGLA TALCQASFTT QPRPEPCPVY CPKGQKGEPG 
EMGLRGQVGP PGDPGLPGRT GAPGPQGPPG SATAKGERGF PGADGRPGSP GRAGNPGTPG 
APGLKGSPGL PGPRGDPGER GPRGPKGEPG APGQVIGGEG PGLPGRKGDP GPSGPPGPRG 
PLGDPGPRGP PGLPGTAMKG DKGORGERGP PGPGEGGIAP GEPGLPGLPG SPGPQGPVGP 
PGKKGEKGDS EDGAPGLPGQ PGSPGEQGPR GPPGAIGPKG DRGFPGPLGE AGEKGERGPP 
GPAGSRGIiPG VAGRPGAKGP EGPPGPTGRQ GEKGEPGRPG DPAWGPAVA GPKGEKGDVG . 
PAGPRGATGV QGERGPPGLV LPGDPGPKGD PGDRGPIGLT GRAGPPGDSG PPGEKGDPGR 
PGPPGFVGPR GRDGEVGEKG DEGPPGDPGL PGKAGERGLR GAPGVRGPVG EKGDQGDPGE 
DGRNGSPGSS GPKGDRGEPG PPGPPGRLVD TGPGAREKGE PGDRGQEGPR GPKGDPGLPG 
APGERGIEGF RGPPGPQGDP GVRGPAGEKG DRGPPGLDGR SGLDGKPGAA GPSGPNGAAG 
KAGDPGRDGL PGLRGEQGLP GPSGPPGLPG KPGEDGKPGL NGKNGEPGDP GEDGRKGEKG 
DSGASGREGR DGPKGERGAP GILGPQGPPG LPGPVGPPGQ GFPGVPGGTG PKGDRGETGS 
KGSQGLPGER GLRGEPGSVP NVDRLLETAG IKASALREIV ETWDESSGSF LPVPERRRGP 
KGDSGEQGPP GKEGPIGFPG ERGLKGDRGD PGPQGPPGLA LGERGPPGPS GLAGEPGKPG 
IPGLPGRAGG VGEAGRPGER GERGEKGERG EQGRDGPPGL PGTPGPPGPP GPKVSVDEPG 
PGLSGEQGPP GLKGAKGEPG SNGDQGPKGD RGVPGIKGDR GEPGPRGQDG NPGLPGERGM 
AGPEGKPGLQ GPRGPPGPVG GHGDPGPPGA PGLAGPAGPQ GPSGLKGEPG ETGPPGRGLT 
GPTGAVGLPG PPGPSGLVGP QGSPGLPGQV GETGKPGAPG RDGASGKDGD RGSPGVPGSP 
GLPGPVGPKG EPGPTGAPGQ AWGLPGAKG EKGAPGGLAG DLVGEPGAKG DRGLPGPRGE 
KGEAGRAGEP GDPGEDGQKG APGPKGFKGD PGVGVPGSPG PPGPPGVKGD LGLPGLPGAP 
GWGFPGQTG PRGEMGQPGP SGERGLAGPP GREG I PGP U3 PPGPPGSVGP PGASGLKGDK 
GDPGVGLPGP RGERGEPGIR GEDGRPGQEG PRGLTGPPGS RGERGEKGDV GSAGLKGDKG 
DSAVILGPPG PRGAKGDMGE RGPRGLDGDK GPRGDNGDPG DKGSKGEPGD KGSAGLPGLR 
GLLGPQGQPG AAGIPGDPGS PGKDGVPGIR GEW30VGFMG PRGLKGERGV KGACGLDGEK ■ 
GD KG BAG PPG RPGLAGHKGE MGEPGVPGQS GAPGKEGLIG PKGDRGFDGQ PGPKGDOGEK 
GERGTPGIGG PPGPSGNBGS AGPPGPPGSV GPRGPEGLQG QKGERGPPGE RWGAPGVPG 
APGERGEQGR PGPAGPRGEK GEAALTEDDI RGFVRQEMSQ HCACQGQFIA SGSRPLPSYA 
ADTAGSQLHA VPVLRVSBAE EEERVPPEDD EYSEYSEYSV EEYQDPEAPW DSDDPCSLPL - 
DEGSCTAYTL RWYHRAVTGS TEACHPFVYG GCGGNANRFG TREACERRCP PRWQSQGTG 
TAQD 



60 
120 
180 
* 240 
300 
360 
' 420 
480 
540 
600 
660 
720 
780 
840 . 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380. 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 * 
2820 
2880 
2940 



212 



WO 02/086443. .1 

Seq ID NO: «4 DOT sequence 
Nucleic Acid Accession 8: NM_006945 
Coding sequence: .1-219 /. ' • 



PCT/US02/12476 



.5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



•1 . 11 21 31 . 41 SI 

I I I" I I I 

ATGTCTTATC AACAGCAGCA GTGCAAGCAG CCCTGCCAGC . CACCTCCTGT GTGCCCCACG 
CCAAAGTGCC CAGAGCCATG TCCACCCCCG AAGTGCCCTG AGCCCTGCCC ACCACCAAAG 
TGTCCACAGC CCTGCCCACC . TCAGCAGTGC CAGCAGAAAT ATCCTCCTGT GACACCTTCC 
CCACCCTGCC AGCCAAAGTA' TCCACCGAAG AGCAAGTAA 

Seq ID NO: 65 Protein . sequence : 
Protein Accession ft: NP_008876 

1 ■ U . 21 31 41" 51 . 

.1 - I L I I I 

MSYQQQQCKQ PCQPPPVCPT PKCPEPCPPP KCPEPCPPPK CPQPCPPQQC QQKYPPVTPS 
PPCQPKYPPK SK 

Seq ID NO: 66 DNA sequence 

Nucleic Acid Accession ft: NM_005629.1 

Coding sequence: 639-2546 



60 
120 
180 



1 
I 

TAGTOGGAGC 
CCGCCGCCGG 
CCGCCGCCAC 
CGGGGCGGGG 
CCGATGTCGC 
GCCGCCGCCC 
GCCGGGCCCC 
CTCACTTTCA 
CGTCCGCCCG 
CCTCGGGGCC 
GCCGCGACCC 
ACGGCATCTA 
ACGGGGCCCC 
CCGTGCCGCC 
TCGCCGTGGG 
GTGTGTTCCT 
TAGAGATCTC 
CCCTGTTCAA 
ACATCATGGT 
CCTGGGCCAC 
AAGACTGTGC 
CCCCTGTCAT 
CAGGGGCCCT 
TCTGTGTCTG 
CCTACGTGGT 
GCATCATTTA 
ATGCGGGGAC 
GCAGCTACAA 
ACAGTGGGAC 
CAGAGCAGGG 
CCTACCCGCG 
TCATGCTGTT 
GCCTCCTCGA 
CCCTCTGTTG 
ACGTCTTCCA 
TTTGGGAGTG 
CCTGTATGAT 
CGCTGGTCTG 
ACAACACCTA 
CCATGCTGTG 
CTGAGCGCTG 
CTCAGGACGC 
TCGTCGTGGT 
GCCATAGCAG 
CACTTTTGGG 
ACTTTTTCCA 
TCCCCCTCCA 
CAGTGCTGCA 
CTCTGCAGCA 
TGTGAGGAAG 
GAGGGGCAGC 
TTATAGAAGC 
CCAGTATCAA 
TATAGATCTC 
GGAGGCTTGT 
TTTCTAAAAA 
CCTACCTTAC 
GCAAAACAAA 
GTCCCCAGCC 
GCTGTCCCTT 
GGGCTGCTAA 
GACCCTGGAC 




CTCCCCGGTG 
CGGCCCGGCC 
TAGCGTGTCC 
GGCCAAGGGC 
GCGCGAGACC 
CTTGGGCAAC 
TATTCCCTAC 
GCTGGGCCAG 
AGGCCTGGGC 
GCTGGCCTGG 
ATGTGGCCAC 
CAATGCCAGC 
CGAGTTCTGG 
CAACTGGGAG 
GAAGGGGGTC 
CCTGGTCGTG 
CTATCTCAAG 
CCAGATTTTC 
CCGCTTCAAC 
CAGCTTCTTT 
CGTGCACATC 
GGCTGTCACG 
GCTGCTTGGT 
CCTCCTCCCG 
TGCCCTCTGC 
GCTGTTTGAC 
CGTGGTGGTG 
CGGGTACCGA 
CATGGGCATC 
CGTGTACCCG 
CGTGCCGCTG 
GCAGCACCTG 
AGATGTCAGG 
GGAGAGTGTC 
CCCCTGCTTC 
GTCTGCCTGG 
TTTTTAATAA 
GCCCTAGCCG 
CTCCTCCTGC 
CACCCGTGGG 
GGAAGGAGGG 
AGAACCAAGG 
TTAGAGAGCC 
TTGTGTGAGC 
TATCTCTTAG 
ATTTTGCACA 
GAGGAAGGAG 
CCCTCTGCCC 
AGCTTCGAGC 
CCAGACTGGA 
TGCCACAAGT 
CCTGGCCTGC 
ACGGCTCCCA 



21 
I 

GTCGCTGAGC 
GCGAGGCGCG 
GGAGTCGCGG 
ACAGGCCCCT 
TTAGGATGAG 
CAGGAGCCTC 
GCGCGCCCCC 
GGACTGCTTC 
CCCCGGCCCG 
CCGCCGGTGC 
GTGCGGCCCG 
GGCGACGAGA 
GACGGCCCCG 
TGGACGCGCC 
GTGTGGCGCT 
GTCCTGATCG 
TTCATGAAGG 
TACGCCTCCA 
GGCTTCTATT 
ACCTGGAACA 
CTGGCCAACC 
GAGAACAAAG 
GTGACCCTTT 
AAATCCACGG 
CTGCTGGTGC 
CCTGACTGGT 
TTTTCTTACG 
AACAACTGCT 
GCTGGCTTCG 
TCCAAGGTGG 
CTGATGCCAG 
CTCGACAGCC 
GCCTCCTACT 
TTTGTCATCG 
TACT ACT CGG 
GCCTGGGTGT 
CCTTGCCCCT 
TTCATCTTCA 
TGGTGGGGTG 
CACCTCCTGG 
ACCCAGCCCA 
GGCCTGACCA 
ATGTGACAAC 
AGCCCCACCG 
GGGAGGAGGG 
AACGCCAAAA 
AGCTGGTCCT 
CCCTGCCACG 
TGACCCCTCA 
AGAGACGGGA 
CAAATATTTC 
AGCCAGCAAT 
TTGGGTGCGA 
CAAAGGTGAA 
TTTTATAAAA 
CCCAAACCAT 
CTAGCCAAGG 
.TGTTGCGTGT 
TTGGAAAAGT 
CTGTGGGGCA 
TCAGGCTTCC 
CGTCCAGGCT 



31 
1 

CCGCCGCGGC 
CCOGAGCCGC 
GCCAGCCGGG 
GCTCCGGCCG 
TCTCGGGTCG 
GGGAGCCGCC 
GGGCCCCCGA 
TGACTGCGCC 
GCCGCCCCCC 
CCCCCGCCTG 
CCGGGGCCAT 
AGAAGGGCCC 
TGGGCCTGGG 
AGATGGACTT 
TCCCCTACCT 
CCCTGGTTGG 
CCGGCAGCAT 
TGGTGATCGT 
ACCTGGTCAA 
CTCCCGACTG 
TCACCTGTGA 
TCTTGAGGCT 
GTCTGCTGGC 
GAAAGATCGT 
GTGGAGTGCT 
CAAAGCTGGG 
CCATTGGCCT 
ACAAGGACGC 
TGGTCTTCTC 
CAGAGTCAGG 
TGGCCCCACT 
7VGTTTGTAGG 
ACTTCOGTTT 
ATCTCTCCAT 
CCAGCGGCAC 
ACGGAGCTGA 
GGATGAAATG 
ACGTTGTGTA 
AGGCCATGGG 
GCTGCCTCCT 
TCTGGGGCCT 
CCCTGACCCC 
TCAGCTCACA 
CACCCCTCCA 
GAGAAAGCAC 
ATATCACAAC 
AGGCCCCGCC 
CCCACCCCCT 
CCCCAGAAGC 
GGGAGGAGAG 
AGCTGGGCTA 
GGAACCTTCT 
GTGCACGCGT 
TGCCAGATGT 
ACTTGAGAGA 
CCTCTCCTTA 
AGTGTGAATT 
GTGAGTCTGT 
GCATGGTGGG 
AGAGGCTGCA 
CACCCTGTGC 
TAAGGTGGAT 



41 
I 

CCCGAGAGCG 
CGCCGCCGCC 
CAGCCTCCGC 
TCGTTTGCAG 
GGCGAGGAGC 
GCCGCCGCCG 
CACACATGAG 
GCCCGCGCCC 
GGCCCCCGGC 
ACCGCCGCCC 
GGCGAAGAAG 
CCTCATCGCG 
GACACCCGGC 
CATCATGTCG 
GTGCTACAAG 
AGGAATCCCC 
CAATGTCTGG 
CTTCTACTGC 
GTCCTTTACC 
CGTGGAGATC 
CCAGCTTGCT 
GTCTGGGGGA 
CTGCTGGGTG 
GTACTTCACT 
GCTGCCTGGC 
GTCCCCTCAG 
GGGGGCCCTC 
CATCATCCTG 
CATCCTGGGC 
GCCGGGCCTG 
CTGGGCTGCC 
TGTGGAGGGC 
CCAAAGGGAG 
GGTGACTGAT 
CACCCTGCTC 
CCGCTTCATG 
GTGCTGGTCC 
CTACGAGCCG 
CTGGGCCTTC 
CAGGGCCAAG 
CCACCACTTG 
AGTGTCCGAG 
TCACCAGCTC 
GGGGGCCTGC 
CATGAGTGCT 
CCACCAAAAA 
TAGTGCCCCA 
GCCCACCTCT 
AGCAGTGGCA 
AGAGGAGAAG 
TACCCCTCTC 
GGTTCCTGCG 
GCGTGAGTAC 
AAATGGCGCC 
ATGAGATTTC 
CCACTCCCAT 
TATAGATCTA 
TGTGTGGATG 
GGCCTCGGGG 
ATATTCCGTC 
GGGGCACACC 
GCACTTCCCG 



51 
I 

GCTGCAGCCG 
GCCACCGCCG 
GGGCCCCGGC 
ACOGCGGGCG 
CGCCGCAGCC 
CCGCCGCCCG 
ATTCTTCAGG 
CGCACCCCGC 
CGGCCCGCGC 
CCCGTGAGGC 
AGCGCCGAGA 
CCCGGGCCCG 
CGCCGCCTGG 
•TGCGTGGGCT 
AACGGCGGAG 
ATTTTCTTCT 
AACATCTGTC 
AACACCTACT 
ACCACGCTGC 
TTCCGCCATG 
GACCGCCGGT 
CTGGAGGTGC 
CTGGTCTACT 
GCTACATTCC 
GCCCTGGATG 
GTGTGGATAG 
ACAGCCCTGG 
GCTCTCATCA 
TTCATGGCTG 
GCCTTCATCG 
CTGTTCTTCT 
TTCATCACCG 
ATCTCTGTGG 
GGCGGGATGT 
TGGCAGGCCT 
GACGACATTG 
TTCTTCACCC 
CTGGTCTACA 
GCCCTGTCCT 
GGCACCATGG 
GAGTACCGAG 
AGCAGCAAGG 
ACCTCTGGTA 
CTTTCCCTGA 
CACTAAAACA 
TAGATGCCTC 
CCCCCACCCA 
CCAGGCTCTG 
GCTTGGGAAA 
GGAGGCAGGG 
CCCATCCCTG 
CCAATCGCCA 
GGAGAGTATA 
TCTGGGCAAA 
TGCTTGTATA 
CCCTGTGAGC 
ACTTTCATAG 
TGCGTGTGTG 
CTGTCCCCAC 
CTGGGTGTCT 
CCCAGGAAGG 
CACCTCCAGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
16B0 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
31B0 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 



213 
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CTTCTGTGTA GCAGCTTTAA CCCACGTTTG TCTGTCACGT CCAGTCCOGA GACGGCTGAG 3780 

TGACCCCAAG AAAGGCTTCC CCGACACCCA GACAGAGGCT GCAGGGCTGG GGCTGGGTGA 3840 

GGGTGGCGGG CCTGCGGGGA CATTCTACTG TGCTAAAAAG CCACTGCAGA CATAGCAATA - 3900 
AAAACATGTC ATTTTCC 
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Seq ID NO: 67 Protein sequence: 
Protein Accession 8: NP 005620.1 



10 



15 



20 



MAKKSAENGI 
PIMSCVGPAV 
INVWNICPLF 
CVErFRHEDC 
ACWVLVYFCV 
GSPQVWIDAG 
SILGPMAAEQ 
GVEGFITGLL 
TTLLWQAFWE 
YYEPLVYNNT 
LHHLEYRAGD 



11 
I 

YSVSGDEKKG 
GLGNVWRFPY 
KGLGYASMVI 
ANASLAKLTC 
WKGVKSTGKI 
TQIFFSYAIG 
GVHISKVAES 
DLLPASYYFR 
CWVAWVYGA 
YVYPWWGEAM 
ADVRGLTTLT 



21 
I 

PLIAPGPZX3A 
LCYKNGGGVP 
VPYCNTYYIM 
DQLADRRSPV 
VYPTATFPYV 
LGALTALGSY 
GPGLAFIAYP 
PQREISVALC 
DRFMDDIACM 
GWAFALSSML 
PVSESSKWV 



31 
I 

PAKGDGPVGL 
LIPYVLIALV 
VLAWGFYYLV 
IEFWENKVLR 
VLWLLVRGV 
NRFNNNCYKD 
RAVTLMPVAP 
CALCFVIDLS 
IGYRPCPWMK 
CVPLHLLGCL 
VBSVM 



41 

I ' 

GTPGGRLAVP 
GGIPIPFLEI 
KSFTTTLPWA 
LSGGLEVPGA 
LLPGALDGII 
AIILALINSG 
LWAALFFFML 
MVTDGGMYVF 
WCWSFFTPLV 
LRAKGTMAKR 



51 
I 

PRETWTRQMD 
SLGQFMKAGS 
TCGHTWNTPD 
LNWEVTLCtiL 
YYLKPDWSKL 
TSFFAGPWF 
LLLGLDSQFV 
QhFDYYSASG 
CMGIFIFKW 
WQHLTQPIWG 



60 
120 
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240 
300 
360 
420 
480 
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600 
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Seq ID NO: 68 DNA sequence 

Nucleic Acid Accession #: NM_021953.1 

Coding sequence: 178-2469 



1 11 21 31 41 51 

! I I I I I 

GGCACGAGGG GGACCCGGCC GGTCCGGCGC GAGCCCCCGT CCGGGGCCCT GGCTCGGCCC 60 

CCAGGTTGGA GGAGCCCGGA GCCCGCCTTC GGAGCTACGG CCTAACGGOG GCGGCGACTG 120 

CAGTCTGGAG GGTCCACACT TGTGATTCTC AATGGAGAGT GAAAACGCAG ATTCATAATG 180 

AAAGCTAGCC CCCGTCGGCC ACTGATTCTC AAAAGACGGA GGCTGCCCCT TCCTGTTCAA 240 

AATGCCCCAA GTGAAACATC AGAGGAGGAA CCTAAGAGAT CCCCTGCCCA ACAGGAGTCT 300 

AATCAAGCAG AGGCCTCCAA GGAAGTGGCG GAGTCCAACT CTTGCAAGTT TCCAGCTGGG 360 

ATCAAGATTA TTAACCACCC CACCATGCCC AACACGCAAG TAGTGGCCAT CCCCAACAAT 420 

GCTAATATTC ACAGCATCAT CACAGCACTG ACTGCCAAGG GAAAAGAGAG TGGCAGTAGT 480 

GGGCCCAACA AATTCATCCT CATCAGCTGT GGGGGAGCCC CAACTCAGCC TCCAGGACTC 540 

CGGCCTCAAA CCCAAACCAG CTATGATGCC AAAAGGACAG AAGTGACCCT GGAGACCTTG 600 

GGACCAAAAC CTGCAGCTAG GGATGTGAAT CTTCCTAGAC CACCTGGAGC CCTTTGCGAG 660 

CAGAAACGGG AGACCTGTGC AGATGGTGAG GCAGCAGGCT GCACTATCAA CAATAGCCTA 720 

TCCAACATCC AGTGGCTTCG AAAGATGAGT TCTGATGGAC TGGGCTCCCG CAGCATCAAG 780 

CAAGAGATGG AGGAAAAGGA GAATTGTCAC CTGGAGCAGC GACAGGTTAA GGTTGAGGAG 840 

CCTTCGAGAC CATCAGCGTC CTGGCAGAAC TCTGTGTCTG AGCGGCCACC CTACTCTTAC 900 

ATGGCCATGA TACAATTCGC CATCAACAGC ACTGAGAGGA AGCGCATGAC TTTGAAAGAC 960 

ATCTATACGT GGATTGAGGA CCACTTTCCC TACTTTAAGC ACATTGCCAA GCCAGGCTGG 1020 

AAGAACTCCA TCCGCCACAA CCTTTCCCTG CACGACATGT TTGTCGGGGA GACGTCTGCC 1080 

AATGGCAAGG TCTCCTTCTG GACCATTCAC CCCAGTGCCA ACCGCTACTT GACATTGGAC 1140 

CAGGTGTTTA AGCCACTGGA CCCAGGGTCT CCACAATTGC CCGAGCACTT GGAATCACAG 1200 

CAGAAACGAC CGAATCCAGA GCTCCGCCGG AACATGACCA TCAAAACCGA ACTCCCCCTG 1260 

GGCGCACGGC GGAAGATGAA GCCACTGCTA CCACGGGTCA GCTCATACCT GGTACCTATC 1320 

CAGTTCCCGG TGAACCAGTC ACTGGTGTTG CAGCCCTCGG TGAAGGTGCC ATTGCCCCTG 1380 

GCGGCTTCCC TCATGAGCTC AGAGCTTGCC CGCCATAGCA AGCGAGTCCG CATTGCCCCC 1440 

AAGGTGCTGC TAGCTGAGGA GGGGATAGCT CCTCTTTCTT CTGCAGGACC AGGGAAAGAG 15 00 

GAGAAACTCC TGTTTGGAGA AGGGTTTTCT CCTTTGCTTC CAGTTCAGAC TATCAAGGAG 1560 

GAAGAAATCC AGCCTGGGGA GGAAATGCCA CACTTAGCGA GACCCATCAA AGTGGAGAGC 1620 

CCTCCCTTGG AAGAGTGGCC CTCCCCGGCC CCATCTTTCA AAGAGGAATC ATCTCACTCC 1680 

TGGGAGGATT CGTCCCAATC TCCCACCCCA AGACCCAAGA AGTCCTACAG TGGGCTTAGG 1740 

TCCCCAACCC GGTGTGTCTC GGAAATGCTT GTGATTCAAC ACAGGGAGAG GAGGGAGAGG 1800 

AGCCGGTCTC GGAGGAAACA GCATCTACTG CCTCCCTGTG TGGATGAGCC GGAGCTGCTC 1860 

TTCTCAGAGG GGCCCAGTAC TTCCCGCTGG GCCGCAGAGC TCCCGTTCCC AGCAGACTCC 1920 

TCTGACCCTG CCTCCCAGCT CAGCTACTCC CAGGAAGTGG GAGGACCTTT TAAGACACCC 1980 

ATTAAGGAAA CGCTGCCCAT CTCCTCCACC CCGAGCAAAT CTGTCCTCCC CAGAACCCCT 2040 

GAATCCTGGA GGCTCACGCC CCCAGCCAAA GTAGGGGGAC TGGATTTCAG CCCAGTACAA 2100 

ACCTCCCAGG GTGCCTCTGA CCCCTTGCCT GACCCCCTGG GGCTGATGGA TCTCAGCACC 2160 

ACTCCCTTGC AAAGTGCTCC CCCCCTTGAA TCACCGCAAA GGCTCCTCAG TTCAGAACCC 2220 

TTAGACCTCA TCTCCGTCCC CTTTGGCAAC TCTTCTCCCT CAGATATAGA CGTCCCCAAG 2280 

CCAGGCTCCC CGGAGCCACA GGTTTCTGGC CTTGCAGCCA ATOGTTCTCT GACAGAAGGC 2340 

CTGGTCCTGG ACACAATGAA TGACAGCCTC AGCAAGATCC TGCTGGACAT CAGCTTTCCT 2400 

GGCCTGGACG AGGACCCACT GGGCCCTGAC AACATCAACT GGTCCCAGTT TATTCCTGAG 2460 

CTACAGTAGA GCCCTGCCCT TGCCCCTGTG CTCAAGCTGT CCACCATCCC GGGCACTCCA 2520 

AGGCTCAGTG CACCCCAAGC CTCTGAGTGA GGACAGCAGG CAGGGACTGT TCTGCTCCTC 2580 

ATAGCTCCCT GCTGCCTGAT TATGCAAAAG TAGCAGTCAC ACCCTAGCCA CTGCTGGGAC 2640 

CTTGTGTTCC CCAAGAGTAT CTGATTCCTC TGCTGTCCCT GCCAGGAGCT GAAGGGTGGG 2700 

AACAACAAAG GCAATGGTGA AAAGAGATTA GGAACCCCCC AGCCTGTTTC CATTCTCTGC 2760 

CCAGCAGTCT CTTACGTTCC CTGATCTTTG CAGGGTGGTC CGTGTAAATA GTATAAATTC 2820 

TCCAAATTAT CCTCTAATTA TAAATGTAAG CTTATTTCCT TAGATCATTA TCCAGAGACT 2880 

GCCAGAAGGT GGGTAGGATG ACCTGGGGTT TCAATTGACT TCTGTTCCTT GCTTTTAGTT 2940 

TTGATAGAAG GGAAGACCTG CAGTGCACGG TTTCTTCCAG GCTGAGGTAC CTGGATCTTG 3000 

' GGTTCTTCAC TGCAGGGACC CAGACAAGTG GATCTGCTTG CCAGAGTCCT TTTTGCCCCT 3060 

CCCTGCCACC TCCCCGTGTT TCCAAGTCAG CTTTCCTGCA AGAAGAAATC CTGGTTAAAA 3120 

AAGTCTTTTG TATTGGGTCA GGAGTTGAAT TTGGGGTGGG AGGATGGATG CAACTGAAGC 3180 

AGAGTGTGGG TGCCCAGATG TGCGCTATTA GATGTTTCTC TGATAATGTC CCCAATCATA 3240 

CCAGGGAGAC TGGCATTGAC GAGAACTCAG GTGGAGGCTT GAGAAGGCCG AAAGGGCCCC 3300 

TGACCTGCCT GGCTTCCTTA GCTTGCCCCT . CAGCTTTGCA AAGAGCCACC CTAGGCCCCA 3360 

GCTGACCGCA TGGGTGTGAG CCAGCTTGAG AACACTAACT ACTCAATAAA AGCGAAGGTG 3420 
GACCNAAAAA AAAAAAAAAA AAAA. 



214 



WO 02/086443 



PCT/US02/12476, 



Seq ID NO : 69 protein sequence: 
Protein Accession St NP_068772.l 
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10 
15: 
20 
25 
30 
35 
40 
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70 
75 

so ; 
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MKASPRRPLI 
GIKIINHPTM 
LRPQTQTSYD 
L5NIQRLRKM 
YMAMIQFAIN 
ANGKVSFWTI 
LGARRKMKPL 
PKVLLAEEGI 
SPPIiEEWPSP 



PIKETLPISS 
TTPLQSAPPL 
GLVLDTMNDS 



11 
I 

LKRRRLPLPV 
FNTQWAIPN 
AKRTEVTLET 
SSDGLGSRSl 
STERKRMTLK 
HPSANRYLTL 
LPRVSSYLVP 
APLSSAGPGK 
APSFKEESSH 
LPPCVDEPEI* 
TPSXSVliPRT 
ESPQRLLSSE 
LSKILLDISP 



21 
I 

QNAPSETSEE 
NANIHSIITA 
LGPKPAARDV 
KQ EMEEKENC 
DIYTWIEDHP 
DQVFKPLDPG 
IQFPVNQSLV 
EEKLLFGEGP 
SWEDSSQSPT 
LPSE6PSTSR 
PESWRLTPPA 
PLDLISVPFG 
PGLDEDPLGP 



31 
I 

EPKRSPAQQE 
LTAKGKESGS 
NLPRPPGALC 
KLEQrtQVKVE 
PYFKHIAKPG 
SPQLPEHLES 
LQPSVKVPLP 
SPLLPVQTIK 
PRPKKSYSGL 
WAAELPPPAD 
KVGGLDFSPV 
NSSPSDIDVP 
DNINWSQFIP 



41 
I 

SNQAEASKEV 
SGPNKFILIS 
EQKRETCADG 
EPSRPSASWQ 
HKNSIRHNLS 
QQKRPNPELR 
XiAASLMSSEL 
EEEIQPGEEM 
RSPTRCVSEM 



QTSQGASOPL 
KPGSPEPQVS 
ELQ ' 



51 
I 

AESNSCKFPA 
CGGAPTQPPG 
EAAGCTINNS 
NSVSERPPYS 
LHDMFVRETS 
RNMTIKTELP 
ARKSKRVRIA 
PKLARPIKVE 
LVIQHRERRE 
SQEVGGPFKT 
PDPLGLMDLS 
GLAANRSLTE 



Seq ID NO: 70 DNA sequence 

Nucleic Acid Accession #: BC006S29.1 

Coding sequence: 178-2424 



1 
I 

GGCACGAGGG 
CCAGGTTGGA 
CAGTCTGGAG 
AAAACTAGCC 
AATGCCCCAA 
AATCAAGCAG 
ATCAAGATTA 
GCTAATATTC 
GGGCCCAACA 
CGGCCTCAAA 
GGACCAAAAC 
CAGAAACGGG 
TCCAACATCC 
CAAGAGATGG 
CCTTGGAGAC 
ATGGCCATGA 
ATCTATACGT 
AAGAACTCCA 
AATGGCAAGG 
CAGGTGTTTA 
ACCGAACTCC 
TACCTGGTAC 
GTGCCATTGC 
GTCCGCATTG 
GGACCAGGGA 
CAGACTATCA 
ATCAAAGTGG 
GAATCATCTC 
TACAGTGGGC 
GAGAGGAGGG 
GAGCCGGA6C 
TTCCCAGCAG 
CCTTTTAAGA 
CTCCCCAGAA 
TTCAGCCCAG 
ATGGATCTCA 
CTCAGTTCAG 
ATAGACGTCC 
TCTCTGACAG 
GACATCAGCT 
CAGTTTATTC 
ATCCCGGGCA 
ACTGTTCTGC 
AGCCACTGCT 
GAGCTGAAGG 
GTTTCCATTC 
AAATAGTATA 
CATTATCCAG 
TCCTTGCTTT 
GGTACCTGGA 
GTCCTTTTTG 
AAATCCTGGT 
GGATGCAACT 
ATGTCCCCAA 
GGCCGAAAGG 
. CCACCCTAGG 
ATAAAAGCGA 



11 
I 

GGACCCGGCC 
GGAGCCCGGA 
GGTCCACACT 
CCCGTCGGCX: 
GTGAAACATC 
AGGCCTCCAA 
TTAACCACCC 
ACAGCATCAT 
AATTCATCCT 
CCCAAACCAG 
CTGCAGCTAG 
AGACCTGTGC 
AGTGGCTTCG 
AGGAAAAGGA 
CATCAGCGTC 
TACAATTCGC 
GGATTGAGGA 
TCCGCCACAA 
TCTCCTTCTG 
AGCAGCAGAA 
CCCTGGGCGC 
CTATCCAGTT 
CCCTGGCGGC 
CCCCCAAGGT 
AAGAGGAGAA 
AGGAGGAAGA 
AGAGCCCTCC 
ACTCCTGGGA 
TTAGGTCCCC 
AGAGGAGCCG 
TGCTCTTCTC 
ACTCCTCTGA 
CACCCATTAA 
CCCCTGAATC 
TACAAACCCC 
GCACCACTCC 
AACCCTTAGA 
CCAAGCCAGG 
AAGGCCTGGT 
TTCCTGGCCT 
CTGAGCTACA 
CTCCAAGGCT 
TCCTCATAGC 
GGGACCTTGT 
GTGGGAACAA 
TCTGCCCAGC 
AATTCTCCAA 
AGACTGCCAG 
TAGTTTTGAT 
TCTTGGGTTC 
CCCCTCCCTG 
TAAAAAAGTC 
GAAGCAGAGT 
TCATACCAGG 
GCCCCTGACC 
CCCCAGCTGA 
AGGTGGAAAA 



21 
I 

GGTCCGGCGC 
GCCCGCCTTC 
TGTGATTCTC 
ACTGATTCTC 
AGAGGAGGAA 
GGAAGTGGCA 
CACCATGCCC 
CACAGCACTG 
CATCAGCTGT 
CTATGATGCC 
GGATGTGAAT 
AGATGGTGAG 
AAAGATGAGT 
GAATTGTCAC 
CTGGCAGAAC 
CATCAACAGC 
CCACTTTCCC 
CCTTTCCCTG 
GACCATTCAC 
ACGACCGAAT 
ACGGCGGAAG 
CCCGGTGAAC 
TTCCCTCATG 
GCTGCTAGCT 
ACTCCTGTTT 
AATCCAGCCT 
CTTGGAAGAG 
GGATTCGTCC 
AACCCGGTGT 
GTCTCGGAGG 
AGAGGGGCCC 
CCCTGCCTCC 
GGAAACGCTG 
CTGGAGGCTC 
CCAGGGTGCC 
CTTGCAAAGT 
CCTCATCTCC 
CTCCCCGGAG 
CCTGGACACA 
GGACGAGGAC 
GTAGAGCCCT 
CAGTGCACCC 
TCCCTGCTGC 
GTTCCCCAAG 
CAAAGGCAAT 
AGTCTCTTAC 
ATTATCCTCT 
AAGGTGGGTA 
AGAAGGGAAG 
TTCACTGCAG 
CCACCTCCCC 
TTTTGTATTG 
GTGGGTGCCC 
GAGACTGGCA 
TGCCTGGCTT 
CCGCATGGGT 
AAAAAAAAAA 



31 
I 

GAGCCCCCGT 
GGAGCTACGG 
AATGGAGAGT 
AAAAGACGGA 
CCTAAGAGAT 
GAGTCCAACT 
AACACGCAAG 
ACTGCCAAGG 
GGGGGAGCCC 
AAAAGGACAG 
CTTCCTAGAC 
GCAGCAGGCT 
TCTGATGGAC 
CTGGAGCAGC 
TCTGTGTCTG 
ACTGAGAGGA 
TACTTTAAGC 
CACGACATGT 
CCCAGTGCCA 
CCAGAGCTCC 
ATGAAGCCAC 
CAGTCACTGG 
AGCTCAGAGC 
GAGGAGGGGA. 
GGAGAAGGGT 
GGGGAGGAAA 
TGGCCCTCCC 
CAATCTCCCA 
GTCTCGGAAA 
AAACAGCATC 
AGTACTTCCC 
CAGCTCAGCT 
CCCATCTCCT 
ACGCCCCCAG 
TCTGACCCCT 
GCTCCCCCCC 
GTCCCCTTTG 
CCACAGGTTT 
ATGAATGACA 
CCACTGGGCC 
GCCCTTGCCC 
CAAGCCTCTG 
CTGATTATGC 
AGTATCTGAT 
GGTGAAAAGA 
CTTCCCTGAT 
AATTATAAAT 
GGATGACCTG 
ACCTGCAGTG 
GGACCCAGAC 
GTGTTTCCAA 
GGTCAGGAGT 
AGATGTGCGC 
TTGACGAGAA 
CCTTAGCTTG 
GTGAGCCAGC 
AAAAAAA 



41 
1 

CCGGGGCCCT 
CCTAACGGCG 
GAAAACGCAG 
GGCTGCCCCT 
CCCCTGCCCA 
CTTGCAAGTT 
TAGTGGCCAT 
GAAAAGAGAG 
CAACTCAGCC 
AAGTGACCCT 
CACCTGGAGC 
GCACTATCAA 
TGGGCTCCCG 
GACAGGTTAA 
AGCGGCCACC 
AGCGCATGAC 
ACATTGCCAA 
TTGTCCGGGA 
ACCGCTACTT 
GCCGGAACAT 
TGCTACCACG 
TGTTGCAGCC 
TTGCCCGCCA 
TAGCTCCTCT 
TTTCTCCTTT 
TGCCACACTT 
CGGCCCCATC 
CCCCAAGACC 
TGCTTGTGAT 
TACTGCCTCC 
GCTGGGCCGC 
ACTCCCAGGA 
CCACCCCGAG 
CCAAAGTAGG 
TGCCTGACCC 
TTGAATCACC 
GCAACTCTTC 
CTGGCCTTGC 
GCCTCAGCAA 
CTGACAACAT 
CTGTGCTCAA 
AGTGAGGACA • 
AAAAGTAGCA 
TCCTCTGCTG 
GATTAGGAAC 
CTTTGCAGGG 
GTAAGCTTAT 
GGGTTTCAAT 
CACGGTTTCT 
AAGTGGATCT 
GTCAQCTTTC 
TGAATTTGGG 
TATTAGATGT 
CTCAGGTGGA 
CCCCTCAGCT 
TTGAGAACAC 



AGAGCTCCCG 
AGTGGGAGGA 
CAAATCTGTC 
GGGACTGGAT 
CCTGGGGCTG 
GCAAAGGCTC 
TCCCTCAGAT 
AGCCAATCGT 
GATCCTGCTG 
CAACTGGTCC 
GCTGTCCACC 
GCAGGCAGGG 
GTCACACCCT 
TCCCTGCCAG 
CCCCCAGCCT 
TGGTCCGTGT 
TTCCTTAGAT 
TGACTTCTGT 
TCCAGGCTGA 
GCTTGCCAGA 
CTGCAAGAAG 
GTGGGAGGAT 
TTCTCTGATA 
GGCTTGAGAA 
TTGCAAAGAG 
TAACTACTCA 
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120 
180 
240 
300 
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780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
19B0 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 



Seq ID NO: 71 Protein sequence: 
Protein Accession 8: AAH06529.1 



215 



WO 02/086443 



PCT/US02/12476 



5 

.10. 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



1 
I 

MXTSPRRPLI 
GIKIINHPTM 
LRPQTQTSYD 
LSNIQWLRKM 
YMAMIQFAIN 
ANGKVSFWTI 
SYLVPIQPPV 
AGPGKEEKLL 
EESSHSriEDS 
DEPELLPSEG 
VLPRTPBSWR 
LLSSEPLDLI 
LDISFPGLDE 



11 
I 

LKRRRLPLFV 
PNTQWAIPN 
AKRTEVTLET 
SSDGLGSRSI 
STERKRMTLK 
HPSANRYLTL 
NQSLVLQPSV 
FGEGFSPULP 
SQSPTPRPKK- 
PSTSRWAAEL 
LTPPAKVGGL 
SVPFGNSSPS 
DPLGPDNINW 



21 
I 

QNAPSETSEE 
NANIHSI ITA 
LGPKPAARDV 
KQEMBEKENC 
DIYTWIEDHF 
DQVFKQQKRP 
KVPLPLAASL 
VQTIKEEEIQ 
SYSGLRSPTR 
PFPADSSDPA 
DPSPVQTPQG 
DIDVPKPGSP 
SQFIPELQ 



31 
] 

EPKRSPAQQE 
LTAKGKESGS 
NLPRPPGALC 
HLEQRQVKVE 
PVPKHIAKPG 
NPELRRNMTI 
MSSELARHSK 
PGEEMPHLAR 
CVSEMLVIQH 
SQLSYSQEVG 
ASDPLPDPLG 
EPQVSGIAAN 



SGPNKFXLIS 
EQKRETCADG 
EPSRPSASWQ 
WKNSIRKNLS 
KTELPLGARR 
RVRIAPKVLL 
PIKVESPPLE 
RERRERSSSR 
GPFKTPIKET 
mDLSTTPLQ 
RSLTEGLVLD 



51 
I 

AESNSCKFPA 
CGGAPTQPPG 
EAAGCTINNS 
NSVSERPPYS 
LHDMFVRETS 
KMKPLLPRVS 
AEEGIAPLSS 
EWPSPAPSFX 
RiCQHLLPPCV 
LPISSTPSKS 
SAPPLESPQR 
TMNDSLSKIL 



Seq ID NO: 72 DNA sequence 
Nucleic Acid Accession #: U74G12.1 
Coding sequence: 178-2583 



GGCACGAGGG 
CCAGGTTGGA 
CAGTCTGGAG 
AAAACTAGCC 
AATGCCCCAA 
AATCAAGCAG 
ATCAAGATTA 
GCTAATATTC 
GGGCCCAACA 
CGGCCTCAAA 
GGACCAAAAC 
CAGAAACGGG 
TCCAACATCC 
CAAGAGATGG 
CCTTCGAGAC 
ATGGCCATGA 
ATCTATACGT 
AAGAACTCCA 
AATGGCAAGG 
CAGGTGTTTA 
CAGAAACGAC 
GGCGCACGGC 
CAGTTCCCGG 
GCGGCTTCCC 
AAGGTTTTTG 
CGAGATTTTG 
CTGCTAGCTG 
CTCCTGTTTG 
PlTCCAGCCTG 
TTGGAAGAGT 
GATTCGTCCC 
ACCCGGTGTG 
TCTCGGAGGA 
GAGGGGCCCA 
CCTGCCTCCC 
GAAACGCTGC 
TGGAGGCTCA 
CAGGGTGCCT 
TTGCAAAGTG 
CTCATCTCCG 
TCCCCGGAGC 
CTGGACACAA 
GACGAGGACC 
TAGAGCCCTG 
AGTGCACCCC 
CCCTGCTGCC 
TTCCCCAAGA 
AAAGGCAATG 
GTCTCTTACC 
TTATCCTCTA 
AGGTGGGTAG 
GAAGGGAAGA 
TCACTGCAGG 
CACCTCCCCG 
TTTGTATTGG 
TGGGTGCCCA 
AGACTGGCAT 
GCCTGGCTTC 
CGCATGGGTG 
AAAAAAAAAA 



11 

I . 

GGACCCGGCC 
GGAGCCCGGA 
GGTCCACACT 
CCCGTCGGCC 
GTGAAACATC 
AGGCCTCCAA 
TTAACCACCC 
ACAGCATCAT 
AATTCATCCT 
CCCAAACCAG 
CTGCAGCTAG 
AGACCTGTGC 
AGTGGCTTCG 
AGGAAAAGGA 
CATCAGCGTC 
TACAATTCGC 
GGATTGAGGA 
TCCGCCACAA 
TCTCCTTCTG 
AGCCACTGGA 
CGAATCCAGA 
GGAAGATGAA 
TGAACCAGTC 
TCATGAGCTC 
GGGAACAGGT 
GTACACCCAT 
AGGAGGGGAT 
GAGAAGGGTT 
GGGAG G AAA T 

AATCTCCCAC 
TCTCGGAAAT 
AACAGGATCT 
GTACTTCCCG 
AGCTCAGCTA 
CCATCTCCTC 
CGCCCCCAGC 
CTGACCCCTT 
CTCCCCCCCT 
TCCCCTTTGG 
CACAGGTTTC 
TGAATGACAG 
CACTGGGCCC 
CCCTTGCCCC 
AAGCCTCTGA 
TGATTATGCA 
GTATCTGATT 
GTGAAAAGAG 
TTCCCTGATC 
ATTATAAATG 
GATGACCTGG 
CCTGCAGTGC 
GACCCAGACA 
TGTTTCCAAG 
GTCAGGAGTT 
GATGTGCGCT 
TGACGAGAAC 
CTTAGCTTGC 
TGAGCCAGCT 
AAAAA 



21 
I 

GCCCGCCTTC 
TGTGATTCTC 
ACTGATTCTC 
AGAGGAGGAA 
GGAAGTGGCA 
CACCATGCCC 
CACAGCACTG 
CATCAGCTGT 
CTATGATGCC 
GGATGTGAAT 
AGATGGTGAG 
AAAGATGAGT 
GAATTGTCAC 
CTGGCAGAAC 
CATCAACAGC 
CCACTTTCCC 
CCTTTCCCTG 
GACCATTCAC 
CCCAGGGTCT 
GCTCCGCXX5G 
GCCACTGCTA 
ACTGGTGTTG 
AGAGCTTGCC 
GGTGTTTGGT 
CACCAGCTTG 
AGCTCCTCTT 
TTCTCCTTTG 
GCCACACTTA 
GGCCCCATCT 
CCCAAGACCC 
GCTTGTGATT 
ACTGCCTCCC 
CTGGGCCGCA 
CTCCCAGGAA 
CACCCCGAGC 
CAAAGTAGGG 
GCCTGACCCC 
TGAATCACCG 
CAACTCTTCT 
TGGCCTTGCA 
CCTCAGCAAG 
TGACAACATC 
TGTGCTCAAG 
GTGAGGACAG 
AAAGTAGCAG 
CCTCTGCTGT 
ATTAGGAACC 
TTTGCAGGGT 
TAAGCTTATT 
GGTTTCAATT 
ACGGTTTCTT 
AGTGGATCTG 
TCAGCTTTCC 
GAATTTGGGG 
ATTAGATGTT 
TCAGGTGGAG 
CCCTCAGCTT 
TGAGAACACT 



31 
I 

GAGOCCCCGT 
GGAGCTACGG 
AATGGAGAGT 
AAAAGACGGA 
CCTAAGAGAT 
GAGTCCAACT 
AACACGCAAG 
ACTGCCAAGG 
GGGGGAGCCC 
AAAAGGACAG 
CTTCCTAGAC 
GCAGCAGGCT 
TCTGATGGAC 
CTGGAGCAGC 
TCTGTGTCTG 
ACTGAGAGGA 
TACTTTAAGC 
CACGACATGT 
CCCAGTGCCA 
CCACAATTGC 
AACATGACCA 
CCACGGGTCA 
CAGCCCTCGG 
CGCCATAGCA 
TACATGAGTA 
TTTAATTTTA 
TCTTCTGCAG 
CTTCCAGTTC 
GCGAGACCCA 
TTCAAAGAGG 
AAGAAGTCCT 
CAACACAGGG 
TGTGTGGATG 
GAGCTCCCGT 
GTGGGAGGAC 
AAATCTGTCC 
GGACTGGATT 
CTGGGGCTGA 
CAAAGGCTCC 
CCCTCAGATA 
GCCAATCGTT 
ATCCTGCTGG 
AACTGGTCCC 
CTGTCCACCA 
CAGGCAGGGA 
TCACACCCTA 
CCCTGCCAGG 
CCCCAGCCTG 
GGTCCGTGTA 
TCCTTAGATC 
GACTTCTGTT 
CCAGGCTGAG 
CTTGCCAGAG 
TGCAAGAAGA 
TGGGAGGATG 
TCTCTGATAA 
GCTTGAGAAG 
TGCAAAGAGC 
AACTACTCAA 



41 

I 

CCGGGGCCCT 
CCTAACGGCG 
GAAAACGCAG 
GGCTGCCCCT 
CCCCTGCCCA 
CTTGCAAGTT 
TAGTGGCCAT 
GAAAAGAGAG 
CAACTCAGCC 
AAGTGACCCT 
CACCTGGAGC 
GCACTATCAA 
TGGGCTCCCG 
GACAGGTTAA 
AGCGGCCACC 
AGCGCATGAC 
ACATTGCCAA 
TTGTCCGGGA 
ACCGCTACTT 
CCGAGCACTT 
TCAAAACCGA 
GCTCATACCT 
TGAAGGTGCC 
AGCGAGTCCG 
AGTTCTTTAG 
TCTTTCTTTG 
GACCAGGGAA 
AGACTATCAA 
TCAAAGTGGA 
AATCATCTCA 
ACAGTGGGCT 
AGAGGAGGGA 
AGCCGGAGCT 
TCCCAGCAGA 
CTTTTAAGAC 
TCCCCAGAAC 
TCAGCCCAGT 
TGGATCTCAG 
TCAGTTCAGA 
TAGACGTCCC 
CTCTGACAGA 
ACATCAGCTT 
AGTTTATTCC 
TCCCGGGCAC 
CTGTTCTGCT 
GCCACTGCTG 
AGCTGAAGGG 
TTTCCATTCT 
AATAGTATAA 
ATTATCCAGA 
CCTTGCTTTT 
GTACCTGGAT 
TCCTTTTTGC 
AATCCTGGTT 
GATGCAACTG 
TGTCCCCAAT 
GCCGAAAGGG 
CACCCTAGGC 
TAAAAGCGAA 



51 
I 

GGCTCGGCCC 

GCGGCGACTG 

ATTCATAATG 

TCCTGTTCAA " 

ACAGGAGTCT 

TCCAGCTGGG 

CCCCAACAAT 

TGGCAGTAGT 

TCCAGGACTC 

GGAGACCTTG 

CCTTTGCGAG 

CAATAGCCTA 

CAGCATCAAG 

GGTTGAGGAG 

CTACTCTTAC 

TTTGAAAGAC 

GCCAGGCTGG 

GACGTCTGCC 

GACATTGGAC 

GGAATCACAG 

ACTCCCCCTG 

GGTACCTATC 

ATTGCCCCTG 

CATTGCCCCC 

TGGCGATCTG 

TTTATCAGTG 

AGAGGAGAAA 

GGAGGAAGAA 

GAGCCCTCCC 

CTCCTGGGAG 

TAGGTCCCCA 

GAGGAGCCGG 

GCTCTTCTCA 

CTCCTCTGAC 

ACCCATTAAG 

CCCTGAATCC 

ACAAACCTCC 

CACCACTCCC 

ACCCTTAGAC 

CAAGCCAGGC 

AGGCCTGGTC 

TCCTGGCCTG . 

TGAGCTACAG 

TCCAAGGCTC 

CCTCATAGCT 

GGACCTTGTG 

TGGGAACAAC 

CTGCCCAGCA 

ATTCTCCAAA 

GACTGCCAGA 

AGTTTTGATA 

CTTGGGTTCT 

CCCTCCCTGC 

AAAAAAGTCT 

AAGCAGAGTG 

CATACCAGGG 

CCCCTGACCT 

CCCAGCTGAC 

GGTGGACAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7B0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 



Seq ID NO: 73 Protein sequence;. 
Protein Accession #: AAC51128.1 



216 



WO 02/086443 



PCT/US02/12476^ 



1 11 21 31 41 51 

! I I I I I 

MKTSPRRPLI LKRRRLPLPV QNAPSETSEE EPKRSPAQQE SNQAEASKEV AESNSCKFPA 60. 

GIRIINHPTM PNTQWAIPN NANIHSIITA LTAKGKESGS SGPNKFILIS .CGGAPTQPPG 120 

LRPQTQTSYD AKRTEVTLET LGPKPAARDV NLPRPPGALC EQKRBTCADG EAAGCTINNS 180 

LSNIQWLRKM SSDGLGSRSI KQEMEEKENC HLEQRQVKVE EPSRPSASWQ NSVSERPPYS 240 

YMAMIQPAIN STERKHMTLK DIYTWIEDHF PYFKHIAXPG WKNSIRHNLS LHDMFVRETS 300 

ANGKVSPWTI HPSANRYLTL DQVFKPLDPG SPQLPEHLES QQKRPNPELR RNMTIKTELP 360 

LGARRKMKPL LPRVSSYLVP IQFPVNQSLV. LQPSVKVPLP LAASLMSSEL ARHSKRVRXA 420 

PKVFGEQWF GYMSKFPSGD LRDFGTPITS LFNFIFLCLS VLLAEEGIAP LSSAGPGKEE 480 

KLLFGEGFSP LLPVQTIKEE EIQPGEEMPH LARPIKVESP PLEEWPSPAP . SFKEESSHSW . 540 

EDSSQSPTPR PKKSYSGLRS PTRCVSEMLV IQHRERRERS RSRRKQHLL? PCVDEPELLF 600. 

SEGPSTSRWA AELPFPADSS DPASQLSYSQ EVGGPFKTPI KETLPISSTP SKSVLPRTPS 660 

SWRLTPPAKV GGLDFSPVQT SQGASDPLPD PLGLMDLSTT PLQSAPPLES PQRLLSSEPL 720 

DLISVPFGNS SPSDIDVPKP GSPEPQVSGL AANRSLTEGL VLDTMNDSLS KILLDISFPG 780 
LDEDPLGPDN INWSQFIPEL Q 

Seq ID NO: 74 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 111-416 

1 11 21 31 41 51 . 

ill I I I 

GGGAAGAGCC AGGCTGAGCC TTATAAAGGA CTGCTCTTTG TCCAAACACA CACATCTCAC 60 
TCATCCTTCT ACTCGTGACG CTTCCCAGCT CTGGCTTTTT GAAAGCAAAG ATGAGCAACA 120 
CTCAAGCTGA GAGGTCCATA ATAGGCATGA TCGACATGTT TCACAAATAC ACCAGACGTG 180 
ATGACAAGAT TGAGAAGCCA AGCCTGCTGA CGATGATGAA GGAGAACTTC CCCAACTTCC 240 
TTAGTGCCTG TGACAAAAAG GGCACAAATT ACCTCGCCGA TGTCTTTGAG AAAAAGGACA 300 
AGAATGAGGA TAAGAAGATT GATTTTTCTG AGTTTCTGTC CTTGCTGGGA GACATAGCCA 360 
CAGACTACCA CAAGCAGAGC CATGGAGCAG CGCCCTGTTC CGGGGGCAGC CAGTGACCCA 420 
GCCCCACCAA TGGGCCTCCA GAGACCCCAG GAACAATAAA ATGTCTTCTC CCACCAGA 

Seq ID NO: 75 Protein sequence: 
Protein Accession #; Eos sequence 

1 . : 11 ' 21 31 41 51 

I I I 1 I I 

MSNTQAERSI IGMIDMFHKY TRRDDKIEKP SLLTMMKENF PNFLSACDKK GTNYLADVFE 60 
KKDKNEDKKI DFSEFLSLLG DtATDYHKQS HGAAPCSGGS Q 

Seq ID NO: 76 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 111-416 

1 11 21 31 41 51 

I I I I I I . 

GGGAAGAGCC AGGCTGAGCC TTATAAAGGA CTGCTCTTTG TCCAAACACA CACATCTCAC 60 

TCATCCTTCT ACTCGTGACA CTTCCCAGTT CTGGCTTTTT GAAAGCAAAG ATGAGCAACA 120 

CTCAAGCTGA GAGGTCCATA ATAGGCATGA TCGACATGTT TCACAAATAC ACCGGACGTG 180 

ATGGCAAGAT TGAGAAGCCA AGCCTGCTGA CGATGATGAA GGAGAACTTC CCCAATTTCC 240 

TCAGTGCCTG TGACAAAAAG GGCATACATT ACCTCGCCAC TGTCTTTGAG AAAAAGGACA 300 

AGAATGAGGA TAAGAAGATT GATTTTTCTG AGTTTCTGTC CTTGCTGGGA GACATAGCCG 360 

CAGACTACCA CAAGCAGAGC CATGGAGCGG CGCCCTGTTC TGGGGGAAGC CAGTGATCCA 420 
GCCCCACCAA GGGGCCTCCA GAGACCCCAG GAACAATAAG TGTCTCCTCC CACCAGA 

Seq ID NO: 77 Protein sequence: 
Protein Accession #: XPJJ48124.1 

1 11 21 31 41 51 

I I I I I I 

MSNTQAERSI IGMIDMFHKY TGRDGKIEKP SLLTMMKENF PNFLSACDKK GIHYLATVFE 60 
KKDKNEDKKI DFSEFLSLLG DIAADYHKQS HGAAPCSGGS Q 



Seq ID NO: 78 DNA sequence 
Nucleic Acid Accession 'ft: Z73 678.1 
Coding sequence: 253-2433 

1 . 11 21 31 41 51 

I I I I I I 

GGGGTGGTGC AGGGCAGGGG TGGTATATCC TGTCTGACGG AGGGCGGGCC TCGCCAGTGC 60 

CAGAGAGGGA CGAACCAGGG TGGAAGCGCC AGGAGCAGCT GCAGGGAGCC CTCACGCGGA 120 

CCTCGCACTC TATGGCCGTA GGGAGCCGCT GAGAGCGAGA AGAGCACGCT . CCTGCCCGCC 180 

CGCTGCACCG CACCTCGCCT CGCCTCTCTG CTCTCCTAGG CCCCGGCCGC GCGCCACCCG 240 

CCTCCCGCCA CCATGAACCA CTCGCCGCTC AAGACCGCCT TGGCGTACGA ATGCTTCCAG 300 

GACCAGGACA ACTCCACGTT GGCTTTGCCG TCGGACCAAA AGATGAAAAC AGGCACGTCT 360 

GGCAGGCAGC GCGTGCAGGA GCAGGTGATG ATGACCGTCA AGCGGCAGAA GTCCAAGTCT ,420 

TCCCAGTCGT CCACCCTGAG CCACTCCAAT CGAGGTTCCA TGTATGATGG CTTGGCTGAC 480. 

AATTACAACT ATGGGACCAC CAGCAGGAGC AGCTACTACT CCAAGTTCCA GGCAGGGAAT • 540 

GGCTCATGGG GATATCCGAT CTACAATGGA ACCCTCAAGC GGGAGCCTGA CAACAGGCGC 600 

TTCAGCTCCT 'ACAGCCAGAT. GGAGAACTGG AGCCGGCACT ACCCCCGGGG CAGCTGTAAC 660 

ACCACCGGCG CAGGCAGCGA CATCTGCTTC ATGCAGAAAA TCAAGGCGAG CCGCAGTGAG 720 

CCCGACCTCT ACTGTGACCC ACGGGGCACC CTGCGCAAGG GCACGCTGGG CAGCAAGGGC 780 

CAGAAGACCA CCCAGAACCG CTACAGCTTT TACAGCACCT GCAGTGGTCA GAAGGCCATA 840 

AAGAAGTGCC CTGTGCGCCC GCCCTCTTGT GCCTCCAAGC AGGACCCTGT GTATATCCCG 900 
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WO 02/086443 

CCCATCTCCT GCAACAAGGA CCTGTCCTTT 
AGTGAGGACA TCGAGTGCAG TGGGCTGACC 
CAGGATGAGA AGTACCAGGC CATTGGGGCC 
GAATCTGCCA AGCAACAGGT CTATCAGCTG 
CGCAGCCCCA ACCAGAACGT CCAGCAGGCC 
AGGAGCACCA CCAACAAGCT GGAGACCCGG 
CTCCTGAGGA GAACCGGGAA CGCCGAGATC 
CTGTCTTCCA CTGACGAGCT GAAGGAGGAA 
GACCGCGTCA TCATTCCCTT CtCTGGCTGG 
GTGGTGGACC CTGAGGTCTT CTTCAATGCC 
GATGCAGGCC GCCAGACCAT. GCGTAACTAC 
GTCCAGAACT GTGTAGCGGC CAGCCGCTGT 
GTTCTGCACA ACCTCTCCTA CCGCCTGGAC 
GAGTATAACG CCCGCAACGC CTACACCGAG 
AGCGACAAGA TGATGAACAA CAACTATGAC 
AAGGGCAGCG GCTGGTTGTA CCATTCAGAT 
AAGAGCAAGA AAGATGCTAC CCTGGAGGCC 
AGCAAGGGGC TGATGTCCAG TGGCATGAGC 
CCACAAATTG CCCGCCTCCT GCAATCTGGC 
CTCCTGAGCA ACATGTCCCG CCACCCTCTG 
CCGGAGGTGA CCAGGCTCCT CACCAGCCAC 
TTGTCCTCGG CCTGCTACAC TGTGAGGAAC 
CAGTACTTCT CCAGCAGCAT GCTCAACAAC 
CCCAAGGCCG CAGAAGCTGC CCGGCTTCTC 
CAGGGTGTCC TCAGACAGCA AGGTTTCGAT 
AACAGCCTCA GGAACTTCAC CTCCCGATTC 
TGCAGGAAGA TATGACCCAG CTGAGAAGCC 
CATCCTGTGC AGTATTTGGG AAAGTTCACA 
ATAGTGGAAA GATTTTTAGA TTTTTTTTTT 
AGGGAGGTTG GGGCGGGGGG GGCTTTCTTG 
TTTCTTCCTC TGAGAAATGG TATATATATG 
GCGTGCATGT GTGTGTGTGT GAGTGTCTTA 
TAAGCTATTT TGTTGCAGCT CATAAGGTGG 
TAGGCAAGGA CAACATGTGC TTTTTGGTGA 
CAGGGCAAGG GGGCCATCAC TGCAGTCAGG 
AGTGGTCTCC AAGGGTGCAG GAGTAACTGG 
GCTTTCCACG AAGGGAGGTC TGGTGTATCT 
ACCCCTCCAG CAGCGCCACA AGGACTGAGG 
AGGACACTCT CGCATACTTT GCCAAATGAG 
GGTGCCTTCC ACCCTCTTGG GCTGTGTGCC 
TGCATTCAGA GGTCTTGTAA TCTACTTGTT 
TTAAGAAAAG CTATTTTATT GCAGCTCTTT 
TCATATTCCC AGTGGAGAGG GGAACAAGTG 
GTGGGATGGA GTTGGGGTAT AGAAATTAAC 
AGTCAATTGA GGGAGTGTTT GGGTCCCAGG 
AGGAAAGGAA AGTGCCATAT CAGGGTACCG 
CCATGCCCCA CTTCCCCTGA CCCCAGCTGT 
ATGTGATAAA CAGGGCTATT AGGGGTATCA 
TCAGACCAGC AGCAGCAGGA GGGCTCCCGA 
CCTTGGTGTA CACTAAGACA GAGCAGAGCC 
TCTACCTCCA TGCTAGCATT GCTGGTGTTA 
TCTCTAGAAG AATATAAGAT GCTCCTCCTC 
TCCTCTTCTG CACCACCCCC GAGTCCAAAC 
AACACTGATG TGGACTCAGT ATGACAACTG 
TGTCCTCCGA GAGGTGTCCC CGGCTGTTAG 
ATACCCTCCC TTGCTTCTGT TCACACTGGG 
AGGGACCCAC GTGGGAGCCT GGATCCCTGG 
CTTTGTTGTC ATCAGAACCC AGAGGAATTC 
TCTGTGCGGG GCAGTGTCCT AAGCACTTAG 
CCCGTCCTCA TGCGGCTTAT GTTTTCTGGA 
GGGCTCCCCC GGCTGGGGGC TGTGCAGTCC 
CATGTGAACC TTACCAGCCC AGGCGGATGC 
GCATCCCCTC GCCTCATGTT CCTCCCACCT 
CCCTGCCCTG GGAACCAGGC AGCCTTCCAG 
AGGGCTGACT TTGGTGACAC TGCCCATTCC 
CTGACCCAGG CCTGTCACTT TGAGAGGGGC 
AGAACAAGGA GCTTGCCAGG CTTCATGTAG 
ATTGGCCTCA CACTAGCCTA GGCCAATGCC 
TGAGGAAGGA CACAGACTCT GCCCTGGGAT 
GTCATTGGCC ACCAGCCACC TCTGCAGTGG 
CCTCCTGGGG ACCCAAGAGG CAGTGTTGCT 
ACTGGCTGGG AGGACCAAGA CTGCGGCTGG 
TGTGAGGGAT CTTGGAGCTT CCCTGTAGCC 
AACCTTGTGC CGGCCAGGCC CAGTTTCCTT 
GGAAATAGAG AAAATCAATA AATTGCTAGT 

Seq ID NO: 79 Protein sequence: 
. Protein Accession # :. CAA9B022 .1 



PCT/US02/12476 



GGCCACTCTA 
ATCCCCAAGG 
TATTACATCC 
GGAGGCATCT 
GCGGCAGGGG 
AGGCAGAATG 
CAGAAGCAGC 
CTCATTGCOG 
TGCGATGGCA 
ACAGGCTGCT 
TCAGGGCTCA 
GACGACAAGT 
GCCGAGGTGC 
AAGTCCTCCA 
TGCCCCCTGC 
GCCATCCGCA 
TGTGCTGGTG 
CAGTTGATTG 
AACTCTGATG 
CTGCACAGAG 
ACTGGCAATA 
CTGATGGCCT 
ATCATCAACC 
CTGTCTGACA 
AGGAACATGC 
TAAGAAGAGA 
CTCAGGCCTC 
AGAAACTGAG 
CCTTGGGGAA 
AGTTAAAGGG 
TGTCTAATGT 
AAGCATAACC 
TGAAAAGGAC 
GCTGCTCATA 
CCCTCAGAGG 
GGCTGGGCCA 
CATGGGAGAA 
TTGGGTAGGT 
GCCTGCTCAG 
CATCAGAGCA 
GCAGGAGAAA 
CCCAAGAGCT 
GGGCTGGGCA 
CAGGAAGATG 
AGACTTGGAC 
GTACCGGCAA 
CTTGTCTCCA 
GCCACGTCGA 
GGGCCTTATG 
CAGCGCTCCC 
GAGAGGAATT 
CTCACCCCTT 
CCACCTCTTG 
AGATGGGGGA 
CCAGCTGTGC 
AGGCCCACTC 
ACTGTCCTGG 
TTCTCCTAAA 
ACTA CAT CAG 
GGAAAGTGGA 
GGTCAGGGCG 
CCCTTCCCCT 
TCAAAGAATG 
ACCTCAGGGG 
CTCTCAGGCC 
AAAACTGAGA 
CCGACACACG 
CAAAATAAGG 
CTCCTGTGCT 
GGACCACACT 
GTCTGCGTGT 
GGTGGGCAGG 
CACCTTCCCC 
GTGTGATACA 
GTTTCTTTGA 



GGGCCAGCTC 
CTGTGCAGTA 
AGCATACCTG 
GCAAGCTGGT 
CCCTGCGCAA 
GGATCCGOGA 
TGACTGGGCT 
ACGCCCTGCC 
ATAGCAACAT 
TGAGGAACCT 
TTGATTCCCT 
CTGTGGAAAA 
CCACCCGCTA 
CTGGCTGCTT 
CTGAGGAAGA 
CCTACCTGAA 
CCCTGCAGAA 
GGCTGAAGGA 
TGGTGCGGTC 
TGATGGGGAA 
CCAGCAACTC 
CGCAGCCACA 
TGTGCCGAAG 
TGTGGTCCAG 
TGGGAACCTT 
CTGTCCAAGC 
GCTGGATGGG 
AAGAAACCTA 
ACTGGCAGGC 
GCTTATATGT 
AAGTGTGTGC 
ACAAACTGCA 
TCTCCTGTGT 
ATTCCTGAAA 
AGTCCTGCAG 
GCCTCCCCCC 
TCTGGGGTGT 
GTGAGGTTCC 
AGGAGTAGGA 
GGCTCAGCCT 
GAAGGTAAAA 
GTTCTGGGAA 
TATACCTATT 
TTTCCACCAA 
GGGGGGAGTT 
GCTCACATCT 
CTCTGTGAAA 
GCCCCCAGAC 
AGAAAACCTG 
AAGCCTTCCT 
AACTTCCTGG 
CTCAGCCTCC 
CCCCAGCATT 
AGCCAGACAT 
TGTGGTGCTG 
CTGGCTCACC 
GCATAGGTTT 
AAATACGTAT 
GGAAGAACAC 
GACACAAGTC 
GGAGGGGAAA 
TAGCACTACC 
AAGAGCCCCA 
CTGAGGCAGA 
AGCTCAGGTC 
GGGGCTTTTC 
TCTCAGGATT 
AGTTCCAATT 
AGCGGCCAAT 
AGCAGCCCTG 
CCACCTTGGA 
GAAGGGAAGC 
TTGCTTCATG 
CTAATGTATT 
AAAAAAAAA 



CAAGATCTGC 

CCTGAGCTCC 

CTTCCAGGAT 

GGACCTCCTC 

CCTGGTGTTC 

GGCAGTCAGC 

GCTCTGGAAC 

TGTTCTGGCC 

GTCCCGGGAA 

GAGCTCGGCC 

CATGGCCTAT 

CTGCATGTGT 

CCGCCAGCTG 

CAGCAACAAG 

GACCAACCCC 

CCTCATGGGC 

CCTGACAGCC 

AAAGGGCCTG 

CGGAGCCTCC 

CCAGGTGTTC 

CGAAGACATC 

ACTGGCCAAG 

CAGTGCCTCA 

CAAGGAACTG 

AGCTGGGGCC 

AAGTTAGGCT 

GTTTTCTGTC 

AAAACTGTGG 

AATGGGGGTT 

GATGTCAATA 

ATGCATGTGC 

AAAAGCTAGG 

TTCTTACTCA 

TGTGTGGTGC 

GCTTCCTACC 

TTACAAGGCT 

CTGTAGTGTC 

AGAGGACAGC 

GCTGAAAGAT 

GCAAAGGCCC 

AATGATTTTT 

TGGCTGGTCT 

CCGGCTTCTA 

GCCTGCTGTG 

TGGGTAGACT 

CAGCCAGGGG 

CCCACAGGGG 

TCTGTGCACT 

TGTGGACATC 

CCTTCCAGCT 

TCTGTGCCCT 

TCCCAAGTCT 

CAGGCTGGAA 

GTGAGGACGC . 

TGGGTCTGTC 

TCTCCCTCTC 

CAGGGGCCTC 

GGCATACCAA 

AGACCACATC 

CTTGGCTTTA 

TGCACCGCTG 

CTGGCCTCCT 

TGGGCCCAGC 

CTATTAGGGC 

ACCCGGGCCT 

CTAGAGAAAG 

TTAAGTCCAC 

TGGGGCCAAA 

GACAAATCCA 

ACTCCACACT 

ATCTGGCTGA 

CGGGGGCTGC 

TTTGTAGAGG 

TGCTTTTTTT 



960 
1020 
10B0 
* 1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 

ieoo 

I860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 . 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

37B0 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 



MNHSPLKTAL 
TLSHSNRGSM 
SQMENWSRHY 
QNRYSFYSTC 
ECSGLTIPKA 
QNVQQAAAGA 



11 
I 

AYBCFQDQDN 
YDGLADNYNY 
PRGSCNTTGA 
SGQKAIKKCP. 
VQYLSSQDEK 
LRNLVFRSTT 



21 
I 

STLALPSDQK 
GTTSRSSTYS 
GSDICFMQKI 
VRPPSCASKQ 
YQAIGAYYIQ 
MKLiETRRQNG 



31 
I 

MKTGTSGRQR 
KP0AGNGSKG 
KASRSEPDLY 
DPVYIPPISC 
HTCFQDESAK 
IRBAVSLLRR 



41 
I 

VQEQVMMTVK 
YPIYNGTLKR 
CDPRGTLRKG 
NKDLSPGHSR 
QQVYQLGGIC 
TGNAEIQKQL 



51 
I 

RQKSKSSQSS 
EPDNRRFSSY 
TLGSKGQKTT 
ASSKICSEDI 
KLVDLLRSPN 
TGLLWNIiSST 



60 
120 
180 
240 
300 
360 
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DELKEELIAD ALPVLADRVI IPFSGWCDGN SNMSREWDP EVFFNATGCIj RNLSSADAGR 420 

QTMRNYSGLI DSLMAYVQNC VAASRCDDKS VENCMCVLHN LSYRLDAEVP TRYRQLEYNA 480 

RNAYTEKSST GCFSNKSDKM MNNNYDCPLP EEETNPKGSG WLYHSDAIRT YLNLMGKSKK 540 

DATLEACAGA LQNLTASKGL MSSGMSQLIG LKEKGLPQIA RLLQSGNSDV VRSGASLLSN 600 

MSRHPLLERV KGNQVFPEVT RLLTSHTGNT SNSEDILSSA CYTVRNLMAS QPQLAKQYFS 660 

SSMLNNIINL CRSSASPKAA EAARLLLSDM WSSKELQGVL RQQGFDRNML GTLAGANSLR ' 720 
NFTSRF 



Seq ID* NO: 80 DNA sequence 

Nucleic Acid Accession ft: NM_006516.1 

Coding sequence: 180-1658 

1 11 21 31 41 51 

I I I I I I 

TAGTCGCGGG TCCCCGAGTG AGCACGCCAG GGAGCAGGAG ACCAAACGAC GGGGGTCGGA 60 

GTCAGAGTCG CAGTGGGAGT CCCCGGACCG GAGCACGAGC CTGAGCGGGA GAGCGCCGCT 120 

CGCACGCCCG TCGCCACCCG CGTACCCGGC GCAGCCAGAG CCACCAGCGC AGCGCTGCCA 180 

TGGAGCCCAG CAGCAAGAAG CTGACGGGTC GCCTCATGCT GGCTGTGGGA GGAGCAGTGC 240 

TTGGCTCCCT GCAGTTTGGC TACAACACTG GAGTCATCAA TGCCCCCCAG AAGGTGATCG 300 

AGGAGTTCTA CAACCAGACA TGGGTCCACC GCTATGGGGA GAGCATCCTG CCCACCACGC 360 

TCACCACGCT CTGGTCCCTC TCAGTGGCCA TCTTTTCTGT TGGGGGCATG ATTGGCTCCT 420 

TCTCTGTGGG CCTTTTCGTT AACCGCTTTG GCCGGCGGAA TTCAATGCTG ATGATGAACC 480 

TGCTGGCCTT CGTGTCCGCC GTGCTCATGG GCTTCTCGAA ACTGGGCAAG TCCTTTGAGA 540 

TGCTGATCCT GGGCCGCTTC ATCATCGGTG TGTACTGCGG CCTGACCACA GGCTTCGTGC 600 

CCATGTATGT GGGTGAAGTG TCACCCACAG CCTTTCGTGG GGCCCTGGGC ACCCTGCACC 660 

AGCTGGGCAT CGTCGTCGGC ATCCTCATCG CCCAGGTGTT CGGCCTGGAC TCCATCATGG 720 

GCAACAAGGA CCTGTGGCCC CTGCTGCTGA GCATCATCTT CATCCCGGCC CTGCTGCAGT 780 

GCATCGTGCT GCCCTTCTGC CCCGAGAGTC CCCGCTTCCT GCTCATCAAC CGCAACGAGG 840 

AGAACCGGGC CAAGAGTGTG CTAAAGAAGC TGCGCGGGAC AGCTGACGTG ACCCATGACC 900 

TGCAGGAGAT GAAGGAAGAG AGTCGGCAGA TGATGCGGGA GAAGAAGGTC ACCATCCTGG 960 

AGCTGTTCCG CTCCCCCXJCC TACCGCCAGC CCATCCTCAT CGCTGTGGTG CTGCAGCTCT 1020 

CCCAGCAGCT GTCTGGCATC AACGCTGTCT TCTATTACTC CACGAGCATC TTCGAGAAGG 1080 

CGGGGGTGCA GCAGCCTGTG TATGCCACCA TTGGCTCCGG TATCGTCAAC ACGGCCTTCA 1140 

CTGTCGTGTC GCTGTTTGTG GTGGAGCGAG CAGGCCGGCG GACCCTGCAC CTCATAGGCC 1200 

TCGCTGGCAT GGCGGGTTGT GCCATACTCA TGACCATCGC GCTAGCACTG CTGGAGCAGC 1260 

TACCCTGGAT GTCCTATCTG AGCATCGTGG CCATCTTTGG CTTTGTGGCC TTCTTTGAAG 1320 

rGGGTCCTGG CCCCATCCCA TGGTTCATCG TGGCTGAACT CTTCAGCCAG GGTCCACGTC 1380 

CAGCTGCCAT TGCCGTTGCA GGCTTCTCCA ACTGGACCTC AAATTTCATT GTGGGCATGT 1440 

GCTTCCAGTA TGTGGAGCAA CTGTGTGGTC CCTACGTCTT CATCATCTTC ACTGTGCTCC 1500 

TGGTTCTGTT CTTCATCTTC ACCTACTTCA AAGTTCCTGA GACTAAAGGC CGGACCTTCG 1560 

ATGAGATCGC TTCCGGCTTC CGGCAGGGGG GAGCCAGCCA AAGTGATAAG ACACCCGAGG 1620 

AGCTGTTCCA TCCCCTGGGG GCTGATTCCC AAGTGTGAGT CGCCCCAGAT CACCAGCCCG 1680 

GCCTGCTCCC AGCAGCCCTA AGGATCTCTC AGGAGCACAG GCAGCTGGAT GAGACTTCCA 1740 

AACCTGACAG ATGTCAGCCG AGCCGGGCCT GGGGCTCCTT TCTCCAGCCA GCAATGATGT 1800 

CCAGAAGAAT- ATTCAGGACT TAACGGCTCC AGGATTTTAA CAAAAGCAAG ACTGTTGCTC 1860 

AAATCTArTC AGACAAGCAA CAGGTTTTAT AATTTTTTTA TTACTGATTT TGTTATTTTT 1920 

ATATCAGCCT GAGTCTCCTG TGCCCACATC CCAGGCTTCA CCCTGAATGG TTCCATGCCT 1980 

GAGGGTGGAG ACTAAGCCCT GTCGAGACAC TTGCCTTCTT CACCCAGCTA ATCTGTAGGG 2 040 

CTGGACCTAT GTCCTAAGGA CACACTAATC GAACTATGAA CTACAAAGCT TCTATCCCAG 2100 

GAGGTGGCTA TGGCCACCCG TTCTGCTGGC CTGGATCTCC CCACTCTAGG GGTCAGGCTC 2160 

CATTAGGATT TGCCCCTTCC CATCTCTTCC TACCCAACCA CTCAAATTAA TCTTTCTTTA 2220 

CCTGAGACCA GTTGGGAGCA CTGGAGTGCA GGGAGGAGAG GGGAAGGGCC AGTCTGGGCT 2280 

GCCGGGTTCT AGTCTCCTTT GCACTGAGGG CCACACTATT ACCATGAGAA GAGGGCCTGT 2340 

GGGAGCCTGC AAACTCACTG CTCAAGAAGA CATGGAGACT CCTGCCCTGT TGTGTATAGA 2400 

TGCAAGATAT TTATATATAT TTTTGGTTGT CAATATTAAA TACAGACACT AAGTTATAGT 2460 

ATATCTGGAC AAGCCAACTT GTAAATACAC CACCTCACTC CTGTTACTTA CCTAAACAGA 2520 

TATAAATGGC TGGTTTTTAG AAACATGGTT TTGAAATGCT TGTGGATTGA GGGTAGGAGG 2580 

TTTGGATGGG AGTGAGACAG AAGTAAGTGG GGTTGCAACC ACTGCAACGG CTTAGACTTC 2640 

GACTCAGGAT CCAGTCCCTT ACACGTACCT CTCATCAGTG TCCTCTTGCT CAAAAATCTG 2700 

TTTGATCCCT GTTACCCAGA ' GAATATATAC ATTCTTTATC TTGACATTCA AGGCATTTCT 2760 

ATCACATATT TGATAGTTGG TGTTCAAAAA AACACTAGTT TTGTGCCAGC CGTGATGCTC 2820 
AGGCTTGAAA TCGCATTATT TTGAATGTGA AGGGAA 

Seq ID NO: 81 Protein sequence: 
Protein Accession #:. NP_006507.1 



PC17US02/12476 



MEPSSKKLTG 
LTTLWSLSVA 
MLILGRFIIG 
GNKDLWPLLL 
LQEMKEESRQ 
AGVQQPVYAT 
LPWMSYLSIV 
CFQYVEQLCG 
ELPHPLGADS 



11 
I 

RLMLAVGGAV 
IFSVGGMIGS 
VYCGLTTGFV 
SIIFIPALLQ 
MMREKKVTIL 
IGSGIVNTAP 
AIFGFVAFFE 
PYVFIIFTVL 
QV 



21 

I 

LGSIiQFGYNT 
FSVGLFVNRF 
PMYVGEVSPT 
CIVLPFCPES 
ELFRSPAYRQ 
TWSLFWER 
VGPGPIPWFI 
LVLFFIFTYF 



31 

I 

GVINAPQKVI 
GRRNSMLMMN 
AFRGALGTLH 
PRFLLINRNE 
PILIAWLQL 
AGRRTLHLIG 
VAELFSQGPR 
KVPETKGRTF 



41 
I 

EEFYNQTWVH 
LliAFVSAVLM 
QLGIWGILI 
ENRAKSVLKK 
SQQLSGINAV 
LAGMAGCAIL 
PAAIAVAGFS 
DEIASGFRQG 



51 
I 

RYGESILPTT 60 

GFSKW3KSFE 120 

AQVFGLDSIM 180 

LRGTADVTHD 240 

FYYSTSIFEK 300 

MTIALALLEQ 360 

NWTSNFIVGM 420 " 
480 



Seq ID NO: 82 DNA sequence 
Nucleic Acid Accession #: BC001291 
Coding sequence: 44-541 

1 11 21 31 41 51 

I II I I I 

GGGGGCGCCG CGCGCTGACC CTCCCTGGGC ACCGCTGGCG ACGATGGCGC TGCTCGCCTT 
GCTGCTGGTC GTGGCCCTAC CGCGGGTGTG GACAGACGCC AACCTGACTG CGAGACAACG 
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AGATCCAGAG GACTCCCAGC GAACGGACGA GGGTGACAAT 
TGAGAGAGAA AACACTTTCG AGTGCCAGAA CCCAAGGAGG 
CTGCGTTATA GCGGCCGTGA AAATATTTCC ACGTTTTTTC 
CGCTGGTTGT GCAGOGATGG AGAGACCCAA .GCCAGAGGAG 
GCCCATGCCC TTCTTTTACC TCAAGTGTTG TAAAATTCGC 
ACCTATCAAC TCATCAGTGT TCAAAGAATA TGCTGGGAGC 
GCTGTGGCTG GCCATCCTCC TGCTGCTGGC CTCCATTGCA 
AGCCACGGGA CTGCCACAGA CTGAGCCTTC CGGAGCATGG 
ACCTGTTGCA TTAAACTTGT TTTCTGTTGA TTACCTCTTG 
GGGATGGGAG AGTGGGGATC AGGTGCAGTT GGCTCTTAAC 
ACATTCAGAG GAAGTCCAGA TCTCCTGAGT AGTGATTTTG 
AAATCAAACC TTGTAACTCA TTTATTGCTG ATGGCCACTC 
CCTCTGAGGG CTTCAGTATT GATGGGGAGG GAGGCCTAAG 
TGCTGAGATG CTTCCGACCT TTCAGGTGAC GCAGGAACAC 
GGGTGAAGAC ATCCCTGGAG TGAAGGACTC CTCAGCATGG 
AGGGCTGCCC CCATTCCAGT GGTGGAGGCG CTGTGGATGG 
CTACCAGATT CCAGGAGGCA GAAGATAACT AATTGTGTTG 
ACCAGCTGGC ACAGGTGCAC AGATTCATAA ATTCCCACAC 
ACTTAGGCCA AGTAGAGAGC - ATCAGGGTAA ATGGCGTTCA 
CATCCATGGG GAGCTGAGAA ATCAGACTCA AAGTTCCACC 
TTCAAAAGTT CACGAAAAAA AAAAAAAAAA AAAAAAAAAA 



PCTAJS02/12476 



Seq ID NO: 83 Protein sequence: 
Protein Accession I: AAH01291 



11 
I 



21 
1 



31 
I 



AGAGTGTGGT 


GTCATGTTTG 


180. 




TGCAAATGGA 


CAGAGCCATA 


240 




ATGGTTGCGA 


AGCAGTGCTC 


300. 




AAGCGGTTTC 


TCCTGGAAGA 


360 ■ 




TACTGCAATT 


TAGAGGGGCC 


420 




ATGGGTGAGA 


GCTGTGGTGG 


480 




GCCGGCCTCA 


GCCTGTCTTG 


540 




ACTCGCTCCA 


gaccgttgtc 


600 




GTTTGACTTC 


CCAGGGTCTT 


660 




CCTCAAGGGT 


TCTTTAACTC 


720 




GTGACAAGTT 


TTTCTCTTTG 


780 




TTTTCCTTGA 


CTCCCCTCTG 


840 




TACCACTCAT 


GGAGAGTATG 


900 




TGGGGGAGTC 


TGAATGATTG 


960 




GGGGCAGTGG 


GGCACACGTT 


1020 




CTGCTTTTCC 


TCAACCTTTC 


1080 ; 




AAGAAACTTA 


GACTTCACCC 


1140 




GTGTGTGTTC 


AACATCTGAA 


1200 




TTTCTCTGTT 


AAGATGCAGC 


1260 




AAAAACAAAT 


ACAAGGGGAC 


1320 




AAAAAAAAAA 


AAA 






41 


51 






1 

VWCHVCERKN 


1 

TFECQNPRRC 


60 




RFLLEEPMPF 


FYLKCCKIRY 


120 





30 CNLEGPPINS SVFKBYAGSM GESCGGLWUV ILLLLASIAA GLSLS 



Seq ID NO: 84 DNA sequence 

Nucleic Acid Accession #i NMJJ22893.1 

Coding sequence: 229-2726 



1 

I 

TTTTTTTTTT 
' TGCGCCATCT 
TTTTCTCTGG 
CGCCCGCCGC 
AAGCAAGGCA 
ATTCTTACAG 
CTCCTCACCT 
GAGCACAAAC 
CCTTCCCCTT 
GTCACGCCAG 
GAACACATAG 
GGAGCTCTAA 
GATGAGCCCA 
CTCTTGCAAC 
AGTCCCCTGA 
CCACCTCTCC 
GGATCAGTAT 
CTGTTTAGTC 
GAAGAAATGG 
CCAATGGCTA 
AACACGTCTA 
CCATTCCAGC 
TCCGCCCCTC 
ACGTTCAAAT 
TACAAGTGCA 
AAGACGCACA 
GCCAGCTCCC 
TCCGTGGTGG 
GAGGAGGAAG 
CTGACGGAGA 
CACGAGAACA 
GACGTCATGC 
GTCCTGGGCG 
TGCGACGAAG 
CGCGGCTGCT 
AGCCCCAGCT 
CCCCCGGCCA 
GCCTCCAGGC 
GCCTCCTCGT 
GAGCTGGACG 
ATTAGTGGTC 
GAGTACTGTG 
ACGGGCGAAA 
CTCACCAGGC 
TGTAAGATGC 
GATCGAGTGT 
CTCCCACCTG 
CCTGTAGGAT 
ACGAAGCTAA 
TTCTTTTTTC 



11. 
I 

TTTTTTGCTT 
TTGTATTATT 
AGTCTCCTTC 
CGCCGCCGCC 
AACCCCAGCA 
ATGATGAACC 
GTGGGCAGTG 
GGAAACAATG 
CACCAATCGA 
AAGATGACGA 
CAGATAAACT 
TCCCCACGCC 
GCAGCTACAC 
ACGCACAGAA 
CCCCGCGGGT 
ATGGGATTCA 
CGAGAGAGGC 
CACCACCGAG 
CCCTGGCCAC 
TGGAGCCTCC 
GCCCACCGCT 
CAGGTAGCAA 
CTCCCTCCCA 
TTCAGAGCAA 
ACCTGTGCGA 
TGCACAAATC 
OGGAACCCGG 
CCAAGTTCAA 
AGGAGGACGA 
GCGAGAGGGT 
GCTCGCGGGG 
AGGGCATGGT 
AGAAGCATAA 
ACTCGGTGGC 
CCCOGGGCGA 
CGCTGAGCCC 
CGATGCCCAA 
AGCTCAAAGA 
CGGAGCACTC 
GAGGGATCTC 
OGGGCACGGG 
GGAAAGTCTT 
GGCCTTATAA 
ACATGAAAAC 
CTTTTAGCGT 
TGAATAATGA 
ACACCCCCTT 
TTTTTTCTAG 
GAATATGAGA 
TTTTTCCTTT 



21 
I 

AAAAAAAAGC 
TCTAATTTAT 
TTTCTAACCC 
GCCGCCGCCG 
CTTAAGCAAA 
AGACCACGGC 
CCAGATGAAC 
CAATGGCAGC 
GATGAAAAAA 
TTGTTTATCA 
TCTGCACTGG 
TGGGATGAGT 
ATGTACAACT 
CACTCATGGA 
TGGTATCCCT 
TATTGCAGAC 
TTCCGGCCTG 
ACATCACTTG 
CCATCACCCG 
CGCCATGGAT 
GTCCCCAGGC 
GCCGCCCTTC 
GCCCCCGGTC 
CCTGGTGGTG 
CCACGCGTGC 
GTCCCCCATG 
CACCAGCGAC 
GAGCGAGAAC 
CGAGGAAGAG 
GGACTACGGC 
CGCGGTCGTG 
GCTCAGCTCC 
GCGCGGCCAC 
CGGCGAGTCG 
GTCGGCCTCG 
CTTCTCTAAG 
CACGGAGAAC 
TCCCTTCCTT 
CTCGGAGAAC 
GGGGCGCAGC 
CAGGCCCAGC 
CAAGAACTGT 
ATGCGAGCTG 
GCATGGCCAG 
GTACAGTACC 
TATAAAAACT 
TTTCACCACT 
TCCCATGTGA 
GTGCTTGTCA 
TTTTTTTTTT 



31 

I 

CATGACGGCT 
TTTGGATGTC 
GGCTCTCCCG 
CCCGCCCCGC 
CGGGAATTCT 
CCGTTGGGAG 
TTCCCATTGG 
CTCTGCTTAG 
GCATCCAATC 
ACGTCATCTA 
AGGGGCCTCT 
GCAGAATATG 
TGCAAACAGC 
TTAAGAATCT 
TCAGGACTAG 
AATAACCCCT 
GCAGAAGGGC 
GACCCCCACC 
AGTGCCTTTG 
TTCTCTAGGA 
CGGCCCAGCC 
CTGGCGACGC 
AAGTCCAAGT 
CACCGGCGCA 
ACCCAGGCCA 
ACGGTCAAGT 
TTGGTGGGCA 
GACCCCAACC 
GAAGAAGAGG 
TTCGGGCTGA 
GGCGTGGGCG 
ATGCAGCACT 
CTGGCOGAGG 
GACCGCATAG 
GGGGGCCTGT 
CGCATCAAGC 
GTGTACTCGC 
AGCTTOGGAG 
GGGAGCTTGC 
GGCACGGGAA 
TCAAAAGAGG 
AGCAATCTCA 
TGCAACTATG 
GTGGGGAAGG 
CTGGAGAAAC 
GAATAGAGGT 
CCCTTTCCCC 
TTTAAACAAA 
CCAGCACACC 
TCCTTTATGT 



CATGCGAGTT 
GCCACACGGG 
GCAAGCTGAA 
CCGACGACGG 
GCGCCAGCAG 
TGATCCCGGA 
AGGAAGAGGA 
GCCTGGAGGC 
ACGAGAGCCG 
TCAGCGAGGC 
CCGAGGGCCA 
ACGATGGCAC 
CCAAAAAGCT 
TCGAGAAGGA 
AGTGGCTCGC 
ACTCCAGACA 
GCTTCTCCAC 
GTGGAGGGAG 
GCAGACGCAG 
CTGTCCACAG 
CCTGTGCCCA 
ACGTTTACAA 
ACATGAAAAA 
ATATTAATAC 

atcgccctcc 

CAAACAAACA 
TGTTTTTTTT 
TCTCACCGTT 



51 
I 

TCATCTTCCC 
GATGAAGATA 
AGCCGTCGTC 
GTCTCGCCGC 
TCTTGAAGCC 
GGATCATGAC 
TATTTTTATC 
GGATAAGCCA 
TGGCATCCAG 
CCCCAAACAG 
TTCTGCACAT 
TATTTGTAAA 
TGCATGGTTT 
CGAACACGGA 
TCCTTCCCAG 
AAGAATACCA 
CACTCCCCCC 
CCTGGGGGCG 
GCGGTTGAAT 
GCTGGCAGGG 
GTTACTGCAA 
TCCTCTGCAA 
CTGCGGCAAG 
CGAGAAGCCC 
GCGCCACATG 
TCTCTCCACC 
CGCGCTCAAG 
GAACGGGGAC 
GGAGGAGGAG 
GGCGCGCCAC 
CGCCCTGCCC 
CTTCCACCAG 
CAGGGACACT 
TGTTAATGGC 
GCTGCTGGGC 
GTTOGACCTG 
CGGCTACGCG 
ATCGCCTTTT 
ACCGCCCGGG 
CACGCCCCAT 
CGACACTTGT 
GAGAAGCCAC 
GAGTAGCAAG 
ATGTGAAATT 
ATGGCACAGT 
CCCTCCCTCA 
AGCCCCACTC 
AACAGAAGTA 
CTTTTTC TT T 
TGAATGCATG 



60 
• 120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500. 
1560 
1620 
1680 
1740 
1800 
I860' 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 



220 
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ATCTGTATGG GGCAATACTA TTGCATTTTA CGCAAACTTT GAGCCTTTCT CTTGTGCAAT 3060 

AATTTACATG TTGTGTATGT TTTTTTTTAA ACTTAGACAG CATGTATGGT ATGTTATGGC 3120 

- TATTTT AAAT TGTCCCTAAT TCGTTGCTGA GCAAACATGT TGCTGTTTCC AGTTCCGTTC 3180. 

TGAGAGAAAA AGAGAGAGAG AGAGAAAAAG ACCATGCTGC A TACA TTCTG TAATACATAT 3240 

CATGTACAGT TTTATTTTAT AACGTGAGGA GGAAAAACAG TCTTTGGATT AACCCTCTAT 3300 

- AGACAGAATA GATAGCACTG AAAAAAAATC TCTATGAGCT AAATGTCTGT CTCTAAAGGG -3360 

TTAAATGTAT CAATTGGAAA 'GGAAGAAAAA AGGCCTTGAA TTGACAAATT AACAGAAAAA 3420 

CAGAACAAGT TTATTCTATC ATTTGGTTTT AAAATATGAG TGCCTTGGAT CTATTAAAAC 34BO 

CACATCGATG GTTCTTTCTA CTTGTTATAA ACTTGTAGCT TAATTCAGCA TTGGGTGAGG 3540 

TAATAAACCT TAGGAACTAG CATATAATTC TATATTGTAT TTCTCACAAC AATGGCTACC 3600 

TAAAAAGATG ACCCATTATG TCCTAGTTAA TCATCATTTT TCCTTTAGTT TAATTTTATA 3660 

AACAAAACTG ATTATACCAG TATAAAAGCT ACTTTGCTCC TGGTGAGAGC TTAAAAGAAA 3720 

TGGGCTGTTT TGCCCAAAGT TTTATTTTTT TTAAACAATG ATT AAAT TGA ATGTGTAATG 3780 

TGCAAAAGCC CTGGAACGCA ATTAAATACA CTAGTAAGGA GTTCATTTTA TGAAGATATT 3840 

TGCTTTAATA ATGTCTTTTT AAAAATACTG GCACCAAAAG AAATAGATCC AGATCTACTT 3900 

GGTTGTCAAG TGGACAATCA AATGATAAAC TTTAAGACCT TGTATACCAT ATTGAAAGGA 3960 

AGAGGCTGAC AATAAGGTTT GACAGAGGGG AACAGAAGAA AATAATATGA TTTATTAGCA 4020 

CAACGTGGTA CTATTTGCCA TTTAAAACTA GAACAGGTAT ATAAGCTAAT ATTGATACAA 4080 

TGATGATTAA CTATGAATTC TTAAGACTTG CATTTAAATG TGACATTCTT AAAAAAAGAA 4140 

GAGAAAGAAT TTTAAGAGTA GCAGTATATA TGTCTGTGCT CCCTAAAAGT TGTACTTCAT 4200 

TTCTTTTCCA TACACTGTGT GCTATTTGTG TTAACATGGA AGAGGATTCA TTGTTTTTAT 4260 

TTTTATTTTT TTAATTTTTT CTTTTTTATT AAGCTAGCAT CTGCCCCAGT TGGTGTTCAA 4320 

ATAGCACTTG ACTCTGCCTG TGATATCTGT ATCTTTTCTC TAATCAGAGA TACAGAGGTT 4380 

GAGTATAAAA TAAACCTGCT CAGATAGGAC AATTAAGTGC ACTGTACAAT TTTCCCAGTT 4440 

TACAGGTCTA TACTTAAGGG AAAAGTTGCA AGAATGCTGA AAAAAAATTG AACACAATCT 4500 

CATTGAGGAG CATTTTTTAA AAACTAAAAA AAAAAAAACT TTGCCAGCCA TTTACTTGAC 4560 

TATTGAGCTT ACTTACTTGG ACGCAACATT GCAAGCGCTG TGAATGGAAA CAGAATACAC 4620 

TTAACATAGA AATGAATGAT TGCTTTCGCT TCTACAGTGC AAGGATTTTT TTGTACAAAA 4680 

CTTTTTTAAA TATAAATGTT AAGAAAAATT TTTTTTAAAA AACACTTCAT TATGTTTAGG 4740 

GGGGAACTGC ATTTTAGGGT TCCATTGTCT TGGTGGTGTT ACAAGACTTG TTATCCATTT 4800 

AAAAATGGTA GTGGAAATTC TATGCCTTGG ATACACACCG CTCTTCAGGT TGTAAAAAAA 4860 

AAAAACATAC ATTGGGGAAA GGTTTAAGAT TATATAGTAC TTAAATATAG GAAAATGCAC 4920 

ACTCATGTTG ATTCCTATGC TAAAATACAT TTATGGTCTT TTTTCTGTAT TTCTAGAATG 49B0 

GTATTTGAAT TAAATGTTCA TCTAGTGTTA GGCACTATAG TATTTATATT. GAAGCTTGTA 5040 

TTTTTAACTG TTGCTTGTTC TCTTAAAAGG TATCAATGTA CCTTTTTTGG TAGTGGAAAA 5100 

AAAAAAGACA GGCTGCCACA GTATATTTTT TTAATTTGGC AGGATAATAT AGTGCAAATT 5160 

ATTTGTATGC TTCAAAAAAA AAAAAAAGAG AGAAACAAAA AAGTGTGACA TTACAGATGA 5220 

GAAGCCATAT AATGGCGGTT TGGGGGAGCC TGCTAGAATG TCACATGGAT GGCTGTCATA 5280 

GGGGTTGTAC ATATCCTTTT TTGTTCCTTT TTCCTGCTGC CATACTGTAT. GCAGTACTGC 5340 

AAGCTAATAA CGTTGGTTTG TTATGTAGTG TGCTTTTTGT CCCTTTCCTT CTATCACCCT 5400 

ACATTCCAGC ATCTTACCTT CATATGCAGT AAAAGAAAGA AAGAAAAAAA AAGGAAAAAA 5460 

AAAAAAAAAC CAATGTTTTG CAGTTTTTTT CATTGCCAAA AACTAAATGG TGCTTTATAT 5520 

TTAGATTGGA AAGAATTTCA TATGCAAAGC ATATTAAAGA GAAAGCCCGC TTTAGTCAAT 5580 

ACTTTTTTGT AAATGGCAAT GCAGAATATT TTGTTATTGG CCTTTTCTAT TCCTGTAATG 5640 

AAAGCTGTTT GTCGTAACTT GAAATTTTAT CTTTTACTAT GGGAGTCACT ATTTATTATT 5700 

GCTTATGTGC CCTGTTCAAA ACAGAGGCAC TTAATTTGAT CTTT TATTTT TCTTTGTTTT 5760 

TATTTTTTTT TTTATTTAGA TGACCAAAGG TCATTACAAC CTGGCTTTTT ATTGTATTTG 5820 

TTTCTGGTCT TTGTTAAGTT CTATTGGAAA AACCACTGTC TGTGTTTTTT TGGCAGTTGT 5880 

CTGCATTAAC CTGTTCATAC • ACCCATTTTG TCCCTTTATT GAAAAAATAA AAAAAATTAA 5940 
A* * 

Seq ID NO i 85 Protein sequence: 
Protein Accession #: NP_075044.1 

1 11 21 31 - 41 51 

1 I I I I 1- 

MSRRKQGKPQ HLSKREFSPE PLEAILTDDE PDHGPLGAPE GDHDLLTCGQ OQMNFPLGDI 60 

LIFIEHKRKQ CNGSLCLEKA VDKPPSPSPI EMKKASNPVB VGIQVTPEDD DCLSTSSRRI 120 

CPKQEHIADK LLHWRGLSSP RSAHGALIPT PGMSAEYAPQ GICKDEPSSY- TCTTCKQPFT 180 

SAWFLLQHAQ NTHGIiRIYLE' SEHGSPLTPR VGIPSGLGAE CPSQPPliHGI HIADNNPFNL 240 

LRIPGSVSRE ASGLAEGRFP PTPPLPSPPP RHHLDPHRIE RLGAEEMALA - THHPSAPDRV 300 

LRLNPMAMEP PAMDFSRRLR ELAGNTSSPP LSPGRPSPMQ RLLQPFQPGS KPPFLATPPL 360 

PPLQSAPPPS QPPVKSKSCE FCGKTFKFQS NLWHRRSHT GEKPYKCNLC DHACTQASKL 420 

KRHMKTHMHK SSPMTVKSDD GLSTASSPEP GTSDLVGSAS SALKSWAKF KSENDPNLIP 480 

EKGDEEEEED DEEEEEEEEE EEEEI*TESER VDYGFGLSLE AARHHENSSR GAWGVGDES 540 

RALPDVMQGM VLSSMQHFSE AFHQVLGEKH KRGHLAEAEG HRDTCDEDSV AGESDRIDDG 600 

TVNGRGCSPG ESASGGLSKK LLLGSPSSLS PFSKRIKLBK EFDLPPATMP NTENVYSQWL 660 

AGYAASRQLK DPFLSFGDSR QSPFASSSEH SSENGSLRFS TPPGELDGGI SGRSGTGSGG 720 

STPHISGPGT GRPSSKEGRR SDTCEYCGKV FKNCSNLTVH RRSHTGERPY KCELCNYACA 780 
QSSKLTRHMK THGQVGKDVY KCEICKMPFS VYSTLEKHMK KWHSDRVLNN DIKTE 

Seq ID NO: 86 DNA sequence - 

Nucleic Acid Accession #: XM_035292.2 

Coding sequence: . y " ■ '• 

1 - 11 21-31 41 51 

) ) ' • I I 1 I 

GCTCGCTGGG CCGCGGCTCC CGGGTGTCCC AGGCCCGGCC GGTGCGCAGA GCATGGCGGG 60 

TGCGGGCCCG AAGCGGCGCG CGCTAGCGGC . GCCGGCGGCC GAGGAGAAGG AAGAGGCGCG 120 

GGAGAAGATG CTGGCCGCCA AGAGCGCGGA CGGCTCGGCG CCGGCAGGCG AGGGCGAGGG . 1B0 

CGTGACCCTG CAGCGGAACA TCACGCTGCT CAACGGCGTG GCCATCATCG ■ TGGGGACCAT 240 

TATCGGCTCG GGCATCTTCG TGACGCCCAC GGGCGTGCTC AAGGAGGCAG . GCTCGCCGGG 300 

GCTGGCGCTG GTGGTGTGGG CCGCGTGCGG CGTCTTCTCC ATCGTGGGCG CGCTCTGCTA 360 

CGCGGAGCTC GGCACCACCA TCTCCAAATC GGGCGGCGAC TACGCCTACA^ TGCTGGAGGT 420 

CTACGGCTCG CTGCCCGCCT TCCTCAAGCT CTGGATCGAG CTGCTCATCA TCCGGCCTTC 480 

* ATCGCAGTAC ATCGTGGCCC TGGTCTTCGC CACCTACCTG CTCAAGCCGC TCTTCCCCAC S40 
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CTGCCCGGTG CCCGAGGAGG CAGCCAAGCT CGTGGCCTGC CTCTGCGTGC TGCTGCTCAC 
GGCCGTGAAC TGCTACAGCG TGAAGGCCGC CACCCGGGTC CAGGATGCCT TTGCCGCCGC 
CAAGCTCCTG GCCCTCGCCC TGATCATCCT GCTGGGCTTC GTCCAGATCG GGAAGGGTGA - 
TGTGTCCAAT CTAGATCCCA ACTTCTCATT TGAAGGCACC AAACTGGATG TGGGGAACAT 
TGTGCTGGCA TTATACAGCG GCCTCTTTGC CTATGGAGGA TGGAATTACT TGAATTTCGT 
CACAGAGGAA ATGATCAACC CCTACAGAAA CCTGCCCCTG GCCATCATCA TCTCCCTGCC 
CATCGTGACG CTGGTGTACG TGCTGACCAA CCTGGCCTAC TTCACCACCC TGTCCACCGA 
GCAGATGCTG TCGTCCGAGG CCGTGGCCGT < GGACTTCGGG AACTATCACC TGGGCGTCAT 
: GTCCTGGATC ATCCCCGTCT TCGTGGGCCT GTCCTGCTTC GGCTCCGTCA ATGGGTCCCT 
GTTCACATCC TCCAGGCTCT TCTTCGTGGG GTCCCGGGAA GGCCACCTGC CCTCCATCCT 
CTCCATGATC CACCCACAGC TCCTCACCCC CGTGCCGTCC CTCGTGTTCA CGTGTGTGAT 
GACGCTGCTC TACGCCTTCT CCAAGGACAT CTTCTCCGTC ATCAACTTCT TCAGCTTCTT 
CAACTGGCTC TGCGTGGCCC TGGCCATCAT CGGCATGATC TGGCTGCGCC ACAGAAAGCC 
TGAGCTTGAG CGGCCCATCA AGGTGAACCT GGCCCTGCCT GTGTTCTTCA TCCTGGCCTG 
CCTCTTCCTG ATCGCCGTCT CCTTCTGGAA GACACCCGTG GAGTGTGGCA TCGGCTTCAC 
CATCATCCTC AGGGGGCTGC CCGTCTACTT CTTCGGGGTC TGGTGGAAAA ACAAGCCCAA 
GTGGCTCCTC CAGGGCATCT TCTCCACGAC CGTCCTGTGT CAGAAGCTCA TGCAGGTGGT 
CCCCCAGGAG ACATAGCCAG GAGGCCGAGT GGCTGCOGGA GGAGCATGC 

Seq ID NO: 87 Probein sequence; 
Protein Accession ft: XP_035292.2 



PO7US02/12476 



l 
I 

MAGAGPKRRA 
GTIIGSGIFV 
LEVYGSLPAF 
LLTAVNCYSV 
GNIVLALYSG 
STEQMLSSEA 
SILSMIHPQL 
RKPELERPIK 
KPKWLLQGIF 



11 
I 

LAAPAAEEKE 
TPTGVLXEAG 
LKLWIELLII 
KAATRVQDAF 
LFAYGGWNYL 
VAVDFGNYHL 
LTPVPSLVFT 
VNLALPVFPI 
STTVLCQKLM 



21 
1 

EAREKMIAAK 
SPGLALWWA 
RPSSQYIVAL 
AAAKLLALAL 
NFVTEEMINP 
GVMSWIIPVF 
CVMTLLYAPS 
LACLFLIAVS 
QWPQET 



31 

I - 

SADGSAPAGE 
ACGVFSIVGA 
VFATYLLKPL 
IILLGFVQIG 
YRNLPLAIII 
VGLSCFGSVN 
KDIFSVINFP 
FHKTPVECGI 



41 

I 

GEGVTLQRNI 
LCYAELGTTI 
FPTCPVPEEA 
KGDVSNLDPN 
SLPIVTLVYV 
GSLFTSSRLF 
SFFNWLCVAL 
GFTIIIiSGLP 



51 
I 

TLLNGVAIIV 
SKSGGDYAYM 
AKLVACLCVL 
FSFEGTKLDV 
LTNLAYFTTL 
FVGSREGHLP 
AIIGMIWLRH 
VYFFGVWWKN 



Seq ID NO: 88 DNA sequence 

Nucleic Acid Accession #: NM_00S268.1 

Coding sequence: 168-989 



TAAAAAGCAA 
TCTGGATATG 
AGCCCTGAGG 
TCTTTGAGGG 
TGTCTCTGGT 
GTGATGACCA 
TTGATGAGTT 
CATGCCCCTC 
ACCGAGAAGC 
GTGGGCTCTG 
TTCTCTATGT 
ACGCAGATCC 
TTTTCACCCT 
TCATCTACCT 
TGTGCACAGG 
CGGGTGACCT 
GAGACCATGT 
CCTGGATGGG 
CATGAGGTAG 
TCAACTCCAG 
GCTCGGTTTC 



11 
I 

AAGAATTCGC 
AAATTCAAGC 
AGTAGTCACT 
ACTCCTGAGT 
CTTCATCTTC 
CAAGGACTTC 
CTTCCCTGTG 
ACTGCTCGTG 
CCATGGGGAG 
GTGGACATAT 
GTTCCACTCA 
ATGTCCCAAT 
CTTCATGGTG 
GGTGAGCAAG 
TCATCACCCC 
CATCTTTCTG 
GAAGAAAACC 
GAGGCTCTAG 
GGGCAGGCAA 
CCACCTGCCC 
CTTTTCTAGA 



21 
I 

GGCCGCGTCG 
TGCTTGCTGA 
CAGTAGCAGC 
GGGGTCAACA 
CGCGTGCTGG 
GACTGCAATA 
TCCCATGTGC 
GTCATGCACG 
AACAGTGGGC 
GTCTGCAGCC 
TTCTACCCCA 
ATAGTGGACT 
GCCACAGCTG 
AGATGCCACG 
CACGGTACCA 
GGCTCAGACA 
ATCTTGTGAG 
CATCTCTCAT 
GAGAGAGGAT 
CAGCTCX3ACG 
ATGGAAATAG 



31 
I 

ACACGGGCTT 
GTCCTATTGC 
TGACGCGTGG 
AGTACTCCAC 
TGTACCTGGT 
CTCGCCAGCC 
GCCTCTGGGC 
TGGCCTACCG 
GCCTCTACCT 
TAGTGTTCAA 
AATATATCCT 
GCTTCATCTC 
CCATCTGCAT 
AGTGCCTGGC 
CCTCTTCCTG 
GTCATCCTCC 
GGGCTGCCTG 
AGGTGCAACC 
TCAGACGCTC 
GCACTGGGCC 
TGAGGGCCAA 



41 

I 

CCCCGAAAAC 
CGGCTGCTGG 
GTCCACCATG 
AGCCTTTGGG 
GACGGCCGAG 
CGGCTGCTCC 
CCTGCAGCTT 
GGAGGTTCAG 
GAACCCCGGC 
GGCGAGCGTG 
CCCTCCTGTG 
CAAGCCCTCA 
CCTGCTCAAC 
AGCAAGGAAA 
CAAACAAGAC 
TCTCTTACCA 
GACTGGTCTG 
TGAGAGTGGG 
TGGGAGCCAG 
AGTTCCCCCT 
TGC 



51 
I 

CTTCCCCGCT 
GAGCCAGGAG 
AACTGGAGTA 
CGCATCTGGC 
CGTGTGTGGA 
AACGTCTGCT 
ATCCTGGTGA 
GAGAAGAGGC 
AAGAAGCGGG 
GACATCGCCT 
GTCAAGTGCC 
GAGAAGAACA 
CTCGTGGAGC 
GCTCAAGCCA 
GACCTCCTTT 
GACCGCCCCC 
GCAGGTTGGG 
GGAGCTAAGC 
TTCCTAGTCC 
CTGCTCTGCA 



Seq ID NO: 89 Protein sequence: 
Protein Accession #: NP_005259.1 



11 



21 31 41 51 

I I I I 

MNWSIFEGIiIi SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 

SNVCFDEFFP VSHVRLWALQ LILVTCPSLL WMHVAYREV QEKRHREAHG ENSGRLYLNP 

GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP WKCHADPCP NIVDCFISKP 

SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 
DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL 



Seq ID NO: 90 DNA sequence 

Nucleic Acid Accession ft: NM_002391. 

Coding sequence: 26-457 



CGGGCGAAGC 
CGCCCTGCTG 
CCCGGGGAGC 
CGGCGTGGGT 
GCCCTGCAAC 
TGCGTGTGAT 



11 
I 

AGCGCGGGCA 
GCGCTCACCT 
GAGTGCGCTG 
TTCCGCGAGG 
TGGAAGAAGG 
GGGGGCACAG 



21 
I 

GCGAGATGCA 
CCGCGGTCGC 
AGTGGGCCTG 
GCACCTGCGG 
AGTTTGGAGC 
GCACCAAAGT 



600 ■ 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 . 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 



60 
120 
180 
240 



31 41 51 

t I I . 

GCACCGAGGC TTCCTCCTCC TCACCCTCCT 
CAAAAAGAAA GATAAGGTGA AGAAGGGCGG 
GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 
GGCCCAGACC CAGCGCATCC GGTGCAGGGT. 
CGACTGCAAG TACAAGTTTG AGAACTGGGG 
CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA 



60 
12*0 
180 
240 
300 
360 



222 
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CAATGCTCAG TGCCAGGAGA CCATCCGCGT CACCAAGCCC TGCACCCCCA AGACCAAAGC 420 

AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTASACG CCAAGCCTGG ATGCCAAGGA 480 

. GCCCCTGGTG TCACATGGGG CCTGGCCACG CCCTCCCTCT CCCAGGCCCG AGATGTGACC 540 ■ 

CACCAGTGCC TTCTGTCTGC * TCGTTAGCTT TAATCAATCA TGCCCTGCCT TGTCCCTCTC 600 

5 ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG ACAAGGGATT CTGGGAAGCT 660 

TGAGCCTCCC. CCAAAGCAAT GTGAGTCCCA GAGCCOGCTT TTGTTCTTCC CCAC AATTCC 720 

* ATTACTAAGA AACACATCAA ATAAACTGAC TTTTTCCCCC CAATAAAAGC TCTTCTTTTT 780 

TAATAT ■ .... 

.10 Seq -ID NO: 91 Protein . sequence : 

Protein Accession ; 8 : NP_0023B2.1 . 

1 11 21 31 41 51 

1C I I I ." I I I 

1 J MQHRGFLLLT LLALIALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT 60 
CGAQTQRIRC RVPCNWKKEF GADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQBTI 120 

RVTKPCTPKT KAKAKAKKGK GKD « 



20 Seq ID NO: 92 DNA sequence 

Nucleic Acid Accession #: NM_005130.1 
Coding sequence: 98-802 

1 11 21 31 41 51 

25 | II I I I 

CTCTACCTGA CACAGCTGCA GCCTGCAATT CACTCCCACT GCCTGGGATT GCACTGGATC 60 

CGTGTGCTCA GAACAAGGTG AACGCCCAGC TGCAGCCATG AAGATCTGTA GCCTCACCCT 120 

GCTCTCCTTC CTCCTACTGG CTGCTCAGGT GCTCCTGGTG GAGGGGAAAA AAAAAGTGAA 180 

GAATGGACTT CACAGCAAAG TGGTCTCAGA ACAAAAGGAC ACTCTGGGCA ACACCCAGAT 240 

30 TAAGCAGAAA AGCAGGCCCG GGAACAAAGG CAAGTTTGTC ACCAAAGACC AAGCCAACTG 300 

CAGATGGGCT GCTACTGAGC AGGAGGAGGG CATCTCTCTC AAGGTTGAGT GCACTCAATT 360 

GGACCATGAA TTTTCCTGTG TCTTTGCTGG CAATCCAACC TCATGCCTAA AGCTCAAGGA 420 

TGAGAGAGTC TATTGGAAAC AAGTTGCCCG GAATCTGCGC TCACAGAAAG ACATCTGTAG 480 

ATATTCCAAG ACAGCTGTGA AAACCAGAGT GTGCAGAAAG GATTTTCCAG AATCCAGTCT 540 

35 TAAGCTAGTC AGCTCCACTC TATTTGGGAA CACAAAGCCC AGGAAGGAGA AAACAGAGAT 600 

GTCCCCCAGG GAGCACATCA AGGGCAAAGA GACCACCCCC TCTAGCCTAG CAGTGACCCA 660 

GACCATGGCC ACCAAAGCTC CCGAGTGTGT GGAGGACCCA GATATGGCAA ACCAGAGGAA 720 

GACTGCCCTG GAGTTCTGTG GAGAGACTTG GAGCTCTCTC TGCACATTCT TCCTCAGCAT 780 

. AGTGCAGGAC ACGTCATGCT AATGAGGTCA AAAGAGAACG GGTTCCTTTA AGAGAT GTCA 840. 

40 TGTCGTAAGT CCCTCTGTAT ACTTTAAAGC TCTCTACAGT CCCCCCAAAA TATGAACTTT 900 

TGTGCTTAGT GAGTGCAACG AAATATTTAA ACAAGTTTTG TATTTTTTGC TTTTGTGTTT 960 

TGGAATTTGC CTTATTTTTC TTGGATGCGA TGTTCAGAGG CTGTTTCCTG CAGCATGTAT 1020 

TTCCATGGCC CACACAGCTA TGTGTTTGAG CAGCGAAGAG TCTTTGAGCT GAATGAGCCA 1080 

GAGTGATAAT TTCAGTGCAA CGAACTTTCT GCTGAATTAA TGGTAATAAA ACTCTGGGTG 1140 
TTTTTCAAAA AAAAAAAAAA AAA 

Seq ID NO: 93 Protein sequence: 
Protein Accession NP_005121.1 

50 1 11 21 31 41 51 

I 1 1 I I I 

MKICSLTLLS FLLLAAQVLL VEGKKKVKNG LHSKWSEQK DTLGNTQIKQ KSRPGNKGKP 60 

VTKDQANCRW AATEQEEGIS LKVECTQLDH EPSCVFAGNP TSCLKLKDER VYWKQVARNL 120 

RSQKDICRYS KTAVKTRVCR KDFPESSLKL VSSTLFGNTK PRKEKTEMSP REHIKGKETT 180 
55 PSSLAVTQTM ATKAPECVED PDMANQRKTA LEFCGBTWSS LCTFFLSIVQ DTSC 



45 



60 



Seq ID NO: 94 DNA sequence 

Nucleic Acid Accession #: NM_012101 

Coding sequence: 125-1891 



11 21 31 41 51 

II I I I I 

CTCCTCACAG GTGTGTCTCT AGTCCTCGTG GTTGCCTGCC CCACTCCCTG CCGAGACGCC 60 

TGCCAGAAAG GTCACCTATC CTGAACCCCA GCAAGCCTGA AACAGCTCAG CCAAGCACCC 120 

65 TGCGATGGAA GCTGCAGATG CCTCCAGGAG CAACGGGTCG AGCCCAGAAG CCAGGGATGC 180 

CCGGAGCCCG TCGGGCCCCA GTGGCAGCCT GGAGAATGGC ACCAAGGCTG ACGGCAAGGA 240 

TGCCAAGACC ACCAACGGGC ACGGCGGGGA GGCAGCTGAG GGCAAGAGCC TGGGCAGCGC 300 

CCTGAAGCCA GGGGAAGGTA GGAGCGCCCT GTTCGCGGGC AATGAGTGGC GGCGACCCAT 360 

CATCCAGTTT GTCGAGTCCG GGGACGACAA GAACTCCAAC TACTTCAGCA TGGACTCTAT 420 

70 GGAAGGCAAG AGGTCGCCGT ACGCAGGGCT CCAGCTGGGG GCTGCCAAGA AGCCACCCGT 480- 

TACCTTTGCC GAAAAGGGCG ACGTGCGCAA GTCCATTTTC TCGGAGTCCC GGAAGCCCAC 540 

GGTGTCCATC ATGGAGCCCG GGGAGACCCG GCGGAACAGC TACCCCCGGG CCGACACGGG 600 

CCTTTTTTCA CGGTCCAAGT CCGGCTCCGA GGAGGTGCTG TGCGACTCCT GCATCGGCAA 660 

CAAGCAGAAG GCGGTCAAGT CCTGCCTGGT GTGCCAGGCC TCCTTCTGCG AGCTGCATCT 720 

75 CAAGCCCCAC CTGGAGGGCG CCGCCTTCCG AGACCACCAG CTGCTCGAGC CCATCCGGGA 780 

CTTTGAGGCC CGCAAGTGTC CCGTGCATGG CAAGACGATG GAGCTCTTCT GCCAGACCGA 840 

CCAGACCTGC ATCTGCTACC TTTGCATGTT CCAGGAGCAC AAGAATCATA GCACCGTGAC 900 

AGTGGAGGAG GCCAAGGCCG AGAAGGAGAC GGAGCTGTCA CTGCAAAAGG AGCAGCTGCA 960 

GCTCAAGATC ATTGAGATTG AGGATGAAGC TGAGAAGTGG CAGAAGGAGA AGGACCGCAT 1020 

80 CAAGAGCTTC ACCACCAATG AGAAGGCCAT CCTGGAGCAG AACTTCCGGG ACCTGGTGCG 1080 

GGACCTGGAG AAGCAAAAGG AGGAAGTGAG GGCTGCGCTG GAGCAGCGGG AGCAGGATGC 1140 

TGTGGACCAA GTGAAGGTGA TCATGGATGC TCTGGATGAG AGAGCCAAGG TGCTGCATGA 1200 " 

GGACAAGCAG ACCCGGGAGC AGCTGCATAG CATCAGCGAC TCTGTX3TTGT TTCTGCAGGA 1260 

ATTTGGTGCA TTGATGAGCA ATTACTCTCT CCCCCCACCC CTGCCCACCT ATCATGTCCT 1320 

85 GCTGGAGGGG GAGGGCCTGG GACAGTCACT AGGCAACTTC AAGGACGACC TGCTCAATGT 1380 

. ATGCATGCGC CACGTTGAGA • AGATGTGCAA GGCGGACCTG AGCCGTAACT TCATTGAGAG 1440 

GAACCACATG GAGAACGGTG GTGACCATCG CTATGTGAAC AACTACACGA ACAGCTTCGG 1500 
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AGTGCACCGG 
ACATCATACC 
AACAATCTCT 
ATTCAGAACT 
GGCTATCCCT 
GGCAAGCAGA 
GGGTCCAACG 
TCCTCCTGAC 
CCTGGTCCTG 
CTCCGACTTC 
TGCTCACCAT 
CCTGCCCTAA 
CCGCATGGTA 
TTCTCTCCAA 
ATCTCCCATT 
TCGTCCTACC 
AAACCTCTCA 
TGAGTCTGTG 
ATTCCTCTGT 
CAAATAGCTA 
ACTTTGGCAT 
CACTGCCCCC 
CCATGTTGCA 
TGGGGACATC 
CTTGCAGGGG 
GGTCTGTC 



GGGTGAGTGG 
TGGGGTCCGG 
GAAGAATTTC 
CTCCTCCAGC 
CTCCCTGAAA 
TTGGAAATCT 
CAAOGGGATT 
CCCCTGCTCT 
TGGGAGGGAG 
GTTCCGGCCT 
TGACCTCAGA 
TAGGTTGGGG 
CAGTGAGTAC 
CTATAGACGT 
ACAGCCACCC 
GTGCTCTCTC 
CTGCAGATGG 
ACAGCCACTT 
TAGCCAAGAT 
TCCACCCATG 
TTCAGTCTAC 
GTGCCTTACA 
TGATTACCCC 
AGCAGCACAG 
GATGGGCCAG 
ATAAACCATT 



ACACCATGAA 
AGCCCTCGTC 
ATGGCACCAA 
CTGACAATGA 
CCCTCATGCG 
CTATGCTGTC 
AAGCCCCATG 
CCTGCTGCTC 
CACCTGCCCT 
CCCACTGGCC 
CATTCCTGTG 
CCCGCCAGCC 
TCAGCCTGCC 
GGCCCTATCC 
CACATGGCCC 
TATCAATGCC 
GTGTCTTGAC 
GTCCCTGGAG 
TCCCTCTGCT 
CTGGCCCAGC 
TCTCTCTGGC 
ACCCTCAGCC 
TATCAGGGTG 
TCCCGTCTCA 
TTGGGGAGGG 



GAGATACTCC 
TCCTGGCOGC 
AGGTAACTAC 
CCTGCCCGTC 
GAGCCAAAGC 
TCACTACCGG 
AGCTCCTGGC 
TTGCCTTCTA 
CTGCAGCCCT 
ACACTCCATT 
CTCAGAGGCC 
TCCTCCTCTC 
TCTCCCGCCC 
CCCAATGTTG 
ACCTCCTGCT 
CAGCATGGCA 
ATCACCCTAC 
GGTGGCTTCT 
GAGATAAAGA 
TACCATTTAC 
GATGGAGTGT 
GTTGCCCCAT 
CTCAAGGATT 
ACAGCCCCAG 
AGACATCCAG 



Seq ID NO: 95 Protein sequence: 
Protein Accession # : NPJJ36233 . 1 



MEAADASRSN 
KPGEGRSALF 
FAEKGDVRKS 
QKAVKSCLVC 
TCICYLCMFQ 
SFTTNEKAIL 
KQTREQLHSI 
MRHVEKMCKA 
VRTSYQPSSP 
LKGYPSLMRS 



11 

I . 

GSSPEARDAR 
AGNEVJRRPII 
IPSESRKPTV 
QASFCELHLK 
EHKNHSTVTV 
EQNFRDLVRD 
SDSVLFLQEF 
DLSRNFIERN 
GRFTKETTQK 
QSPKAQPQTW 



21 
I 

SPSGPSGSLE 
QFVESGDDKN 
SIMEPGETRR 
PHLEGAAFRD 
EEAKAEKETE 
LEKQKEEVRA 
GALMSNYSLP 
HMENGGDHRY 
NFNNLYGTKG 
KSGKQTMLSH 



31 
I 

NGTKADGKDA 
SNYFSMDSME 
NSYPRADTGL 
HQLLEPIRDF 
LSIvQKEQLQL 
ALEQREQDAV 
PPLPTYHVLL 
VKNYTNSFGG 
NYTSRVWEYS 
YRPFYVNKGN 



Seq ID NO: 96 DNA sequence 

Nucleic Acid Accession #t NM_080668.1 

Coding . sequence : 83-841 



GGCACGAGGG 
GAGCTCGAGA 
GCGCTCCGGG 
AGGCTCTGAA 
AGTCAGAAAG 
CCAATCACCT 
TGGCAGGGAG 
CAGCACTCCT 
CAGAGACTTG 
CTCTGCCTCT 
GGCAGAAGAC 
GGTTTGTGCA 
GAAACAGAAA 
GGCTGCGGCC 
AGATGCAGTG 
CCTGTGGAGA 
GCTGCGCATG 
TTAGGAAATG 
CCTTCCTATC 
GAATCAGTGG 
CTTGTTGGGA 
CCCTCCTGAG 
TTCAGTTCCG 
GAGGGCAATT 
AAGGATGTAG 
GTTTTCTAGT 
GCTGCTTGGA 
GCTCATGGCC 
TGCAGGGGTG 
ACCGAGTCTA 
CAACTATGCT 
' ATAGCAATTT 
CTCATGATCT 
TTTGGATTTG 
TTTGATGTTT 
CAGAGAATGC 
AGGCGGGCCC 



11 . 
I 

CAGCGAGTGG 
CGGAG CCTAG 
CCAAGGGCCC 
CTCCCGAGCA 
CCCATCGTCT 
CGCAGGAGCC 
CTTACTAAGG 
GTGCCGAACC 
. GAAATGTCTA 
ACCTCCACCC 
TTGTCCGGAG 
AAGCCCTGGG 
CGTAAGAAGA 
ATGAATGCCG 
GGGGGTGCAC 
GGACACTTAG 
AGGACTGTCT 
GGGCCGCCTG 
TCCCCAAAGT 
CCCAGCCAGA 
GGGGTGGCTG 
TTGCCTTCTG 
TGGGTTTCCC 
CTGTCTTGGA 
GGAGCCTTAG 
TCTGGTCTGC 
GCAAAGGGTG 
TCTGGGCTGG 
GAAGGTGGCC 
CTTGGCCCTA 
TGTAAAGTCC 
TAGTTTTTGG 
CTGGAGAATT 
'AAGGCTGCCC 
AGAAGTTCGT 
CAGGGAAGAT 
TAATAAAACC 



21 
I 

CCTTCCCGGT 
TTATGTCTGG 
CATCTCCTAC 
TCCTCCCTGA 
TAAAGAGGAT 
CTAGGATTTC 
AGGACCTTTT 
CTGAGGCCGA 
AGAAAGTCAG 
CAGGCCGCCG 
TCTCGCCAGT 
CCCCAGACAT 
AGAAAATGCC 
AGTTTGAAGC 
CTGGCCAGAC 
GGTCCCCTCC 
GCCTTTGAGG 
GCCCAGCCAC 
ACCATAGCCA 
AGTTAAAGGG 
CTTGGAAATA 
TTGGTGGCTT 
CTTTGATTCT 
GGAAGAAGCT 
.TTGGGTCTCA 
TGTTGAGATG 
CATTTCAGGG 
TCCCTTGCAC 
CCTCTTGTCA 
GAAGAGAAAG 
TCGGAAAGTT 
CCTCCCTATC 
TCCCCATCTT 
TGCCCCCGAC 
GGAAGTAGAC 
GAGTGCTGGG 
CTCTGCCAGG 



31 
I 

TGGCGCGGGC 
GAGGCGAACG 
TAAGCCTCTG 
AATCTGGCCG 
CGTGGCCCAT 
CTTTTTCTTG 
CAAGACACAC 
GTCCAGCTCC 
GCGTTCCTAC 
GTCCTGCTTT 
GGTGTGCTCC 
GACTCTCCCT 
AGAGATCTTG 
TGCTGAGCAG 
TCTCCCTCCT 
CCTGGTCTTG 
GCTTGGGCAG 
TCACTGGTGT 
GTTTCCAGAT 
CTGAGGGTTG 
GGCCCAGGGG 
TCTTCTTGAA 
GTAAATAGTC 
GGACATTCAG 
GACCATAAGT 
TTTGGTAAAT 
TGTGGCCACC 
AGGGCCCACG 
CCCATACCCA 
TTGAAGAGTC 
TCCTCGCGTA 
CTCTCACATG 
TCTCTTCTTT 
TCTCCTGCCG 
GCTGAGGTGT 
TCAGGGTACT 
TCTGGGAGTC 



ATGTACCTGA 
TTCACCAAGG 
ACCTCCCGGG 
GTCCAAGGCA 
CCCAAGGCCC 
CCATTCTACG 
GGAAGGAACG 
AGCTACTGTG 
CTGCCAGCCT 
CAGACTCCTT 
AACCCATCAC 
GGGCTGGATC 
ACGCCCTGCT 
TCAGCAGATG 
TCCCAGAGGA 
GAACCTGCAG 
CCAGGCGGTG 
CCTGACTGGC 
ATTCCCTTAA 
CATTTGCCTA 
GGCTGGGCTG 
CAGAGGCTGC 
GGAGAGGAGA 
GCCTATGGGG 
CTTGGGCTTT 



41 
I 

KTTNGHGGEA 
GKRSPYAGLQ 
FSRSKSGSEE 
EARKCPVHGK 
KIIEIEDEAE 
DQVKVIMDAL 
EGEGLGQSLG 
EWSAPDTMKR 
SSIQNSDNDIj 
GIGSNEAP 



41 

I 

CCGGGGCGGC 
CGGTCCGGAG 
CGGAGGTCCC 
AAGACACCCA 
GCTGTAGAGG 
GAGAAAGAAA 
AGCGTCCCTG 
AAGGAAGGAG 
AGCCGGCTGG 
GGCTTCGAGG 
AAACTCACCG 
GGAATCTCCC 
AAAACGGAGC 
TTTGATCTCC 
GTCCTGTACA 
TTACCTGTGT 
CAGCGGCAGC 
CCTGTCTCTT 
GGGCCACAGA 
AGGTGAGAGG 
CTCTGCCAGC 
CCCACCTGTG 
CCAGAGAGAA 
CCTGTGGAGT 
GTGTACTACA 
GCCAGGTTGA 
AGGTGCTGTG 
CTGGAGTCTT 
TTTCTTACAA 
CCAGACCTAC 
CCAGACAGCG 
AGAACACTGC 
CCATCGTGTG 
CACCCCTGGC 
GCAGAGGAGC 
TGGATGAAAC 
CCAGGCCATC 



CACCCAAAGG 
AGACCACCCA 
TCTGGGAGTA 
GCTCCTCCTT 
AGCCCCAGAC 
TCAACAAAGG 
AGGCGCCACA 
CTTGTCTGGG 
CTTGGGGGCA 
TCCTGCCTTG 
AGGGGTGAGA 
TGGGGGCTAG 
GTCTCCAGGC 
CCTGGACAGC 
CTGGCCCTAC 
TGGCCAAGGG 
GGTCTCCACC 
AGGATGACCT 
CATGATATAA 
CAGAATTTCA 
ACCGCAAAAG 
CTCCTCCTTC 
CAAAACCAGG 
GCTCTGGAAG 
CCCCTTTGGA 



51 
I 

AEGKSLGSAL 
LGAAKKPPVT 
VLCDSCIGNK 
TMELFCQTDQ 
KWQKEKDRIK 
DERAKVLHED 
NFKDDLLNVC 
YSMYLTPKGG 
PWQGSSSFS 



51 

! 

GGCGCTGGAG 
GAGCCGCTCA 
AGCGGAAATC 
GTGCGGCTGC 
TCCCAGCTGT 
ACGAGCCCCC 
CCACCCCCAC 
AGCTGGACGC 
AGACCCTGGG 
GGCTGCTGGG 
AGGTCCCCAG 
CACCACCCGA 
TGGATGAGTG 
TGGTTGAATG 
TAGCCACCTC 
GTGTGCTGGT 
CATCTTGGTT 
GTCGTCCTGT 
CTGGGGAGGA 
CACCTCTGCT 
CTCGGCCTCT 
TAAAGAGGTT 
TTCGTGGGCT 
CTGAGTTTTG 
CAGAAGCTGT 
TAGGGCGCTG 
AGTTTCTGTG 
ACCACTCTGC 
AATAAGTTAC 
TAGCATTTTG 
GCGGGGGCTG 
CTGGATGCAT 
GATTCAATAG 
CATTGTACCT 
TGGTGGATAA 
GGTGCAGGCC 
TGCTCAACGC 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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2340 
2400 
2460 
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2700 
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60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 ' 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
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TCTGTGGTTT GTCAGACCTG, CAAGCAAGCC CCCTGCTGGG 6AAGCCTA6G TGTCCTTGAG 2280 

CTCAACCGCA CTGAAGAACT CTTGTCCTCA CTGGCTGATG CRGCAGAACT CTTGGGAAAT 2340 

GTCTTAGTCC . TGCAGAATCA GGAGTCACCA GATGATGCAG AGTTGAGATC ATCATTGCAA 2400 

AGTTCTCTGT TCCTGAGGAA CTAAATTTAA GGAAAAAATG GGATTTTGTT TTAGAGTTGG '2460 
' AAAAAAAGCC TGATTAAAGA GTTTCTGCCT GTTAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID NO: 97. Protein sequence: 
Protein Accession 8: NP_542399.1 

1 11 21 31 41 51 

I . I - I I I I 

MSGRRTRSGG AAQRSGPRAP SPTKPLRRSQ RKSGSELPSI LPEIWPKTPS AAAVRKPIVL €0 

KRIVAHAVEV PAVQSPRRSP RISFFLBKEN EPPGRELTKE DLFKTHSVPA TPTSTPVPNP 120 

EAESSSKEGE LDARDLEMSK KVRRSYSRLE TLGSASTSTP GRRSCFGFEG LLGAEDLSGV 180 

SPWCSKLTE VPRVCAKPWA- PDMTLPGISP PPEKQKRKKK KMPEILKTEL DEWAAAMNAE 240 
FEAAEQFDLL VE 



Seq ID NOs 98 DNA sequence 

Nucleic Acid Accession # x Eos sequence 

Coding sequence :. 58-12444 

1 11 21 31 41 51 

111(11 

GGGGCATTTC CGGGTCCGGG CCGAGCGGGC GCACGCGCGG GAGCGGGACT CGGCGGCATG ' 60 

GCGGGCTCCG GAGCCGGTGT GCGTTGCTCC CTGCTGCGGC TGCAGGAGAC CTTGTCCGCT '. 12 D 

GCGGACCGCT GCGGTGCTGC CCTGGCCGGT CATCAACTGA TCCGCGGCCT GGGGCAGGAA 180 

TGCGTCCTGA GCAGCAGCCC CGCGGTGCTG GCATTACAGA CATCTTTAGT TTTTTCCAGA 240 

GATTTCGGTT TGCTTGTATT TGTCCGGAAG TCACTCAACA GTATTGAATT TCGTGAATGT 300 

AGAGAAGAAA TCCTAAAGTT TTTATGTATT TTCTTAGAAA AAATGGGOCA GAAGATCGCA 360 

CCTTACTCTG TTGAAATTAA GAACACTTGT ACCAGTGTTT ATACAAAAGA TAGAGCTGCT 420 

AAATGTAAAA TTCCAGCCCT GGACCTTCTT ATTAAGTTAC TTCAGACTTT TAGAAGTTCT 480 

AGACTCATGG ATGAATTTAA AATTGGAGAA TTATTTAGTA AATTCTATGG AGAACTTGCA 540 

TTGAAAAAAA AAATACCAGA TACAGTTTTA GAAAAAGTAT ATGAGCTCCT AGGATTATTG 600 

GGTGAAGTTC ATCCTAGTGA GATGATAAAT AATGCAGAAA ACCTGTTCCG CGCTTTTCTG 660 

GGTGAACTTA AGACCCAGAT GACATCAGCA GTAAGAGAGC CCAAACTACC TGTTCTGGCA 720 

GGATGTCTGA AGGGGTTGTC CTCACTTCTG TGCAACTTCA CTAAGTCCAT GGAAGAAGAT 780 

CCCCAGACTT CAAGGGAGAT TTTTAATTTT GTACTAAAGG CAATTCGTCC TCAGATTGAT 840 

CTGAAGAGAT ATGCTGTGCC CTCAGCTGGC TTGCGCCTAT TTGCCCTGCA TGCATCTCAG 900 

TTTAGCACCT GCCTTCTGGA . CAACTACGTG TCTCTATTTG AAGTCTTGTT AAAGTGGTGT 960 

GCCCACACAA ATGTAGAATT GAAAAAAGCT GCACTTTCAG CCCTGGAATC CTTTCTGAAA 1020 

CAGGTTTCTA ATATGGTGGC GAAAAATGCA GAAATGCATA AAAATAAACT GCAGTACTTT 1080 

ATGGAGCAGT TTTATGGAAT CATCAGAAAT GTGGATTCGA ACAACAAGGA GTTATCTATT 1140 

GCTATCCGTG GATATGGACT TTTTGCAGGA CCGTGCAAGG TTATAAACGC AAAAGATGTT 1200 

GACTTCATGT ACGTTGAGCT CATTCAGCGC TGCAAGCAGA TGTTCCTCAC CCAGACAGAC 1260 

ACTGGTGACG ACCGTGTTTA TCAGATGCCA AGCTTCCTCC AGTCTGTTGC AAGCGTCTTG 1320 

CTGTACCTTG ACACAGTTCC TGAGGTGTAT ACTCCAGTTC TGGAGCACCT CGTGGTGATG 1380 

CAGATAGACA GTTTCCCACA GTACAGTCCA AAAATGCAGC TGGTGTGTTG CAGAGCCATA 1440 

GTGAAGGTGT TCCTAGCTTT GGCAGCAAAA GGGCCAGTTC TCAGGAATTG CATTAGTACT 1500 

GTGGTGCATC AGGGTTTAAT CAGAATATGT TCTAAACCAG TGGTCCTTCC AAAGGGCCCT 1560 

GAGTCTGAAT CTGAAGACCA CCGTGCTTCA GGGGAAGTCA GAACTGGCAA ATGGAAGGTG 1620 

CCCACATACA AAGACTACGT GGATCTCTTC AGACATCTCC TGAGCTCTGA CCAGATGATG 1680 

GATTCTATTT TAGCAGATGA AGCATTTTTC TCTGTGAATT CCTCCAGTGA AAGTCTGAAT 1740 

CATTTACTTT ATGATGAATT TGTAAAATCC GTTTTGAAGA TTGTTGAGAA ATTGGATCTT 1800 

ACACTTGAAA TACAGACTGT TGGGGAACAA GAGAATGGAG ATGAGGCGCC TGGTGTTTGG I860 

ATGATCCCAA CTTCAGATCC AGCGGCTAAC TTGCATCCAG CTAAACCTAA AGATTTTTCG 1920 

GCTTTCATTA ACCTGGTGGA ATTTTGCAGA GAGATTCTCC CTGAGAAACA AGCAGAATTT 1980 

TTTGAACCAT GGGTGTACTC ATTTTCATAT GAATTAATTT TGCAATCTAC AAGGTTGCCC 2040 

CTCATCAGTG GTTTCTACAA ATTGCTTTCT ATTACAGTAA GAAATGCCAA GAAAATAAAA 2100 

TATTTCGAGG GAGTTAGTCC AAAGAGTCTG AAACACTCTC CTGAAGACCC AGAAAAGTAT 2160 

TCTTGCTTTG CTTTATTTGT GAAATTTGGC AAAGAGGTGG CAGTTAAAAT GAAGCAGTAC 2220 

AAAGATGAAC TTTTGGCCTC TTGTTTGACC TTTCTTCTGT CCTTGCCACA CAACATCATT 2280 

GAACTCGATG TTAGAGCCTA CGTTCCTGCA CTGCAGATGG CTTTCAAACT GGGCCTGAGC 2340 

TATACCCCCT TGGCAGAAGT AGGCCTGAAT GCTCTAGAAG AATGGTCAAT TTATATTGAC 2400 

AGACATGTAA TGCAGCCTTA TTACAAAGAC ATTCTCCCCT GCCTGGATGG ATACCTGAAG 2460 

ACTTCAGCCT TGTCAGATGA GACCAAGAAT AACTGGGAAG TGTCAGCTCT TTCTCGGGCT 2520 

GCCCAGAAAG GATTTAATAA AGTGGTGTTA AAGCATCTGA AGAAGACAAA GAACCTTTCA 2580 

TCAAACGAAG CAATATCCTT AGAAGAAATA AGAATTAGAG TAGTACAAAT GCTTGGATCT 2640 

CTAGGAGGAC AAATAAACAA AAATCTTCTG ACAGTCACGT CCTCAGATGA GATGATGAAG 2700 

AGCTATGTGG CCTGGGACAG AGAGAAGCGG CTGAGCTTTG CAGTGCCCTT TAGAGAGATG 2760 

AAACCTGTCA TTTTCCTGGA TGTGTTCCTG CCTCGAGTCA CAGAATTAGC GCTCACAGCC 2820 

AGTGACAGAC AAACTAAAGT TGCAGCCTGT GAACTTTTAC ATAGCATGGT TATGTTTATG 2880 

TTGGGCAAAG CCACGCAGAT GCCAGAAGGG GGACAGGGAG CCCCACCCAT GTACCAGCTC 2940 

TATAAGCGGA CGTTTCCTGT GCTGCTTCGA CTTGCGTGTG ATGTTGATCA GGTGACAAGG 3000 

CAACTGTATG AGCCACTAGT TATGCAGCTG ATTCACTGGT TCACTAACAA CAAGAAATTT 3060 

GAAAGTCAGG ATACTGTTGC CTTACTAGAA GCTATATTGG ATGGAATTGT GGACCCTGTT 3120 

GACAGTACTT TAAGAGATTT TTGTGGTCGG TGTATTCGAG AATTCCTTAA ATGGTCCATT 3180 

AAGCAAATAA CACCACAGCA GCAGGAGAAG AGTCCAGTAA ACACCAAATC GCTTTTCAAG 3240 

CGACTTTATA GCCTTGCGCT TCACCCCAAT GCTTTCAAGA GGCTGGGAGC ATCACTTGCC 3300 

TTTAATAATA TCTACAGGGA ATTCAGGGAA GAAGAGTCTC TGGTGGAACA GTTTGTGTTT 3360 

GAAGCCTTGG TGATATACAT GGAGAGTCTG GCCTTAGCAC ATGCAGATGA GAAGTCCTTA 3420 

GGTACAATTC AACAGTGTTG TGATGCCATT GATCACCTAT GCCGCATCAT TGAAAAGAAG 3480 

CATGTTTCTT TAAATAAAGC AAAGAAACGA CGTTTGCCGC GAGGATTTCC ACCTTCCGCA 3540 

TCATTGTGTT TATTGGATCT GGTCAAGTGG CTTTTAGCTC ATTGTGGGAG GCCCCAGACA 3600 

GAATGTCGAC ACAAATCCAT TGAACTCTTT TATAAATTCG TTCCTTTATT GCCAGGCAAC 3660 

AGATCCCCTA ATTTGTGGCT GAAAGATGTT CTCAAGGAAG AAGGTGTCTC TTTTCTCATC 3720 

AACACCTTTG AGGGGGGTGG CTGTGGCCAG CCCTCGGGCA TCCTGGCCCA GCCCACCCTC 3780 

TTGTACCTTC GGGGGCCATT CAGCCTGCAG GCCACGCTAT GCTGGCTGGA CCTGCTCCTG 3840 
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GCCGCGTTGG AGTGCTACAA CAOGTTCATT 
CTAGGTACTG AAGCCCAGTC TTCACTTTTG 
GCCATGCATG - ACATTATAGC . AGCAGAAAAG . 
ACAAGCCCAC AAGAGGGAGA AAGGTACAAC 
ATGGAGTTTA CCACGACTCT GCTAAACACC 
GACTTGTGTA ATACACACCT GATGAGAGTC 
ATAGGTTTCA ACATCGGAGA CGTCCAGGTT 
CTGATGAAAG CTCTAAAGAT GTCCCCATAC 
AAAATAACAG CACAGAGCAT TGAGGAGCTT 
CAAGTGGACA GGAGCAGGCT GGCTGCTGTT 
GGGCTTCTGC ATAATATATT ACCGTCTCAG 
GAACTTCTTT CCCTGGTTTA TAAAGGCATT 
TCTCTAGACC TCAGTTGTAA GCAGCTGGCC 
GGAGGACTGT GTGAGCGCCT TGTGAGTCTT 
tccttgggca GCTCACAGGG CAGCGTCATC 
TTGTTCTCAG AAACGATCAA CACGGAATTA 
CTCATGCAGT CTTCAGTGGA TAATACCAAA 
GACCAGAGCT TCAGGGAGCG AGCAAACCAG 
ATTCTGCAAC ACTGGAAGAA GTGTGATTCA 
AAAATGGCAG TGCTGGCCTT ACTGGCAAAA 
AATACAAGTC ATGGTTCATT CCCTGAAGTC 
ACAAAGCTGG ATCTACATTT AAAGGGCCAA 
CTCACTGGAG GCAGTCTGGA GGAACTTAGA 
TTCCCCATGC AGTCCAGGGA ATTTCCTCCA 
TGCATGAAAA AGTTTCTAGA TGCATTGGAA 
ATGACAGAAG TTCTTTGTCG GGAACAGCAG 
TTCAGGAGGA TTGCCAGAAG GGGTTCATGT 
TATGAAATGT TCAGGAAGGA TGACCCCCGC 
CGCTCCCTCC TCACTCTGCT GTGGCACTGT 
ACAATTGTGG TGGATGCCAT TGATGTGTTG 
ACCTTTGATA CTCAAATCAC CAAGAAGATG 
TCTCGCCTTC" CCAAAGATGA TGTTCATGCT 
GGCTCGTGTA TTACAGAAGG AAATGAACTT 
GCATTTACAG AGAACATGGC AGGAGAGAAT 
TCTGCAGCAT ACAACTGGGC CATATCTGTC 
TACCAAGGTT TTCTGTTTAG TGAAAAACCA 
ATCGACCTGA AGCGCCGCTA TAATTTTCCT 
AAAAAGTACA TTGAAATTAG GAAAGAAGCC 
CCTTCCTATA TGTCTTCOCT GTCATATTTG 
CAATTTGATT TCTCAACCGG AGTTCAGAGC 
GCCACTGGTC GTTTTCGGAG ACGGGAGCAG 
GAGCTGGAGA TGGACGAGCT CAATCGGCAT 
AAGCACATGC ACAGAAGCCT GGGCCCGCCT 
CTTCCTTCTT GGATGAAATT CCTCCATGGC 
ATCCGTCTCT TCTTAGCCAA GCTTGTTATT 
AAGCACTGGC TTAGCCCCTT GCTGCAGCTG 
ATTCACTACA TGGTGGTTGA GATAGTGGCC 
CCAACAGGGG TCCCTAAAGA. TGAAGTGTTA 
CATGTCTTTC ATCCAAAAAG AGCTGTGTTT 
GTCGAGTGCT GGAAGGATTG TTTATCCATC 
GGTAAAGATC CTAATTCTAA AGACAACTCA 
GCCAATGACC TGCCTCCCTA TGACCCACAG 
GCTTTGGTGA ATAATATGTC CTTTGTAAGA 
GTTCTAGGAC TTATAGTTCG ATATGTTATG 
TGTGAACTGG TTGCGAAACA ATTGAAGCAA 
GTGTGCTTGA ACAAAGTGAC CAAGAGCTTC 
GTGTTCTTTC TGCTGCCAAA ATTTCATGGA 
CTTTGTCGTG TGGAGGGAAT GACAGAGCTG 
CAAGTCATGA GACATAGAGA TGATGAAAGA 
ATGATGCCAA AGTTAAAACC AGTAGAACTC 
GTTTCCCATC CTTCTACAAC ATGTAGGGAA 
GATAATTACA GAGATCCAGA AAGTGAGACA 
GCAAAAGATG TGCTGATTCA AGGATTGATC 
CGAAATTTCT GGAGCCATGA AACTAGGTTA 
CTAAATTCCT TATATTCTCC TAAGATAGAA 
CTGCTCGAAA TGACCAGCAT GAGCCCAGAT 
TCAGAATGCG AATTTCAGGA ATATACCATT 
CTCACTCOGA TGTTTGTGGA GACCCAGGCC 
GAAGGGTCCC TCTCAGCTCG CTGGGCAGTG 
CATGACTTCA CACTGACACA GACTGCAGAT 
AGCAGCACTG ACCCGCTGGT . CGACCACACC 
CACAAGAGGA GTGAAAGGTT ACAGAGAGCA 
AAAAAAAGGC TGGGCCTTCC AGGGGACGAG 
CGGACGGACC TACTACGACT GCGCAGACGG 
ATGTATGCCA GAAAAGGCGT TGCTGAGCAA 
AAAATGAAGC AGGATGCCCA GGTCGTTCTG 
GACATTCAGA TCAAGCACAG CAGCGTCATC 
ccaataattg" CAAAACAGCT CTTTAGCAGC 
AAATTTAAGA CACTGTCTGA AAAAAACAAC 
CGTTTTCTTA ATACCACCTT CTCTTTCTTT 
AGCTGTCAGC ACGCAGCCCT GCTGAGCCTC 
GCCAGCCTAC AGCAGCCCGT GGGCATCCGC 
CCTGCTGAGC. TGCCTGCCAA GCGAGTCCGT 
AGATGGGTGG AGCTTGCTAA GCTGTATAGA 
ATTTTTACCA GTGAGATAGG- AACAAAGCAA 
AGAAGTGATT ATTCTGAAGC TGCTAAGCAG 
GTAGATGGTG AGCCCACAGA AGCCGAGAAG 



GGCGAGAGAA CTGTAGGAGC GCTCCAGGTC 3900 

AAAGCAGTGG CTTTCTTCTT ' AGAAAGCATT 3960 

TGCTTTGGCA CTGGGGCAGC AGGTAACAGA 4020 

TACAGCAAAT GCACCGTTGT GGTCCGGATT 4080 

TCCCCGGAAG GATGGAAGCT CCTGAAGAAG 4140 

CTGGTGCAGA CGCTGTGTGA GCCCGCAAGC 4200 

ATGGCTCATC TTCCTGATGT TTGTGTGAAT 4260 

AAAGATATCC TAGAGACCCA TCTGAGAGAG 4320 

TGTGCCGTCA ACTTGTATGG CCCTGACGCG 4380 

GTGTCTGCCT GTAAACAGCT TCACAGAGCT 4440 

TCCACAGATT TGCATCATTC- TGTTGGCACA 4500 

GCCCCTGGAG ATGAGAGACA GTGT C TGCCT 4560 

AGCGGACTTC TGGAGTTAGC CTTTGCTTTT 4620 

CTCCTGAACC CAGCGGTGCT GTCCACGGCG 4680 

CACTTCTCCC ATGGGGAGTA TTTCTATAGC 4740 

TTGAAAAATC TGGATCTTGC TGTATTGGAG 4800 

ATGGTGAGTG CCGTTTTGAA CGGCATGTTA 4860 

AAACACCAAG GACTGAAACT TGCGACTACA 4920 

TGGTGGGCCA AAGATTCCCC TCTCGAAACT 4980 

ATTTTACAGA TTGATTCATC TGTATCTTTT 5040 

TTTACAACAT ATATTAGTCT ACTTGCTGAC 5100 

GCTGTCACTC TTCTTCCATT CTTCACCAGC 5160 

CGTGTTCTGG AGCAGCTCAT CGTTGCTCAC 5220 

GGAACTCCGC GGTTCAATAA TTATGTGGAC 5280 

TTATCTCAAA GCCCTATGTT GTTGGAATTG 5340 

CATGTCATGG AAGAATTATT TCAATCCAGT 5400 

GTCACACAAG TAGGCCTTCT GGAAAGCGTG 5460 

CTAAGTTTCA CACGCCAGTC CTTTGTGGAC 5520 

AGCCTGGATG CTTTGAGAGA ATTCTTCAGC 5580 

AAGTCCAGGT TTACAAAGCT AAATGAATCT 5640 

GGCTACTATA AGATTCTAGA CGTGATGTAT 5700 

AAGGAATCAA AAATTAATCA AGTTTTCCAT 5760 

ACAAAGACAT TGATTAAATT GTGCTACGAT 5820 

CAGCTGCTGG AGAGGAGAAG ACTTTACCAT 5880 

ATCTGCTGTG TCTTCAATGA GTTAAAATTT 5940 

GAAAAGAACT TGCTTATTTT TGAAAATCTG 6000 

GTAGAAGTTG AGGTTCCTAT GGAAAGAAAG 6060 

AGAGAAGCAG CAAATGGGGA TTCAGATGGT 6120 

GCAGACAGTA CCCTGAGTGA GGAAATGAGT 6180 

TATTCATACA GCTCCCAAGA CCCTAGACCT 6240 

CGGGACCCCA CGGTGCATGA TGATGTGCTG 6300 

GAGTGCATGG CGCCCCTGAC GGCCCTGGTC 6360 

CAAGGAGAAG AGGATTCAGT GCCAAGAGAT 6420 

AAACTGGGAA ATCCAATAGT ACCATTAAAT 6480 

AATACAGAAG AGGTCTTTCG CCCTTACGCG 6540 

GCTGCTTCTG AAAACAATGG AGGAGAAGGA 6600 

ACTATTCTTT CATGGACAGG CTTGGCCACT 6660 

GCAAATCGAT TGCTTAATTT CCTAATGAAA 6720 

AGACACAACC TTGAAATTAT AAAGACCCTT 6780 

CCTTATAGGT TAATATTTGA AAAGTTTTCC 6840 

GTAGGGATTC AATTGCTAGG CATCGTGATG 6900 

TGTGGCATCC AGAGTAGCGA ATACTTCCAG 6960 

TATAAAGAAG TGTATGCCGC TGCAGCAGAA 7020 

GAGAGAAAAA ACATACTGGA GGAGTCTCTG 7080 

CATCAGAATA CTATGGAGGA CAAGTTTATT 7140 

CCTCCTCTTG CAGACAGGTT CATGAATGCT 7200 

GTGTTGAAAA CACTCTGTCT GGAGGTGGTA 7260 

TACTTCCAGT TAAAGAGCAA GGACTTCGTT 7320 

CAAAAAGTAT GTTTGGACAT AATTTAT AAG 7380 

CGAGAACTTC TGAACCCCGT TGTGGAATTC 7440 

CAAATGTATA ATATTCTCAT GTGGATTCAT 7500 

GATAATGACT CCCAGGAAAT ATTTAAGTTG 7560 

GATGAGAACC CTGGACTTCA ATTAATTATT 7620 

CCTTCAAATA CCTTGGACCG GTTGCTGGCA 7680 

GTGCACTTTT TAAGTTTAGC AACAAATTTT 7740 

TATCCAAACC CCATGTTCGA GCATCCTCTG - 7800 

GATTCTGATT GGCGTTTCCG AAGTACTGTT 7860 

TCCCAGGGCA CTCTCCAGAC CCGTACCCAG 7920 

GCAGGGCAGA TAAGGGCCAC CCAGCAGCAG 7980 

GGAAGAAGCT CATTTGATTG GCTGACCGGG 8040 

AGTCCCTCAT CTGACTCCTT GCTGTTTGCC 8100 

CCCTTGAAGT CAGTGGGGCC TGATTTTGGG 8160 

GTGGATAACA AAGTGAAAGG TGCGGCCGGC 8220 

TTTATGAGGG ACCAGGAGAA GCTCAGTTTG 82 BO 

AAACGAGAGA AGGAAATCAA GAGTGAGTTA 8340 

TACAGAAGCT ACCGGCACGG AGACCTTCCT 8400 

ACCCCGTTAC AGGCCGTGGC CCAGAGGGAC 8460 

TTGTTTTCTG GAATTTTGAA AGAGATGGAT 8520 

ATCACTCAAA AGTTGCTTCA AGACTTCAAT 8580 

CCACCCTTTG TCTCTTGTAT TCAGGACATT 8640 

GACCCAGCGG CTGTTAGCGC TGGTTGCCTG 8700 

CTGCTAGAGG AGGCTCTGCT CCGCCTGCTG 8760 

GGGAAGGCCC GCCTCCCTCC TGATGTCCTC 8820 

TCAATTGGAG AATACGACGT CCTCCGTGGG 8880 

ATCACTCAGA GTGCATTATT AGCAGAAGCC 8940 

TATGATGAGG CTCTCAATAA ACAAGACTGG 9000 

GATTTTTGGG AACTTGCATC CCTTGACTGT 9060 
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TACAACCAOC TTGCTGAGTG GAAATCACTT GAATACTGTT CTACAGCCAG TATAGACAGT ' .9120 
GAGAACCCCC. CAGACCTAAA-TAAAATCTGG AGTGAACCAT TTTATCAGGA AACA7ATCTA 9180 
' CCTTACATGA TCCGCAGCAA GCTGAAGCTG CTGCTCCAGG GAGAGGCTGA CCAGTCCCTG. \ 9240 
• CTGACATTTA . TTGACAAAGC -TATGCACGGG GAGCTCCAGA AGGCGATTCT AGAGCTTCAT 9300 
TACAGTCAAG AGCTGAGTCT GCTTTACCTC CTGCAAGATG ATGTTGACAG AGCCAAATAT -9360 
TACATTCAAA ATGGCATTCA GAGTTTTATG CAGAATTATT CTAGTATTGA TGTCCTCTTA . 9420 
CACCAAAGTA GACTCACCAA ATTGCAGTCT GTACAGGCTT TAACAGAAAT TCAGGAGTTC 9480 
ATCAGCTTTA TAAGCAAACA AGGCAATTTA TCATCTCAAG TTCCCCTTAA GAGACTTCTG 9 540 
AACACCTGGA CAAACAGATA TCCAGATGCT AAAATGGACC CAATGAACAT CTGGGATGAC 9600 
ATCATCACAA ATCGATGTTT CTTTCTCAGC AAAATAGAGG AGAAGCTTAC CCCTCTTCCA 9660 . 
GAAGATAATA GTATGAATGT GGATCAAGAT GGAGACCCCA GTGACAGGAT GGAAGTGCAA 9720 
GAGCAGGAAG AAGATATCAG CTCCCTGATC AGGAGTTGCA AGTTTTCCAT GAAAATGAAG 9780 
ATGATAGACA GTGCCCGGAA GCAGAACAAT TTCTCACTTG CTATGAAACT ACTGAAGGAG 9840 
CTGCATAAAG AGTCAAAAAC CAGAGACGAT TGGCTGGTGA GCTGGGTGCA GAGCTACTGC 9900 
CGCGTGAGCC ACTGCCGGAG CCGGTCCCAG GGCTGCTCTG AGCAGGTGCT CACTGTGCTG 9960 
AAAACAGTCT CTTTGTTGGA TGAGAACAAC GTGTCAAGCT ACTTAAGCAA AAATATTCTG 10020 
GCTTTCCGTG ACCAGAACAT .TCTCTTGGGT ACAACTTACA GGATCATAGC GAATGCTCTC 10080 
AGCAGTGAGC CAGCCTGCCT TGCTGAAATC GAGGAGGACA AGGCTAGAAG AATCTTAGAG 10140 
CTTTCTGGAT CCAGTTCAGA GGATTCAGAG AAGGTGATCG CGGGTCTGTA CCAGAGAGCA 10200 
TTCCAGCACC TCTCTGAGGC TGTGCAGGCG GCTGAGGAGG AGGCCCAGCC TCCCTCCTGG 10260 
AGCTGTGGGC CTGCAGCTGG GGTGATTGAT GCTTACATGA CGCTGGCAGA TTTCTGTGAC 10320 
CAACAGCTGC GCAAGGAGGA AGAGAATGCA TCAGTTATTG ATTCTGCAGA ACTGCAGGCG 10380 
TATCCAGCAC TTGTGGTGGA GAAAATGTTG AAAGCTTTAA AATTAAATTC CAATGAAGCC 10440 
AGATTGAAGT TTCCTAGATT ACTTCAGATT ATAGAACGGT ATCCAGAGGA GACTTTGAGC 10500 
CTCATGACAA AAGAGATCTC TTCCGTTCCC TGCTGGCAGT TCATCAGCTG GATCAGCCAC 10560 
ATGGTGGCCT TACTGGACAA AGACCAAGCC GTTGCTGTTC AGCACTCTGT GGAAGAAATC 10620 
ACTGATAACT ACCCGCAGGC TATTGTTTAT CCCTTCATCA TAAGCAGCGA AAGCTATTCC 10680 
TTCAAGGATA CTTCTACTGG TCATAAGAAT AAGGAGTTTG TGGCAAGGAT TAAAAGTAAG 10740 
TTGGATCAAG GAGGAGTGAT TCAAGATTTT ATTAATGCCT TAGATCAGCT CTCTAATCCT 10800 
GAACTGCTCT TTAAGGATTG GAGCAATGAT GTAAGAGCTG AACTAGCAAA AACCCCTGTA 10860 
AATAAAAAAA ACATTGAAAA. AATGTATGAA AGAATGTATG CAGCCTTGGG TGACCCAAAG 10920 
GCTCCAGGCC TGGGGGCCTT TAGAAGGAAG TTTATTCAGA CTTTTGGAAA AGAATTTGAT 10980 
AAACATTTTG GGAAAGGAGG TTCTAAACTA CTGAGAATGA AGCTCAGTGA CTTCAACGAC 11040 
ATTACCAACA TGCTACTTTT AAAAATGAAC AAAGACTCAA AGCCCCCTGG GAATCTGAAA 11100 
GAATGTTCAC CCTGGATGAG CGACTTCAAA GTGGAGTTCC TGAGAAATGA GCTGGAGATT 11160 
CCCGGTCAGT ATGACGGTAG GGGAAAGCCA TTGCCAGAGT ACCACGTGCG AATCGCCGGG 11220 
TTTGATGAGC GGGTGACAGT CATGGCGTCT CTGCGAAGGC CCAAGCGCAT CATCATCCGT 11280 
GGCCATGACG AGAGGGAACA CCCTTTCCTG GTGAAGGGTG GCGAGGACCT GCGGCAGGAC 11340 
.CAGCGCGTGG AGCAGCTCTT CCAGGTCATG AATGGGATCC TGGCCCAAGA CTCCGCCTGC 11400 
AGCCAGAGGG CCCTGCAGCT GAGGACCTAT AGCGTTGTGC CCATGACCTC CAGGTTAGGA 11460 
TTAATTGAGT GGCTTGAAAA TACTGTTACC TTGAAGGACC TTCTTTTGAA CACCATGTCC 11520 
CAAGAGGAGA AGGCGGCTTA CCTGAGTGAT CCCAGGGCAC CGCCGTGTGA ATATAAAGAT 11580 
TGGCTGACAA AAATGTCAGG AAAACATGAT. GTTGGAGCTT ACATGCTAAT GTATAAGGGC 11640 
GCTAATCGTA CTGAAACAGT CACGTCTTTT AGAAAACGAG AAAGTAAAGT GCCTGCTGAT 11700 
CTCTTAAAGC GGGCCTTCGT GAGGATGAGT ACAAGCCCTG AGGCTTTCCT GGCGCTCCGC 11760 
TCCCACTTCG CCAGCTCTCA CGCTCTGATA TGCATCAGCC ACTGGATCCT CGGGATTGGA 11820 
GACAGACATC TGAACAACTT TATGGTGGCC ATGGAGACTG GCGGCGTGAT CGGGATCGAC 11880 
TTTGGGCATG CGTTTGGATC CGCTACACAG TTTCTGCCAG TCCCTGAGTT GATGCCTTTT 11940 
CGGCTAACTC GCCAGTTTAT CAATCTGATG TTACCAATGA AAGAAACGGG CCTTATGTAC 12000 
AGCATCATGG TACACGCACT CCGGGCCTTC CGCTCAGACC CTGGCCTGCT CACCAACACC 12060 
ATGGATGTGT TTGTCAAGGA" GCCCTCCTTT GATTGGAAAA ATTTTGAACA GAAAATGCTG 12120 
AAAAAAGGAG GGTCATGGAT TCAAGAAATA AATGTTGCTG AAAAAAATTG GTACCCCCGA 12180 ' 
CAGAAAATAT GTTACGCTAA GAGAAAGTTA GCAGGTGCCA ATCCAGCAGT CATTACTTGT 12240 
GAT GAG CT AC TCCTGGGTCA TGAGAAGGCC CCTGCCTTCA GAGACTATGT GGCTGTGGCA 12300 
CGAGGAAGCA AAGATCACAA CATTCGTGCC CAAGAACCAG AGAGTGGGCT TTCAGAAGAG 12360 
ACTCAAGTGA AGTGCCTGAT. GGACCAGGCA ACAGACCCCA ACATCCTTGG CAGAACCTGG 12420 
GAAGGATGGG AGCCCTGGAT GTGAGGTCTG TGGGAGTCTG CAGATAGAAA GCATTACATT 12480 
GTTTAAAGAA TCTACTATAC TTTGGTTGGC AGCATTCCAT GAGCTGATTT TCCTGAAACA 12540 
CTAAAGAGAA ATGTCTTTTG TGCTACAGTT TCGTAGCATG AGTTTAAATC AAGATTATGA 12600 
TGAGTAAATG TGTATGGGTT AAATCAAAGA TAAGGTTATA GTAACATCAA AGATTAGGTG 12660 
AGGTTTATAG AAAGATAGAT ATCCAGGCTT ACCAAAGTAT TAAGTCAAGA ATATAATATG 12720 
TGATCAGCTT TCAAAGCATT TACAAGTGCT GCAAGTTAGT GAAACAGCTG TCTCCGTAAA 12780 
TGGAGGAAAT GTGGGGAAGC CTTGGAATGC CCTTCTGGTT CTGGCACATT GGAAAGCAGA 12840 
CTCAGAAGGC TTCATCACCA AGATTTTGGG AGAGTAAAGC TAAGTATAGT TGATGTAACA 12900 
TTGTAGAAGC AGCATAGGAA CAATAAGAAC AATAGGTAAA GCTATAATTA TGGCTTATAT 12960 
TTAGAAATGA CTGCATTTGA TATTTTAGGA TATTTTTCTA GGTTTTTTCC TTTCATTTTA 13020 
TTCTCTTCTA GTTTTGACAT TTTATGATAG ATTTGCTCTC TAGAAGGAAA CGTCTTTATT 13080 
TAGGAGGGCA AAAATTTTGG TCATAGCATT CACTTTTGCT ATTCCAATCT ACAACTGGAA 13140 
GATACATAAA AGTGCTTTGC ATTGAATTTG GGATAACTTC AAAAATCCCA TGGTTGTTGT 13200 
TAGGGATAGT- ACTAAGCATT TCAGTTCCAG GAGAATAAAA GAAATTCCTA TTTGAAATGA 13260 
ATTCCTCATT TGGAGGAAAA AAAGCATGCA TTCTAGCACA ACAAGATGAA ATTATGGAAT. 13320 
ACAAAAGTGG CTCCTTCCCA TGTGCAGTCC CTGTCCCCCC CCGCCAGTCC TCCACACCCA 13380 
AACTGTTTCT GATTGGCTTT TAGCTTTTTG TTGTTTTTTT TTTTCCTTCT AACACTTGTA 13440 
TTTGGAGGCT CTTCTGTGAT TTTGAGAAGT ATACTCTTGA GTGTTTAATA AAGTTTTTTT 13500 
CCAAAAGTA- 

Seq ID NO; 99 Protein sequence: 
Protein Accession #: NP_008835.5 

1 11 ' 21 - 31 41 51 

1 .1.1 I I I 

MAGSGAGVRC SLLRLQETLS AADRCGAALA GHQLIRGLGQ ECVLSSSPAV LALQTSLVFS 60 
RDFGLLVFVR KSLNS I EFRB CREEILKFLC IFLEKKGQKI APYSVEIKNT CTSVYTKDRA 120 
AKCKIPALDL LIKLLQTFRS..SRLMDEPKIG. ELPSKFYGEL ALKKKIPDTV LEKVYELLGL ' 180 
LGEVHPSEMI NNAENLPRAF 'lGELKTQMTS AVREPKLPVL AGCLKGLSSL LCNFTKSMBE 240 
DPQTSREIFN FVLKAIRPQI DLKRYAVPSA GLRLFALHAS QPSTCLLDNY VSLPEVLLKW 300 
CAHTNVELKK AALSALESFL KQVSNMVAKN AEMHKNKLQY FMEQFYGIIR NVDSNNKELS 360 
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IAIRGYGLFA GPCKVINAKD . VDFMYVBLIQ RCKQMFLTQT DTGDDRVYQM PSFLQSVASV 
LLYLDTVPEV YTPVtiEHLW .MQIDSFPQYS PKMQLVCCRA IVKVFLALAA KGPVLRNCIS 
.TWKQGLIRI CSKPWLPKG PSSESEDKRA SGEVRTGKWK VPTYKDYVDL FRHLLSSDQM 
MDSILADEAF FSVNSSSESL NHLLYDEFVK SVLKTVEKLD LTLBIQTVGE QENGDEAPGV 
WMIPTSDPAA NLHPAKPKDP SAFINLVEFC REILPEKQAE FFEPWVYSFS YELILQSTRL 
PLISGFYKLL SITVRHAKKI KYFEGVSPKS LKHSPEDPEK YSCFALFVKF GXEVAVKMKQ 
YKDELLASCL TFLLSLPKNI IELDVRAYVP ALQMAFKLGL SYTPLAEVGL NALEEWSIYI 
DRHVMQPYYK DILPCLDGYL KTSALSDETK NNWEVSALSR AAQKGFNKW LKHLKKTKNL 
SSNEAISLEE .IRIRVVQMLG SLGGQINKNI* LTVTSSDEKM KSYVAWDREK RLSFAVPFRE 
MKPVIFLDVF LPRVTELALT ASDRQTKVAA CELLHSMVMF HLGKATQMPE GGQGAPPMYQ 
LYXRTFPVLL RLACDVDQVT RQLYEPLVMQ LIHWFTMNXK FESQDTVALL EAILDGIVDP 
VDSTLRDFCG RCIREFLKWS IKQITPQQQE KSPVNTKSLF KRLYSLAIiHP NAFKRLGASL 
AFNNIYREFR EEESLVEQFV FEALVIYMES LALAHADEXS LGTIQQCCDA IEHLCRIIEK 
KHVSLNKAKK RRLPRGFPPS ASLCLLDLVK WLLAHCGRPQ TECRHKSIEL FYKFVPLLPG 
NRSPNLWLKD VLXEEGVSFL INTFEGGGCG QPSGILAQPT LLYLRGPFSL QATLCWLDLL 
LAALECYNTF IGERTVGALQ VLGTEAQSSL LKAVAFFLES IAMHDIIAAE KCPGTGAAGN 
RTSPQEGERY NYSKCTVWR IMEFTTTLLN TSPEGWKLLK KDLCNTHLMR VLVQTLCEPA 
SIGFNIGDVQ VMAHLPDVCV NLMKALKMSP YKDILETHLR EKITAQSIEE LCAVNLYGPD 
AQVDRSRLAA WSACKQLHR AGLLHNILPS QSTDLHHSVG TELLS LVYKG IAPGDERQCL 
PSLDLSCKQL ASGLLELAFA FGGLCERLVS LLLNPAVLST ASLGSSQGSV IHFSHGEYFY 
SLFSETINTE LLKNLDLAVL ELMQSSVDNT KMVSAVLNGM LDQSFRERAN QKHQGLKLAT 
TILQHWKKCD SWWAKDSPLE TKMAVLALLA KILQIDSSVS FNTSHGSFPE VFTTYISLLA 
DTKLDLHLKG QAVTLLPFPT SLTGGSLEEL RRVLEQLIVA HFPMQSREFP PGTPRFNNYV 
DCMKKFLDAL ELSQSPMLLE LKTEVLCRSQ QHVMEELFQS SFRRIARRGS CVTQVGLLES 
VYEMFRKDDP RLSFTRQSFV DRSLLTLLWH CSLDaLrEFP STIWDAIDV LKSRFTKLNE 
STFDTQITKK KGYYKILDVM YSRLPKDDVH AKESKINQVF HGSCITEGNE LTKTLIKLCY 
DAFTENMAGE NQLLERRRLY HCAAYNCAIS VICCVFNELK FYQGFLFSEK PEKNLLIFEM 
LIDLKRRYNF PVEVEVPMER KKKYIEIRKE AREAANGDSD GPSYMSSLSY LADSTLSEEM 
SQFDFSTGVQ SYSYSSQDPR PATGRFRRRE QRDPTVHDDV LELEMDELNR HECMAPLTAL 
VKHMHRSLGP PQGEEDSVPR DLPSWMKFLH GKLGNPIVPL NIRLFLAKLV INTEEVFRPY 
AKHWLSPLLQ LAASENNGGE GIHYMWEIV ATILSWTGLA TPTGVPKDEV LANRLLNFLM 
KHVFHPKRAV FRHNLEIIKT LVECWKDCLS IPYRLIFEKP SGKDPNSKDN SVGIQLLGIV 
MANDLPPYDP QCGIQSSEYF QALVNNMSFV RYKEVYAAAA EVLGLILRYV MERKNILEES 
LCELVAKQLK QHQNTMEDKF IVCLNKVTKS FPPLADRFMN AVFFLLPKFH GVLKTLCLEV 
VLCRVEGMTE LYFQLKSKDF VQVMRHRDDE RQKVCLDIIY KMMPKLKPVE LRELLNPWE 
FVSHPSTTCR EQMYNILMWI HDNYRDPESE TDNDSQEIFK LAKDVLIQGL IDENPGLQLI 
IRNFWSHETR LPSNTLDRLL ALNSLYSPKI EVHFLSLATN FLLEMTSMSP DYPNPMFEHP 
LSECEFQEYT IDSDWRFRST VLTPMFVETQ ASQGTLQTRT QEGSLSARWP VAGQIRATQQ 
QHDFTLTQTA DGRSSFDWLT GSSTDPLVDH TSPSSDSLLF ' AHKRSERLQR APLKSVGPDF 
GKKRLGLPGD EVDNKVKGAA GRTDLLRLRR RFMRDQEKLS LMYARKGVAE QKREKEIKSE 
LKMKQDAQW LYRSYRHGDL PDIQIKHSSL ITPLQAVAQR DPIIAKQLFS SLFSGILKEM 
DKFKTLSEKN NITQKLLQDF NRFLNTTFSP FPPFVSCIQD ISCQHAALL9 LDP AAV SAG C 
LASLQQPVGI RLLEEALLRL LPAELPAKRV RGKARLPPDV LRWVELAKLY RSIGEYDVLR 
GIFTSEIGTK QITQSALLAE ARSDYSEAAK QYDEALNKQD WVDGEPTEAE KDFVJELASLD 
CYNHLAEWKS LEYCSTASID SENPFDLNKI WSEPFYQETY LPYMIRSKLK LLLQGEADQS 
LLTFIDKAMH GELQKAI LEL HYSQELSLLY LLQDDVDRAK YYIQNGIQSP MQNYSSIDVL 

. LHQSRLTKLQ SVQALTEIQE FISFISKQGN- LSSQVPLKRL LNTWTNRYPD AKMDPMNIWD 
DIITNRCFFL SKIEEKLTPL PEDNSKNVDQ DGDPSDRMEV QEQEEDISSL IRSCKFSMKM 

. KMIDSARKQN NFSLAMKLLK ELHKESKTRD DWLVSWVQSY CRLSHCRSRS QGCSEQVLTV. 
LKTVSLLDEN NVSSYLSKNI LAFRDQNILL GTTYRI I ANA LSSEPACLAE IEEDKARRIL 
ELSGSSSEDS EKVIAGLYQR AFQHLSEAVQ AAEEEAQPPS WSCGPAAGVI DAYMTLADFC 
DQQLRKEEEN ASVIDSAELQ AYPALWEKM LKALKLNSNE ARLKFPRLLQ I IERYPEETL 
SLMTKEISSV PCWQFISWIS HMVALLDKDQ AVAVQHSVEE ITDNYPQAIV YPFIISSESY 
SFKDTSTGHX NKEFVARIKS KLDQGGVIQD FINALDQLSN PELLFKDWSN DVRAELAKTP 
VNKKNIEKMY ERMYAALGDP KAPGLGAFRR KFIQTFGKEF DKHFGKGGSK LLRKKLSDFN 

. DITNMLLLKM NKDSKPPGNL KECSPWMSDF KVEFLRNELE IPGQYDGRGK PLPEYHVRIA 
GFDERVTVMA SLRRPKRIII RGHDEREHPF LVKGGEDLRQ DQRVEQLFQV MNGILAQDSA 
CSQRALQLRT YSWPMTSRL GLIEWLENTV TLKDLLLNTM SQEEKAAYLS DPRAPPCEYK 
DWLTKMSGKH DVGAYMLMYK GANRTETVTS FRKRESKVPA DLLKRAFVRM STSPEAFLAL 
RSHFASSHAL ICISHWILGI GDRHLNNFMV AMETGGVIGI DFGHAFGSAT QFLPVPELMP 
FRLTRQFINL MLPMXETGLM YSIMVHALRA FRSDPGLLTN TMDVFVKEPS FDWKNFEQKM 
LKKGGSWIQE INVAEKNWYP RQKICYAKRK LAGANPAVIT CDELLLGHEK APAFRDYVAV 
ARGSKDHNIR AQEPESGLSE ETQVKCLMDQ ATDPNILGRT WEGWEPWM 

Seq ID NO: 1O0 D3STA sequence 
Nucleic Acid Accession #: NMJ)00673 
Coding sequence: 101-1225 . 
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ATGTGAAGGC. 
GTCTATGTTT 
• TATTAAATGC 
AGAAGTTGCC 
TCGCACAGAT 
ACATGAGGCA 
TGACAAAGTC 
AGATGGCAAC 
CACCAGATTT 
CGAGTACACA 
GAAAGTCTGT 
CAAGGTCAAA 
CATCATGGGC 
CAAATTTGAG 
CAAACCCATC 
AGTTATTGGG 
GACCAGCGTG 



11 
I 

ACAAGCTGCT 
GCAGAAATAC 
AAAGCAGCTG. 
CCACCAAAGA 
GACCATGTGA* 
ACTGGGATTG 
ATCCCTCTCT 
CTTTGCATTA 
ACATGCAAGG 
GTGGTGGATG 
TTAATTGGCT 
CCTGGTTCCA' 
TGTAAGTCAG 
AAGGCCATGG 
AGTGAGGTGC 
CATCTTGAAA. 
GTTGTAGGAG 



21 

I * 

GTTATATACA 
AGATCCAAGA 
TGCTTTGGGA 
CTAAAGAAGT 
TAAAAGGAAC 
TAGAGAGCAT 
TTCTGCCACA 
GGAGCGATAT 
GCAAACCAGT 
AATCTTCTGT 
GTGGGTTTTC 
CTTGCGTCGT 
CTGGTGCATC 
CTGTAGGTGC 
TGTCAGAAAT 
CCATGATTGA 
TTCCTCCATC 



31 
I 

ACAGAGTGAA 
CAAAGACAGG 
GCAGAAGCAA 
TCGCATTAAG 
AATGGTGTCC 
TGGAGAAGGA 
ATGTAGAGAA 
TACTGGTCGT 
ACACCACTTC 
TGCTAAGATT 
CACTGGATAT 
CTTTGGCCTG 
TAGGATCATT 
CACTGAGTGT 
GACAGGCAAC 
TGCCCTGGCA 
AGCCAAGATG 



41 
I 

CTGAGCATCA 
ATGGGCACTG 
CCCTTCTCCA 
ATTTTGGCCA 
AAGTTTCCAG 
GTGACTACAG 
TGCAATGCTT 
GGAGTACTGG 
ATGAACACCA 
GATGATGCAG 
GGCGCTGCTG 
GGAGGAGTTG 
GGGATTGACC 
ATCAGTCCCA 
AACGTGGGAT 
TCCTGCCACA 
CTCACCTATG- 



SI 
I 

GTCAGAAAAA 

CTGGAAAAGT 

TTGAGGAAAT 

CAGGAATCTG 

TGATTGTGGG 

TGAAACCAGG 

GTCGCAACCC 

CTGATGGCAC 

GTACATTTAC 

CTCCTCCTGA- 

TTAAAACTGG 

GCCTGTCAGT 

TCAACAAAGA 

AGGACTCTAC . 

ACACCTTTGA 

TGAACTATGG 

ACCCGATGTT 



420 
480 
540 
- 600 
. 660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960. 
1020 
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GCTCTTCACT 
TGTCCCAAAA 
TCATGTTTTA 
CATTCGAACG 
GAACTGGAGT 
ACAAGCATAA 
TATAAACATT 
TTGATTTACA 
TATGTTGAAA 
CAGATATAGC 
TTGAAACTAT 
TAACTTGGAT 
AGAAAGACAG 
ATAACTTGGT 
TATTAATATT 
ATATTATCAT 
CCTATTCACT 
CTAAACCG 



GGACGCACAT 
CTAGTGACTG 
CCATTTAAAA 
GTCCTGACGT 
TTGTCTTGTG 
GTAGAAGATT 
TAAAGTCTTG 
TTTTGTAAGG 
TGGAGATTTT 
GTATAAAGAT 
TATTTTTTAG 
TACATTTTGA 
AAAAGATTAA 
GAAACTGAAA 
TTAGAAAATA 
ACTTATCATA 
GTGCTTAGTA 



GGAAGGGATG 
AGTTCCTGGC 
AAATCAGTGA 
TTTGAGATCC 
AGAGTTCCCT 
TGTTGAAGAC 
TGAGCACCTG 
CTATAATTGT 
TAAGAGTTTT 
ATAGTAAATG 
ATTTGAATAT 
AATCAGTTCA 
GGGACGGGCA 
AAG TATAT CA 
TTCCTTTTGT 
ATGTTCAATT 
GTGACTCCAT 



TGTCTTTGGA 

aaagaaattt 
Aggatttgag 

AAAGTGGCAG 
CATCTGAAAT 
ATAGAACCCT 
GGAATTAGTA 
ATCTTTTAAG 
AACCAGCTGC 
CATCTCCCAG 
AAATGTATTT 
TTCCATGATG 
CATTTTTCAA 
TATGGGTACA 
AATACTGAAT 
TGATACAGTA 
TTAATAAAAA 



GGTTTGAAAA 
GACCTGGACC 
CTGCTCAATT 
GAGGTCTGTG 
CATGTATCTG 
TATAAAGAAT 
TAATAACAAT 
AAAACATACA 
TGCAGATATA 
AGTAATATTC 
TTTAAACACT 
CATATTACTG 
CGATTAAGAA 
CAAGGCTATT 
ATAAACATAG 
GAATTGCAAG 
GTGTTTTTAG 



Seq ID NO: 101 Protein sequence:. 
Protein Accession #: NP_000664 



1 

I . 

MGTAGKVIKC 
KFPVIVGHEA 
GVLADGTTRF 
GAAVKTGKVK 
ISPKDSTKPI 
LTYDPMLLFT 
LLNSGQSIRT 



11 
I 

KAAVLWEQKQ 
TGIVESIGEG 
TCKGKPVHHF 
PGSTCVVFGli 
SEVLSEMTGN 
GRTWKGCVFG 
VLTP . 



21 
I 

PPSIEEIEVA 
VTTVKPGDKV 
MNTSTFTEYT 
GGVGLSVIMG 
NVGYTFEVIG 
GLKSRDDVPK 



31 
I 

PPKTKEVRIK 
IPLFLPQCRE 
WDESSVAKI 
CKSAGASPvII 
HLETMIDALA 
LVTEFLAKKP 



Seq. ID NO: 102 DNA sequence 

Nucleic Acid Accession tt: NMJJ06783.1 

Coding sequence: 1..786 



1 • 
I 

ATGGATTGGG 
GGGAAGGTGT 
CAGGAAGTGT 
AAAAATGTGT 
CTGATCTTCG 
GAAACCACTC 
ATTAAAAAGC 
TTTTTCCGAA 
TACCACCTGC 
TTTATTTCTA 
ATTTGCATGC 
AGATCAAAGA 
CAGAATGAAA 
AGCTAA 



11 
1 

GGACGCTGCA 
GGATCACAGT 
GGGGTGACGA 
GCTATGACCA 
TCTCCACCCC 
GCAAGTTCAG 
ACAAGGTTCG 
TCATCTTTGA 
CCTGGGTGTT 
GGCCAACAGA 
TGCTTAACGT 
GAGCACAGAC 
TGAATGAGCT 



21 
1 

CACTTTCATC 
CATCTTTATT 
GCAAGAGGAC 
CTTTTTCCCG 
AGCGCTGCTG 
GCGAGGAGAG 
GATAGAGGGG 
AGCAGCCTTT 
GAAATGTGGG 
GAAGACCGTG 
GGCAGAGTTG 
GCAAAAAAAT 
GATTTCAGAT 



31 
I 

GGGGGTGTCA 
TTCCGAGTCA 
TTCGTCTGCA 
GTGTCCCACA 
GTGGCCATGC 
AAGAGGAATG 
TCGCTGTGGT 
ATGTATGTGT 
ATTGACCCCT 
TTTACCATTT 
TGCTACCTGC 
CACCCCAATC 
AGTGGTCAAA 



41 
I 

ILATGICRTD 
CNACRNPDGN 
DDAAPPEKVC 
GIDLNKDKFE 
SCHMNYGTSV 
DLDQLITHVIi 



41 
I 

ACAAACACTC 
TGATCCTAGT 
ACACACTGCA 
TCCGGCTGTG 
ATGTGGCCTA 
ATTTCAAAGA 
GGACGTACAC 
TTTACTTCCT 
GCCCCAACCT 
TTATGATTTC 
TGCTGAAAGT 
ATGCCCTAAA 
ATGCAATCAC 



GCAGAGATGA. 
AGTTGATAAC 
CAGGACAAAG 
TTGTCATGGT 
TCTCACAAAT 
TATTAACCTT 
GTTAATATTT 
CTTGGATTTC 
TAACTCAAAA 
ACTTAACACA' 
TGTTATGAGT 
GATTAGATTA 
TCATCATTAC 
TGCCAGCATA 
AGCTAGAGTC 
TCCCTAAGTC 
TTTTTAACAA 



SI 
I 

DHVIKGTMVS 
LCIRSDITGR 
LIGCGFSTGY 
KAMAVGATEC 
WGVPPSAKM 
PFKKISEGFE 



51 
I 

CACCAGCATC 
GGTGGCTGCC 
ACCGGGATGC 
GGCCCTCCAG 
CTACAGGCAC 
CATAGAGGAC 
CAGCAGCATC 
TTACAATGGG 
TGTTGACTGC 
TGCGTCTGTG 
GTGTTTTAGG 
GGAGAGTAAG 
AGGTTTCCCA 



Seq ID NO: 103 Protein sequence: 
Protein Accession #:"'np_006774.1 



1 11 21 31 41 51 

I I I I 1 I 

MDWGTLHTFI GGVNKHSTSI GKVWITVIFI FRVMILWAA QEVWGDEQED FVCNTLQPGC 
KNVCYDHFFP VSHIRLWALQ LIFVSTPALL VAMHVAYYRH BTTRKFRRGE KRNDFKDIED 
IKKHKVRIEG SLWWTYTSSI- FFRIIFEAAF MYVFYFLYNG YHLPWVLKCG IDPCPNLVDC 
FISRPTEKTV FTIFMISASV ICMLLNVAEL CYLLLKVCFR RSKRAQTQKN HPNHALKESK 
. QNEMNELISD SGQNAITGFP S 

Seq ID NO: 104 DNA sequence 
Nucleic Acid Accession fl: NM_020411 
Coding sequence: 86-526 



1 

I 

GGACCTGGGA 
AAGGCACGAG 
ACTGGGCGTC 
GGCGACTCGG 
ACAAACACAG 
GAACCAGCAG 
ACAGCTGAGA 
ACCGGGGATA 
ACACTGTAAA 
AAACAACGCA 
CAGCTTTCAC 



11 
I 

AGGAGCATAG 
GGAACCTCAC 
TTCCCATCGG 
GTCCCTGAGG 
AACCACACAG 
CTGAAAGTCG 
TCCCAGTGCG 
AATCTGGATT 
ATGCCAGAAG 
AGCTGGTTTT 
CAAAAAAAAA 



21 
I 

GACAGGGCAA 
TGCGCATGCT 
CCCCTTCGCC 
TCTGGATTCT 
CCAGTCCCAG 
GGATCCTACA 
CGACATGGAA 
TGGGTTCCGG 
CAGGTGAAGA 
ATATTAGATA 
AAAAAA 



31 

i 

GGCGGGATAA 
CCTTTGGTGC 
AGTGTGGGGA 
TTCTCCGCTA 
GAGCCCAGTA 
CCTGGGCAGC 
GGTGATCTGC 
CGTCAAGGTG 
GCAACCACAA 
TTTGACTTAA 



41 
I 

GGAGGGGCAC 
CCACCTCAGT 
ACGCGGCGGA 
CTGAGACACG 
ATGGAGAGCC 
AGACAGAAGA 
AAGAGCTGCA 
AAGATAATAC 
GTTTAAATGA 
ACTATCTCAA 



51 
I 

CACAGCCCTT 
GCGCATGTTC 
GCTGTGAGCC 
GCGGACACAC 
CCAAAAAGAA 
AGATCAGGAT 
TCAGTCAAAC 
CTAAAGAGGA 
AGACAAGCTG 
TAAAGTTTTG 



Seq ID NO: 105 Protein sequence : 
Protein Accession #: NP_065144.1 



1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



60 
120 
160 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 



60 
120 
180 
240 
300- 
360 
420 
480 
540 
600 



11 



21 



31 



41 



51 
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MLLWCPPQCA CSLGVFPSAP SPVWGTRRSC BPATRVPEVW ILSPLLRHGG HTQTQNHT AS • 60 

PRSPVMESPK KXNQQLKVGI LHLGSRQKKI RIQLRSQCAT WKVICKSCIS QTPGINLDLG . 120 
" SGVKVKIIPK EEHCKMPEAG EEQPQV : r . 

" Seq ID NO: 10S DNA sequence 
Nucleic Acid Accession 8": J04129 
Coding sequences 99-587 

1 ; 11 21 31 41 51 

I I I I I I - ■ 

CATCCCTCTG GCTCCAGAGC TCAGAGCCAC CCACAGCCGC AGCCATGCTG TGCCTCCTGC 60 

TCACCCTGGG CGTGGCCCTG GTCTGTGGTG TCCCGGCCAT GGACATCCCC CAGACCAAGC 120 

AGGACCTGGA GCTCCCAAAG TTGGCAGGGA CCTGGCACTC CATGGCCATG GCGACCAACA . 180 

ACATCTCCCT CATGGCGACA CTGAAGGCCC CTCTGAGGGT CCACATCACC TCACTGTTGC 240 

CCACCCCCGA GGACAACCTG. GAGATCGTTC TGCACAGATG GGAGAACAAC AGCTGTGTTG 300 

AGAAGAAGGT- CCTTGGAGAG AAGACTGGGA ATCCAAAGAA GTTCAAGATC AACTATACGG 360 

TGGCGAACGA GGCCACGCTG CTCGATACTG ACTACGACAA TTTCCTGTTT CTCTGCCTAC 420 

AGGACACCAC CACCCCCATC CAGAGCATGA TGTGCCAGTA CCTGGCCAGA GTCCTGGTGG 480 

AGGACGATGA GATCATGCAG GGATTCATCA GGGCTTTCAG GCCCCTGCCC AGGCACCTAT 540 

GGTACTTGCT GGACTTGAAA CAGATGGAAG AGCOGTGCCG TTTCTAGCTC ACCTCCGCCT 600 

CCAGGAAGAC CAGACTCCCA CCCTTCCACA CCTCCAGAGC AGTGGGACTT CCTCCTGCCC 660 

TTTCAAAGAA TAACCACAGC TCAGAAGACG ATGACGTGGT CATCTGTGTC GCCATCCCCT 720 

TCCTGCTGCA CACCTGCACC ATTGCCATGG GGAGGCTGCT CCCTGGGGGC AGAGTCTCTG 780 
GCAGAGGTTA TTAATAAACC CTTGGAGCAT G 



Seq ID NO: 107 Protein sequence: 
Protein Accession ft: AAA60147 

1 11-21 31 41 51 

I I I I I I 

MDIPQTKQDL ELPKLAGTWH SMAMATNNIS LMATLKAPLR VHITSLLPTP EDNLBIVLHR 60 
WENNSCVEKK VLGEKTGNPK KFKINYTVAN EATLLDTDYD NFLFLCLQDT TTPIQSMMCQ 120 
YLARVLVEDD EIMQGFIRAF RPLPRHLWYL LDLKQMEEPC RP 



Seq ID NO: 108 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 48-794 . 

1 ' 11 21 31 41 SI 

I I I I I I 

TCCCAGGCAG CAGTTAGCCC GCCGCCCGCC TGTGTGTCCC CAGAGCCATG GAGAGAGCCA 60 

GTCTGATCCA GAAGGCCAAG CTGGCAGAGC AGGCCGAACG CTATGAGGAC ATGGCAGCCT 120 

TCATGAAAGG CGCCGTGGAG AAGGGCGAGG AGCTCTCCTG CGAAGAGCGA AACCTGCTCT 180 

CAGTAGCCTA TAAGAACGTG GTGGGCGGCC AGAGGGCTGC CTGGAGGGTG CTGTCCAGTA 240 

TTGAGCAGAA AAGCAACGAG GAGGGCTCGG AGGAGAAGGG GCCCGAGGTG CGTGAGTACC 300 

GGGAGAAGGT GGAGACTGAG CTCCAGGGCG TGTGCGACAC CGTGCTGGGC CTGCTGGACA 360 

GCCACCTCAT CAAGGAGGCC GGGGACGCCG AGAGCCGGGT CTTCTACCTG AAGATGAAGG 420 

GTGACTACTA CCGCTACCTG GCCGAGGTGG CCACCGGTGA CGACAAGAAG CGCATCATTG 480 
ACTCAGCCCG GTCAGCCTAC CAGGAGGCCA TGGACATCAG CAAGAAGGAG ATGCCGCCCA . 540 

CCAACCCCAT CCGCCTGGGC CTGGCCCTGA ACTTTTCCGT CTTCCACTAC GAGATCGCCA 600 

ACAGCCCCGA GGAGGCCATC TCTCTGGCCA AGACCACTTT CGACGAGGCC ATGGCTGATC 660 

TGCACACCCT CAGCGAGGAC TCCTACAAAG ACAGCACCCT CATCATGCAG CTGCTGCGAG 720 

ACAACCTGAC ACTGTGGACG GCCGACAACG CCGGGGAAGA GGGGGGCGAG GCTCCCCAGG 780 

AGCCCCAGAG CTGAGTGTTG CCCGCCACCG CCCCGCCCTG CCCCCTCCAG TCCCCCACCC 840 

TGCCGAGAGG ACTAGTATGG GGTGGGAGGC CCCACCCTTC TCCCCTAGGC GCTGTTCTTG 900 

CTCCAAAGGG CTCCGTGGAG AGGGACTGGC AGAGCTGAGG CCACCTGGGG CTGGGGATCC 960 

CACTCTTCTT GCAGCTGTTG AGCGCACCTA ACCACTGGTC ATGCCCCCAC CCCTGCTCTC 1020 

CGCACCCGCT TCCTCCCGAC CCCAGGACCA GGCTACTTCT CCCCTCCTCT TGCCTCCCTC 1080 

CTGCCCCTGC TGCCTCTGAT CGTAGGAATT GAGGAGTGTC . CCGCCTTGTG GCTGAGAACT 1140 

GGACAGTGGC AGGGGCTGGA GATGGGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG 1200 

CGCGCGCGCC AGTGCAAGAC CGAGATTGAG GGAAAGCATG TCTGCTGGGT GTGACCATGT 1260 
TTCCTCTCAA TAAAGTTCCC CTGTGACACT C 

Seq ID NO: 109 Protein sequence: 
Protein Accession #: NP_006133.l 

1 11 : 21 31 41 51 

I I 1 III 

MERASLIQKA KLAEQAERYE DMAAPMKGAV EKGEELSCEE RNLLSVAYKN WGGQRAAWR 60 
VLSSIEQKSN EEGSEEKGPE VREYREKVET ELQGVCDTVL GLLDSHLIKE AGDAESRVPY 120 
LKMKGDYYRY LAEVATGDDK KRIIDSARSA YQEAMDISKK EMPPTNPIRL GIALNFSVFH 180 
YEIANSPEEA ISLAKTTFDE AMADLHTLSE DSYKDSTLIM QLLRDNLTLW TADNAGEEGG 240 
EAPQEPQS . 

Seq ID NO: 110 DNA sequence 
Nucleic Acid Accession #: NM_000695 
Coding sequence : 407-1564 

1 11 21 31 41 51 

I 1 I I I I 

CACGAGTTGG TTTGGGAGCT GCCAGTCTCC TGGGAGGATC GCAGTCAGCA GAGCAGGGCT 60 
GAGGCCTGGG GGTAGGAGCA GAGCCTGCGC ATCTGGAGGC AGCATGTCCA AGAAAGGGAG 120 
TGGAGGTGCA GCGAAGGACC CAGGGGCAGA GCCCACGCTG GGGATGGACC CCTTCGAGGA 180 
CACACTGCGG CGGCTGCGTG AGGCCTTCAA CTGAGGGCGC ACGCGGCCGG CCGAGTTCCG . 240 
GGCTGCGCAG CTCCAGGGCC TGGGCCACTT CCTTCAAGAA AACAAGCAGC TTCTGCGCGA 300 



230 
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OGTGCTGGCC - CAGGACCTGC ' AT AAGCCAGC TTTCGAGGCA GACATATCTG AGCTCATCCT 360 

TTGCCAGAAC GAGGTTGACT AOGCTCTCAA GAACCTTCAG GCCTGGATGA AGGA TGAAC C 420 

ACGGTCCACG AACCTGTTCA" TGAAGCTGGA CTCGGTCTTC ATCTGGAAGG AACCCTTTGG 480 

CCTGGTCCTC ATCATCGCAC CCTGGAACTA CCCATTGAAC CTGACCCTGG TGCTCCTGGT • 540' 

GGGCACCCTC CCCGCAGGGA ATTGCGTGGT GCTGAAGCCG TCAGAAATCA GCCAGGGCAC 600 

AGAGAAGGTC CTGGCTGAGG TGCTGCCCCA GTACCTGGAC CAGAGCTGCT TTGCCGTGGT 660 

GCTGGGCGGA CCCCAGGAGA CAGGGCAGCT GCTAGAGCAC AAGTTGGACT ACATCTTCTT 720 

CACAGGGAGC CCTCGTGTGG GCAAGATTGT CATGACTGCT GCCACCAAGC ACCTGACGCC 780 

TGTCACCCTG GAGCTGGGGG GCAAGAACCC CTGCTACGTG GACGACAACT GOGACCCCCA 840 

GACOGTGGCC AACCGCGTGG CCTGGTTCTG CTACTTCAAT GCCGGCCAGA CCTGCGTGGC 900 

CCCTGACTAC GTCCTGTGCA GCCCCGAGAT GCAGGAGAGG CTGCTGCCCG CCCTGCAGAG 960 

CACCATCACC CGTTTCTATG GCGACGACCC CCAGAGCTCC CCAAACCTGG GCCGCATCAT 1020 

CAACCAGAAA CAGTTCCAGC GGCTGOGGGC ATTGCTGGGC TGCGGCCGCG TGGCCATTGG 1080 

GGGCCAGAGC AACGAGAGCG ATCGCTACAT CGCCCCCACG GTGCTGGTGG AOGTGCAGGA 1140 

GACGGAGCCT "GTGATGCAGG AGGAGATCTT OGGGCCCATC CTGCCCATCG TGAACGTGCA 1200 

GAGOGTGGAC GAGGCCATCA AGTTCATCAA CCGGCAGGAG AAGCCCCTGG CCCTGTACGC 1260 

CTTCTCCAAC AGCAGACAGG TTGTGAACCA GATGCTGGAG CGGACCAGCA GCGGCAGCTT 1320 

TGGAGGCAAT GAGGGCTTCA CCTACATATC TCTGCTGTCC GTGCCATTCG GGGGAGTCGG 1380 

CCACAGTGGG ATGGGCCGGT ACCACGGCAA GTTCACCTTC GACACCTTCT CCCACCACCG 1440 

CACCTGCCTG CTCGCCCCCT CCGGCCTGGA GAAATTAAAG GAGATCCGCT ACCCACCCTA 1500 

TACCGACTGG AACCAGCAGC TGTTACGCTG GGGCATGGGC TCCCAGAGCT GCACCCTCCT 1560 

GTGAGCGTCC CACCCGCCTC CAACGGGTCA CACAGAGAAA CCTGAGTCTA GCCATGAGGG 1620 

GCTTATGCTC CCAACTCACA TTGTTCCTCC AGACCGCAGG CTCCCCCAGC CTCAGGTTGC 1680 

TGGAGCTGTC ACATGACTGC ATCCTGCCTG CCAGGGCTGC AAAGCAAGGT CTTGCTTCTA 1740 

TCTGGGGGAC GCTGCTCGAG AGAGGCCGAG AGGCCGCAGA ACATGCCAGG TGTCCTCACT 1800 

CACCCCACCC TCCCCAATTC CAGCCCTTTG CCCTCTCGGT CAGGGTTGGC CAGGCCCAGT 1860 

CACAGGGGCA GTGTCACCCT GGAAAATACA GTGCCCTGCC TTCTTAGGGG CATCAGCCCT 1920 

GAACGGTTGA GAGCGTGGAG CCCTCCAGGC CTTTGCTCTC CCCTCTAGGC ACACGCGCAC 1980 

TTCCACCTCT GCCCCATCCC AACTGCACCA GCACTGCCTC CCCCAGGGAT CCTCTCACAT 2040 

CCCACACTGG TCTCTGCACC ACCCCTCTGG TTCACACCGC ACCCTGCACT CACCCACAGC 2100 

AGCTCCATCC ACTGGGAAAA CTGGGGTTTG CATCACTCCA CTGCACAGTG TTAGTGGGAC 2160 

CTGGGGGCAA GTCCCTTGAC TTCTCTGAGC CTCAGTTTCC TTATGTGAAA GTTGCTGGAA 2220 

CCAAAATGGA GTCACTTATG CCAAACTCTA ATAAAATGGA GTCGGGGGGG CACATAGAAG 2280 

CCCTCACACA CACATGCCCG TAACAGGATT TATCACCAAG ACACGCCTGC ATGTAAGACC 2340 

AGACACAGGG CGTATGGAAA AGCACGTCCT CAAAGACTGT AGTATTCCAG ATGAGCTGCA 2400 

GATGCTTACC TACCACGGCC GTCTCCACCA GAAAACCATC GCCAACTCCT GCGATCAGCT 2460 

TGTGACTTAC AAACCTTGTT TAAAAGCTGC TTACATGGAC TTCTGTCCTT TAAAAOGTTC 2520 

CCCTTGGCTG TGGCCCTCTG TGTATGCCTG GGATCCTTCC AAGCACTCAT AGCCCAGATA 2580 
GGAATCCTCT GCTCCTCCCA AATAAATTCA TCTGTTC 



seq ID NO: 111 Protein sequence: 
protein Accession ft: NP_000S86 

1 11 21 31 

I I I I 

MKDEPRSTNL FMKLDSVPIW KEPFGLVLII APWNYPUJLT 
ISQGTEKVLA EVLPQYLDQS CFAWLGGPQ ETGQLLEHKL 
KHLTPVTLKL GGKNPCYVDD NCDPQTVANR VAWPCYFNAG 
PALQSTITRF YGDDPQSSPN LGRI INQKQF QRLRALLGCG 
VDVQETEPVM QEEIFGPILP IVNVQSVDEA IKFINRQEKP 
SSGSFGGNEG PTYISLLSVP FGGVGHSGMG RYHGKFTFDT 
RYPPYTDWNQ QLLRWGMGSQ SCTLL 

Seq ID NO: 112 DNA sequence 
Nucleic Acid Accession #: NM_004456 
Coding sequence: 58-2298 

1 11 21 31 41 51 

I I I I I I 

GAATTCCGGG CGACGCGCGG GAACAACGCG AGTCGGCGCG CGGGACGAAG AATAATCATG 60 

GGCCAGACTG GGAAGAAATC TGAGAAGGGA CCAGTTTGTT GGCGGAAGCG TGTAAAATCA 120 

GAGTACATGC GACTGAGACA GCTCAAGAGG TTCAGACGAG CTGATGAAGT AAAGAGTATG 180 

TTTAGTTCCA ATCGTCAGAA AATTTTGGAA AGAACGGAAA TCTTAAACCA AGAATGGAAA 240 

CAGCGAAGGA TACAGCCTGT GCACATCCTG ACTTCTGTGA GCTCATTGCG CGGGACTAGG 300 

GAGTGTTCGG TGACCAGTGA CTTGGATTTT CCAACACAAG TCATCCCATT AAA GACTC TG 360 

AATGCAGTTG CTTCAGTACC CATAATGTAT TCTTGGTCTC CCCTACAGCA GAATTTTATG 420 

GTGGAAGATG AAACTGTTTT ACATAACATT CCTTATATGG GAGATGAAGT TTTAGATCAG 480 

GATGGTACTT TCATTGAAGA ACTAATAAAA AATTATGATG GGAAAGTACA CGGGGATAGA 540 

GAATGTGGGT TTATAAATGA TGAAATTTTT GTGGAGTTGG TGAATGGCCT TGGTCAATAT 600 

AATGATGATG ACGATGATGA TGATGGAGAC GATCCTGAAG AAAGAGAAGA AAAGCAGAAA 660 

GATCTGGAGG ATCACCGAGA TGATAAAGAA AGCCGCCCAC CTCGGAAATT TCCTTCTGAT 720 

AAAATTTTGG AGGCCATTTC CTCAATGTTT CCAGATAAGG GCACAGCAGA AGAACTAAAG 780 

GAAAAATATA AAGAACTCAC CGAACAGCAG CTCCCAGGCG CACTTCCTCC TGAATGTACC 840 

CCCAACATAG ATGGACCAAA TGCTAAATCT GTTCAGAGAG AGCAAAGCTT ACAC TCCTTT 900 

CATAOGCTTT TCTGTAGGCG ATGTTTTAAA TATGACTGCT TCCTACATCC TTTTCATGCA 960 

ACACCCAACA CTTATAAGCG GAAGAACACA GAAACAGCTC TAGACAACAA ACCTTGTGGA 1020 

CCACAGTGTT ACCAGCATTT GGAGGGAGCA AAGGAGTTTG CTGCTGCTCT CACCGCTGAG 1080 

CGGATAAAGA CCCCACCAAA ACGTCCAGGA GGCCGCAGAA GAGGACGGCT TCCCAATAAC 1140 

1 AGTAGCAGGC CCAGCACCCC CACCATTAAT GTGCTGGAAT CAAAGGATAC AGACAGTGAT 1200 

AGGGAAGCAG GGACTGAAAC GGGGGGAGAG AACAATGATA AAGAAGAAGA AGAGAAGAAA 1260 

GATGAAACTT CGAGCTCCTC TGAAGCAAAT TCTCGGTGTC AAACACCAAT AAAGATGAAG 1320 

CCAAATATTG AACCTCCTGA GAATGTGGAG TGGAGTGGTG CTGAAGCCTC AATGTTTAGA 1380 

GTCCTCATTG GCACTTACTA TGACAATTTC TGTGCCATTG CTAGGTTAAT TGGGACCAAA 1440 

ACATGTAGAC AGGTGTATGA GTTTAGAGTC AAAGAATCTA GCATCATAGC TCCAGCTCCC 1500 

GCTGAGGATG TGGATACTCC TCCAAGGAAA AAGAAGAGGA AACACCGGTT GTGGGCTGCA 1560 

CACTGCAGAA AGATACAGCT GAAAAAGGAC GGCTCCTCTA ACCATGTTTA CAAGTATCAA 1620 



41 51 
I I 

LVLLVGTLPA GNCWLKPSE 60 

DYIPFTGSPR VGKIVMTAAT 120 

QTCVAPDYVL CSPEMQERLL 180 

RVAIGGQSNE SDRYIAPTVL 240 

LALYAFSNSR QWNQMLERT 300 

FSHHRTCLLA PSGLEKLKEI 360 



231 
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CCCTGTGATC ATCCACGGCA GCCTTGTGAC AGTTCGTGCC CTTGTGTGAT AGCACAAAAT 1*680 
TTTTGTGAAA AGTTTTGTCA ATGTAGTTCA GAGTGTCAAA ACCGCTTTCC GGGATGCCGC .1740 
TGCAAAGCAC AGTGCAACAC CAAGCAGTGC CCGTGCTACC TGGCTGTCCG AGAGTGTGAC • 1800 

CCT6ACCTCT GTCTTACTTG TGGAGCCGCT GACCATTGGG ACAGTAAAAA TGTGTCCTGC 1860 

5 AAGAACTGCA GTATTCAGCG GGGCTCCAAA AAGCATCTAT TGCTGGCACC ATCTGACGTG 1920 

GCAGGCTGGG GGATTTTTAT CAAAGATCCT GTGCAGAAAA ATGAATTCAT CTCAGAATAC 1980 

TGTGGAGAGA TTATTTCTCA AGATGAAGCT GACAGAAGAG GGAAAGTGTA TGATAAATAC 2040 

ATGTGCAGCT TTCTGTTCAA CTTGAACAAT GATTTTGTGG TGGATGCAAC CCGCAAGGGT 2100 

AACAAAATTC GTTTTGCAAA TCATTCGGTA AATCCAAACT GCTATGCAAA AGTTATGATG 2160 

10 GTTAACGGTG ATCACAGGAT AGGTATTTTT GCCAAGAGAG CCATCCAGAC TGGCGAAGAG 2220. 

CTGTTTGTTG ATTACAGATA CAGCCAGGCT GATGCCCTGA AGTATGTCGG CATC GAAA GA 2280 

GAAATGGAAA TCCCTTGACA" TCTGCTACCT CCTCCCCCTC CTCTGAAACA GCTGCCTTAG 2340 

CTTCAGGAAC CTCGAGTACT GTGGGCAATT TAGAAAAAGA ACATGCAGTT T GAAATT CTG 2400 

AATTTGCAAA' GTACTGTAAG AATAATTTAT AGTAATGAGT TTAAAAATCA ACTTTTTATT 2460 

15 GCCTTCTCAC CAGCTGCAAA GTGTTTTGTA CCAGTGAATT TTTGCAATAA TGCAGTATGG 2520 
TACATTTTTC AACTTTGAAT AAAGAATACT TGAACTTGAA AAAAAAAAAA AAAAAA 



PCT/US02/12476 



20 
25 
30 
35 
40 
45 
50 
55 
60 
65 



Seq ID NO: 113 Protein .sequence : 
Protein Accession ft: NP_004447 



I 

MGQTGKKSEK 
KQRRIQPVHI 
MVEDETVLHN 
YNDDDDDDDG 
KEKYKELTEQ 
ATPNTYKRKN 
NSSRPSTPTI 
KPNIEPPENV 
PAEDVDTPPR 
NFCEKFCQCS 
CKNCSIQRGS 
YMCSFLFNLN 
ELFVDYRYSQ 



11 
I 

GPVCWRKRVK 
LTSVSSLRGT 
IPYMGDEVLD 
DDPEEREEKQ 
QLPGALPPEC 
TETAliDNKPC 
NVLESKDTDS 
EWSGAEASMP 
KKKRKHRLWA 
SECQNRFPGC 
KKHLLLAPSD 
NDFWDATRK 
ADALKYVGIB 



21 
I 

SEYMRLRQLK 
RECSVTSDLD 
QDGTFIEELI 



TPNIDGPNAK 
GPQCYQHLEG 
DREAGTETGG 
RVLIGTYYDN 
AHCRKIQLKK 
RCKAQCNTKQ 
VAGWGIFIKD 
GNKIRFANHS 
REMEIP 



31 

I 

RFRRADEVKS 
FPTQVIPLKT 
KNYDGKVHGD 
ESRPPRKFPS 
SVQREQSLHS 
AKEPAAALTA 
ENNDKEEEEK 
PCAIARLIGT 
DGSSNHVYNY 
CPCYLAVREC 
PVQKNEPISE 
VNPNCYAKVM 



41 
I 

MFSSNRQKIL 
LNAVASVPIM 
RECGP INDEX 
DKILEAISSM 
FHTLPCRRCF 
ERIKTPPKRP 
KDETSSSSEA 
KTCRQVYEFR 
QPCDHPRQPC 
DPDLCLTCGA 
YCGEIISQDE 
MVNGDHRIGI 



51 
I 

ERTEILNQEW 
YSWSPLQQNF 
PVELVNALGQ 
FPDRGTAEEL 
KYDCPLHPFH 
GGRRRGRLPN 
NSRCQTPIKM 
VKESSIIAPA 
DSSCPCVIAQ 
ADHWDSKNVS 
ADRRGKVYDK 
FAKRAIQTGE 



Seq ID NOi 114 DNA sequence 
Nucleic Acid Accession ft: NM_001827 
Coding sequence: 96-335 



1 
I 

AGTCTCCGGC 
CGCTCTCGTT 
CGGACAAGTA 
CCAAACAAGT 
AACAGAGTCT 
TTAGACGACC 
TTTTCAAATT 
ACAAATCTTT 
AAATGCAACT 
TTTCTCTTAA 
TATGTTGCAT 



11 

i 

GAGTTGTTGC 
TCATTTTCTG 
CTTCGACGAA 
ACCTAAAACT 
AGGCTGGGTT 
TCTTCCAAAA 
TAATGTATAT 
CATCCATACC 
GCAAGTAGGT 
GTGCCTGTTT 
TTAAAAAAAA 



21 
I 

CTGGGCTGGA 
CAGCGCGCCA 
CACTACGAGT 
CATCTGATGT 
CATTACATGA 
GATCAACAAA 
GTGTATATAA 
TGTGCATGAG 
TACTGTAAGA 
GAGTTTACTG 
AAAAAAA 



31 
I 

CGTGGTTTTG 
CGAGGATGGC 
ACCGGCATGT 
CTGAAGAGGA 
TTCATGAGCC 
AATGAAGTTT 
GGTAGTATTC 
CTGTATTCTT 
TGTTTAAGAT 
AAACAGTTTA 



41 
I 

TCTGCTGCGC 
CCACAAGCAG 
TATGTTACCC 
GTGGAGGAGA 
AGAACCACAT 
ATCTGGGGAT 
AGTGAATACT 
CACAGCAACA 
AAAAGTTCTT 
CTTTTGTTCA 



51 
I 

CCGCTCTTCG 
ATCTACTACT 
AGAGAACTTT 
CTTGGTGTCC 
ATTCTTCTCT 
CGTCAAATCT 
TGAGAAATGT 
GAGCTCAGTT 
CCAGTCAGTT 
ATAAAGTTTG 



Seq ID NO: 115 Protein sequence: 
Protein Accession ft: NP_00181B 

1 11 



51 



21 31 41 

111 I I I 

MAHKQIYYSD KYFDEHYEYR HVMLPRELSK QVPKTHLMSE EEWRRLGVQQ SLGWVHYMIH 
EPEPHILLFR RPLPKDQQK 

Seq ID NO: 116 DNA sequence 

Nucleic Acid Accession ft : CAT cluster 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 
1Q0 
240 
300 
360 
420 
480 
540 
600 



60 



70 
75 
80 
85 



TCAGACCTCA 
GCATCTGGAC 
AGAGGTGTGT 
GCAGCCAACA 
AAGGACTGAT 
CCTTGGAAGG 
TCAAGAATTC " 
AGTGTGGGAG 
CCCATCTCTA 
CCTGTAGTTC 
AGGCTGCAAT 
CCTGTCTCAA 
TTTGAGGTGC 
CCTGAAGGAG 
GACAGACCTT 
CTCTTCCCCC 
CCGCCTCCCA 
CAGCOGGATA 
TCCTGCTCCG 



11 
I 

TGAGTCACTT 
CCTTGGTGCT 
TCCAGGGAAA 
GAGTTCAAAA 
CCACATTCCC 
ACCTGGCTCA 
TTTGCTGAGC 
GATCTCTTGA 
AAATAATAAT 
CAGCTACCCA 
GAACTGTGAT 
ATAATAATAA 
CATTTGGGTA 
CAGAGGGATG 
GTCCTTCTTC 
TCCCTGTCCC 
TGTCTGCTGT 
CAGAGTGAAT 
TGTAAAGAGG 



21 
I 

GGACTCTTGA 
ATCGACGAAG 
GCCCCTATCT 
TGCAGGCTTG 
ACCAGGAAGT 
GGCTGGACCA 
ATGGTGCCTC 
GCCCAGGAGT 
AATAATAAAA 
GGAGGCTGAG 
TACCCCACTG 
TAATAATAAT 
GAAAGAAAAG 
CATCGCTGGA 
CTTGTGGAAA 
AGGGAACCAA 
GCCTTTGTAC 
AGTTAACCAC 
CCAGTGTTTG 
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CTTCTCATTT GTTTGGGATC TGGCCACCAA . GTTCCAGAAT GATACACGGA TCAGTGCAGA 1200- 

• AGTTCATCAG- GCTCTCGGAC CTTAGGGCTG TTGGAGAAGG CTTCAGCAGC AGAACTGATG 1260 f 

GTGAAGGCTC GTGTTCTCCA TCCTCAACTT TCTTTGCTTC GATCATACAC AAGAATACAT 1320 

TTGGAAGGGC AAAAAATGAA CACTGTCGTT CATTGCAGCC GTGTTTTGTG ACACAGATGC 1380. 

ACAGTCTGCT GTGAAGACCT TCTCTCAAGT " GGCATTTGGG AGTCCATGCC AGATCATGGT .1440 

GCTTCATGAG AGACTGACAG CTATCAGGGG TTGTGGCACT TAGTGAGGAC TCTCCTCCCC 1500; 

CAGTGTGTGC TGATGACACA TAGACACCTG ACAATAGCTT GAGTCTTCTC TGTTCCTTTT 1560 

ACTCTGTAGC CAACATACAC v ATGATTTAAA " ACCCTTTCTA AATATCTATC ATGGTTCATC 1620 

CTTGTCCAAA TGCAGAGTCA GAGCTATTTG TACTTCATTA TTATTTCCAA GGCGAATAGT 1680. 

TGGCTTTCTT TTTGCAAAAA TAATTAAAGT TTTTGTATGT TGCAAAAAAA AAAAAAAAAA 1740 

"aaacaaaaaa_ a 



Seq ID NO : 117 DNA sequence 

Nucleic Acid . Accession, #: BC01217B.1 

Coding sequence: 204-2285 

1 11 21 31 41 51 

I i I I I I 

CTTCTCTCCC GCGGCGCTGG GGCCCGCGCT CCGCTGCTGT TGCTCCATTC GGCGC TTTT C 60 

TGGCGGCTGG CTCCTCTCCG CTGCCGGCTG CTCCTCGACC AGGCCTCCTT CTCAACCTCA 120 

GCCCGCGGCG CCGACCCTTC CGGCACCCTC CCGCCCCGTC TCGTACTGTC GCCGTCACCG 180 

CCGCGGCTCC GGCCCTGGCC CCGATGGCTC TGTGCAACGG AGACTCCAAG CTGGAGAATG 240 

CTGGAGGAGA CCTTAAGGAT GGCCACCACC ACTATGAAGG AGCTGTTGTC ATTCTGGATG 300 

CTGGTGCTCA GTACGGGAAA GTCATAGACC GAAGAGTGAG GGAACTGTTC GTGCAGTCTG 360 

AAATTTTCCC CTTGGAAACA CCAGCATTTG CTATAAAGGA ACAAGGATTC CGTGCTATTA 420 

TCATCTCTGG AGGACCTAAT TCTGTGTATG CTGAAGATGC TCCCTGGTTT GATCCAGCAA 480- 

TATTCACTAT TGGCAAGCCT GTTCTTGGAA TTTGCTATGG TATGCAGATG ATGAATAAGG 540 

TATTTGGAGG TACTGTGCAC AAAAAAAGTG TCAGAGAAGA TGGAGTTTTC AACATTAGTG 600 

TGGATAATAC ATGTTCATTA TTCAGGGGCC TTCAGAAGGA AGAAGTTGTT TTGCTTACAC 660 

ATGGAGATAG • TGTAGACAAA GTAGCTGATG GATTCAAGGT TGTGGCACGT TCTGGAAACA 720 

TAGTAGCAGG CATAGCAAAT GAATCTAAAA AGTTATATGG AGCACAGTTC CACCCTGAAG 780 

TTGGCCTTAC AGAAAATGGA AAAGTAATAC TGAAGAATTT CCTTTATGAT ATAGCTGGAT 840 

GCAGTGGAAC CTTCACCGTG CAGAACAGAG AACTTGAGTG TATTCGAGAG ATCAAAGAGA 900 

GAGTAGGCAC GTCAAAAGTT TTGGTTTTAC TCAGTGGTGG AGTAGACTCA ACAGTTTGTA 960 

CAGCTTTGCT AAATCGTGCT TTGAACCAAG AACAAGTCAT TGCTGTGCAC ATTGATAATG 1020 

GCTTTATGAG AAAACGAGAA AGCCAGTCTG TTGAAGAGGC CCTCAAAAAG CTTGGAATTC 1060 

AGGTCAAAGT GATAAATGCT GCTCATTCTT TCTACAATGG AACAACAACC CTACCAATAT 1140 

CAGATGAAGA TAGAACCCCA CGGAAAAGAA TTAGCAAAAC GTTAAATATG ACCACAAGTC • 1200 

CTGAAGAGAA AAGAAAAATC ATTGGGGATA CTTTTGTTAA GATTGCCAAT GAAGTAATTG 1260 

GAGAAATGAA CTTGAAACCA GAGGAGGTTT TCCTTGCCCA AGGTACTTTA CGGCCTGATC 1320 

TAATTGAAAG TGCATCCCTT GTTGCAAGTG GCAAAGCTGA ACTCATCAAA ACCCATCACA 1380 

ATGACACAGA GCTCATCAGA AAGTTGAGAG AGGAGGGAAA AGTAATAGAA CCTCTGAAAG 1440 

ATTTTCATAA AGATGAAGTG AGAATTTTGG GCAGAGAACT TGGACTTCCA GAAGAGTTAG 1500 

TTTCCAGGCA TCCATTTCCA GGTCCTGGCC TGGCAATCAG AGTAATATGT GCTGAAGAAC 1560 

CTTATATTTG TAAGGACTTT CCTGAAACCA ACAATATTTT GAAAATAGTA GCTGATTTTT 1620 . 

CTGCAAGTGT TAAAAAGCCA CATACCCTAT TACAGAGAGT CAAAGCCTGC ACAACAGAAG 1680 

AGGATCAGGA GAAGCTGATG CAAATTACCA GTCTGCATTC ACTGAATGCC TTCTTGCTGC 1740 

CAATTAAAAC TGTAGGTGTG CAGGGTGACT GTCGTTCCTA CAGTTACGTG TGTGGAATCT 1800 

CCAGTAAAGA TGAACCTGAC TGGGAATCAC TTATTTTTCT GGCTAGGCTT ATACCTCGCA 1860 

TGTGTCACAA CGTTAACAGA GTTGTTTATA TATTTGGCCC ACCAGTTAAA GAACCTCCTA 1920 

CAGATGTTAC TCCCACTTTC TTGACAACAG GGGTGCTCAG TACTTTACGC CAAGCTGATT 1980 

TTGAGGCCCA TAACATTCTC AGGGAGTCTG GGTATGCTGG GAAAATCAGC CAGATGCCGG 2040 

TGATTTTGAC ACCATTACAT TTTGATCGGG ACCCACTTCA AAAGCAGCCT TCATGCCAGA 2100 

GATCTGTGGT TATTCGAACC TTTATTACTA GTGACTTCAT GACTGGTATA CCTGCAACAC 2160 

CTGGCAATGA GATCCCTGTA GAGGTGGTAT TAAAGATGGT CACTGAGATT AAGAAGATTC 2220 

CTGGTATTTC TCGAATTATG TATGACTTAA CATCAAAGCC CCCAGGAACT ACTGAGTGGG 2280 
AGTAATAAAC TTCTTGTTCT ATTAAAA 



Seq ID NO: 118 Protein sequence: 
Protein Accession #: AAH12178.1 

1 11 21 31 41 51 

I I I I I I 

MALCNGDSKL ENAGGDLKDG HHHYBGAWI LDAGAQYGKV IDRRVRELFV QSEIFPLBTP 60 

APAIKEQGFR AIIISGGPNS VYAEDAPWFD PAIFTIGKPV LGICYGMQMM NKVFGGTVHK 120 

KSVREDGVFN ISVDNTCSLF RGLQKEBWL LTHGDSVDKV ADGFKWARS GNIVAGIANB 180 

SKKLYGAQFH PEVGLTENGK VILKNFI>YDI AGCSGTFTVQ NRELECIREI KERVGTSKVL 240 

VLLSGGVDST VCTALLNRAL NQEQVIAVHI DNGFMRKRES QSVEEALKKL GIQVKVINAA 300 

HSFYNGTTTL PISDEDRTPR KRISKTLNMT TSPBEKRKII GDTPVKIANE VIGEMNLKPE 360 

EVFLAQGTLR ;PDLIESASLV ASGKASLIKT HHNDTBLIRK LREEGKVIEP LKDFHKDBVR 420 

ILGRELGLPE . ELVSRHPFPG PGLAIRVICA EEPYICKDFP ETNNILKIVA DFSASVKKPH 480 

TLLQRVKACT; TEEDQEKLMQ ITSLHSLNAF LLPIKTVGVQ GDCRSYSYVC GISSKDEPDW 540 

ESLIFLARLI -PRMCHNVNRV VYIFGPPVKE PPTDVTPTFL TTGVLSTIiRQ ADFEAHNILR 600 ' 

ESGYAGKISQ MPVILTPLHF DRDPLQKQPS CQRSWIRTF ITSDFMTGIP ATPGNEIPVE 660 
WLKMVTEIK KIPGISRIMY DLTSKPPGTT EWE 

Seq ID NO: 119; DNA sequence 
Nucleic Acid Accession #: NM_006500.l 
■Coding sequence: 27. ..1967, 

1 . . 11 . ' .21 • 31. 41 51 

i . :j ' .I . I I I ■ 

ACTTGCGTCT CGCCCTCCGG CCAAGCATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC - 60 
TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCAGCCTG 120 
CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180 
AGTCCCAAGG CAACCTCAGC CATGTCGACT G GTTTT CTGT CCACAAGGAG AAGOGGACGC . 240 
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TCATCTTCCG TGTGCGCCAG . GGCCAGGGCC AGAGCGAACC TGGGGAGTAC .GAGCAGCGGC *- 300 

TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC _ * ' 360 

.GCATCTTCTT. GTGCCAGGGC AAGCGCCCTC GGTCCCAGGA GTACOGCATC CAGCTCCGCG " 420 

TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 480 

" GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA OGGGTACCCC ATTCCTCAAG 540 

TCATCTGGTA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 600 

CGTCCC3VGAC. TGTGGAGTCG AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCACAGC ' 660 

TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG ] 720 

. GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 

TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG GGACCGCGTG GAAATCAGGT - 840 

GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAGCAGAAC CCCAGCACCA 900 

GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960 

AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 1020 

TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 1080 

CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140 

ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200 

TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 1260 

CCAGCATACC . CGGCCTGAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 

GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380 

GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 1440 

AACAAGACCA' AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 

TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 

TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA' ACCACTGGCC 1620 

TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 16 BO 

TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740. 

TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 

GCTCAGGGAA GCAGGAGATC AGGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 

TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 

GGGCTCCGGG AGACCAGGGA GAGAAATACA TOGATCTGAG GCATTAGCCC CGAATCACTT 1980 

CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC T CAGC CAAAG 2040 

CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 2220 

CCGAGCGGGT. AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 

AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 

GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460 

AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2520 

ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 25B0 

GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 2640 

TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 

TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 

CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 

CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 

AOGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 

TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 

GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3060 

TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 • 

AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 3240 

TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300 

AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3360 

AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 

AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAGACAG 3480 

CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA GCACGAAGGG CCTGGCAGGC 3540 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG tttccgtcca ctt 



Seq ID NO: 120 Protein sequence: 
Protein Accession 8: NPJJ06491.1 



21 31 41 51 

I I II" 

VAGVPGEABQ PAPELVEVEV GSTALLKCGL SQSQGNLSHV 60 

GQSEPGEYEQ RLSLQDRGAT LALTQVTPQD BRIFLCQGKR 120 

IQVNPLGIPV NSKEPEEVAT CVGRNGYPIP QVIWYKNGRP 180 

LYTLQSILKA QLVKEDKDAQ PYCELNYRLP SGNHMKESRE 240 

MLKEGDRVEI RCLADGNPPP HPSISKQNPS TREAEEETTN 300 

QAWNLDTMIS LLSEPQELLV NYVSDVRVSP AAPERQEGSS "360 

ETDQVLERGP VLQLHDLKRE AGGGYRCVAS VPSIPGLNRT 420 

WVKENMVLNL SCEASGHPRP TISWNVNGTA SEQDQDPQRV 480 • 

ASNDLGKNTS ILFLELVNLT TLTPDSNTTT GLSTSTASPH 540 

VIVAVIVCIL VLAVLGAVLY FLYKKGKLPC RRSGKQEITL ■ 600 
MGLLQGSSGD KRAPGDQGEK YIDLRH 

Seq ID NO: 121 DNA sequence 
Nucleic Acid Accession NM_018306 
Coding sequence: 60-671 

1 11 21 31 41 51 

I I I I I I 

ATAGTCTACA. CAGAGCTCCC CTTGCTGCCC AGACAAGCTG AAGGACCACA GGAAAAGCCA 60 

TGGAGACTTC, AGCATCCTCC TCCCAGCCTC AGGACAACAG TCAAGTCCAC AGAGAAACAG • 120 

AAGATGTAGA CTATGGAGAG ACAGATTTCC ACAAGCAAGA CGGGAAGGCT GGACTCTTTT ' 180 

CCCAAGAACA ATATGAGAGA AACAAGTCTT CTTCCTCCTC CTTCTCTTCC TCCTCATCCT -240 

CCTCATCTTC TTCATCCTCC TCCTCCTCAG GTCCTGGGCA TGGGGAGCCT GACGTTTTGA 300 



1 11 

I 1 
MGLPRLVCAF. LLAACCCCPR . 
DWFSVHKEKR TLIFRVRQGQ 
PRSQEYRIQL . RVYKAPEEPN 
LKEEKNRVHI QSSQTVESSG 
VTVPVFYPTE KVWLEVEPVG 
DNGVLVLEPA RKEHSGRYEC 
LTLTCEAESS QDLEFQWLRE 
QLVK1AIFGP PWMAFKERKV' 
LSTLNVLVTP ELLETGVECT 
TRANSTSTER . KLPEPESRGV 
PPSRKTELW EVKSDKLPEE 
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AGGATGAGCT TCAACTCTAT GGAGATGCTC. 
GACTCCGAAG GAGAGGCTCT GACCCAGCAA 
.GACTGAATAT . AAAGAAAGAT GATGAGTTTT 
GGGCCTTGCT GGTGTG TTAT CACTATTACG. 
TGCTCACCTT CGCCTCCCTG GAAACOGTTG 
ACAGCGTCCT CCAAGGCTTC ATCCCCCTCT 
AGACTGACTG AGGCCACTTC CAGGTGGGCA' 
GCGACCCCTG. AGCCCACAAG GGCAGAGCAG 
AGACCAATAA ACAGAACACT TTTCCTTCCA 
AGGGGCATCT CATTTGGGCA GTACTGCTGT 
GGTGGGTGTG GGGCCTGAGG CTTCACAGTA 
ACTGCTCTCA GAGGACAGCA AGGGACCCTG 
ATGGTTTTTC TCAAATCCCA GGCTATCTGC 
GGGAGAGATG GATGGTCCAC TGCTTTGAGG 
GAACCTGATG CAGGTAAGAT GCTGAGGACT 
GGCAGGAAAA TGATCATCAG AAACTAAATG 
TCCTCGCACT TTGGGAGGCT CAGGCTAAGG. 
CTGGGCAACA TAGTGAGACC CCCATCTCTA 
TACATACCTG TACATACCTG- CGGTTCCAGC 
TGAGCCCAGG AGTTCAGGGC TGCAGTGAGG 
CGACAGAGCA AGATCGTTTC TCTAAAATT 



Seq ID NO: 122 Protein sequence:. 
Protein Accession ft: NP_060776 



CTGGAGAGGT 
GTGGAGAAGT 
TCCATTTCGT 
CAGACTGGTT 
GCATCTACTT 
TCCAGAAGTT 
GCAGAGGCAG 
CATTCTGAGA 
TGTGGTCTGA 
GCAACCCAGC 
CCTGGACCAG 
AGCTCTGCAA 
ATGCGCTCTC 
CAGGGAGCCA 
AAAACCATTT 
GCAGCCAGGC 
GTCGCTTGAA 
CAATTTTTTT 
TACTCAAGAG 
TACGATCAAG 



GGTACCCTCT 
GGAGGCCTCT 
CCTCCTGTGC 
CATGTCTCTT 

cgga'ctagtg 
taggctgaca 
gccccagtgt 
gacgcacagg 
atgttggcac 

TGCAAGGATG 
CAGGAAGATT 
GCTGTGATCT 
AGGTGCTACC 
TCGGGCTGGG 
TTTTTGCACC 
ATGGGGGCTC 
GCTGAGAGTT 
TTAATGACCA 
GCTGAGGCAG 
CCACTGCACT 



GGGGAATCAG* 

CAGTTAAGAA 

TTTGCCATCG* 

GGGGTCGGCC; 

TACCGTATCC' 

GGGTTCAGGA 

GACCACCACT 

AGACCAAGCC 

CAGCCCGGGC 

GAAGGCAGAG 

CTGGGAGGTC 

GTCTGGGTTC 

GAGCCATCCT 

GCCCCTTGGT 

CAAAAAAAAA 

ACGACTGTAA 

CAAGACCAAC 

AATGTGGCGG. 

GAGGACTGCT 

CCAGCCTGGG. 
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METSASSSQP QDNSQVHRET EDVDYGETDP HKQDGKAGLP SQEQYEHNKS SSSSFSSSSS 

SSSSSSSSSS GPGHGEPDVL KDELQLYGDA PGEWPSGBS GLRRRGSDPA SGEVEASQtiR 

RLNIKiOJDEF'FHFVLLCPAI GADLVCYHYY ADWFMSLGVG LLTFASLETV GIYFGLVYRI 
HSVLQGFIPL FQKPRLTGPR KTD 

Seq ID NO: 123 DNA sequence 
Nucleic Acid Accession #: BC022542 
Coding . sequence : 243,-896 



360 
. 420 
480" 
540 
' 600 
660 
720 
u 780 
840 
900 
960 
1020 
1080- 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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ACTTGGrCCC AGCCGATAAA TCTGGGGCAG CGCGCGGTAG GAGCTGCGGG CGGCCAGGCC 
CCTTCCTGCG TCCGCACCTG GCCCCGCGCG CCCCTCTCGG GCGTCCGGCT TCCGGCGTCC 
TGGCGGCTCG GGTGGCGGCG GTTCGGGCGG CCGCCTGGCT GCTCCTCGGG GCGGCGACGG 
GGCTCACGCG CGGGCCCGCC ACGGCCTTCA CCGCCGCGCG CTCTGACGCC GGCATAAGGG 
CCATGTGTTC TGAAATTATT TTGAGGCAAG AAGTTTTGAA AGATGGTTTC CACAGAGACC 
TTTTAATCAA AGTGAAGTTT GGGGAAAGCA TTGAGGACTT GCACACGTGC CGTCTCTTAA 
TTAAACAGGA CATTCCTGCA GGACTTTATG TGGATCCGTA TGAGTTGGCT TCATTACGAG 
AGAGAAACAT AACAGAGGCA GTGATGGTTT CAGAAAATTT TGATATAGAG GCCCCTAACT 
ATTTGrCCAA GGAGTCTGAA GTTCTCATTT ATGCCAGACG AGATTCACAG TGCATTGACT 
GTTTTCAAGC CTTTTTGCCT GTGCACTGCC GCTATCATCG GCCGCACAGT GAAGATGGAG 
AAGCCTCGAT TGTGGTCAAT AACCCAGATT TGTTGATGTT TTGTGACCAA GAGTTCCCGA 
TTTTGAAATG CTGGGCTCAC TCAGAAGTGG CAGCCCCTTG TGCTTTGGAT AATGAGGATA 
TATGCCAATG GAACAAGATG AAGTATAAAT CAGTATATAA GAATGTGATT CTACAAGTTC 
CAGTGGGAGT GACTGTACAT ACCTCTCTAG TATGTTCTGT GACTCTGCTC ATTACAATCC 
TGTGCTCTAC ATTGATCCTT GTAGCAGTTT TCAAATATGG CCATTTTTCC CTATAAGTTT 
TATGTAGTTA AATGCTTCCT AGAAACCTAA ATAAGATCTA TTAATTTCTG ACGAGAGGTG 
TTCTTCTAGA ATTAATTACT TTTATCTTTT GTCTTCATTT GTGGCCAAAA TTATGTTTAC 
TAGAGGAAAT TTGGGATCAT TCTCAGCTAA TTCCAAAATG TAGTGCTCTA TTGCATGGAT 
CCTTGGTAAT CCTCAAGCAT CAGATGCCAT AAGGGGAAAC TTAATTCTGC TAAATTAATG 
TTTATTTTGT GAGAAGTGAC TTTATCTTCA TTTGGGGTAG AAAAATTATT TCTTTATGTA 
GTAGAGACAA ATTATTCTCA TTTTGCAAGT ACTTTCAATT TAAGCTACAA ATTGAGAAAA 
CCGTTATAAA TAAGAATAAA ATAGGCCAGG CACAGTGGCT CACACCTGTA ATCCCAGCAC 
TTTGGGAGGC CGAGGTGGGC GGATCACCAG AGGTCAAGAG TTTGAGACCA GCTTGGTGAA 
ACCCTGTCTC TACTAAAAAT ACAAAAGTTA GCTGGGGCTG GTGGTGGGCA TCTGTAGTCC 
CAGCTAATTG GAAGGGTGAG GCGGGAGGAT CGCTTGAACC TGGGAGGCGG AGGTTCCAGA 
GAGCCAAGAT CGCACCACTG CACTACAGCC TGGGCGACAG AACGAGACCC TGTCTCCAAA 
GGAAAAACAA AAAAGAAGAA TAAAATAATT TGGATGAAAA TCATGTTTAT TTAAATAGTA 
ATGTCATGAG ACTATTAAAG ATGTGCCAGA GTTTCAATGA AAATCATTAA AGTAGGACAG 
■ CTAAGAAATT AATATTAATA TAAAAATTAT TGATAATCTT AAATTATTGA TTATTCCTTA 
ACGCACTCCA TTCTCCTTTT ACATTTTATC ATGTTTCTTT TGAATATATG AATTGGCAAA 
GGACTTGATG AAACTGAGTA CTAAGATTTG GTACAGAGTA TGTCAGGAAG ACAACTCAGA 
TTGCCATTTT. AAATAAAGTT .GTACATGAAC AAAAAAAAAA AAAAAA 



Seq' ID NO: .124 Protein sequence: 
Protein Accession #: AAH22542 

1 11 21 31 41 51 

I I I I ' I I - 

MCSEIILRQE VLKDGFHRDL LIKVKFGESI EDLHTCRLLI KQDIPAGttYV DPYE1ASLRB 60 

RNITEAVMVS ENFDIEAPNY LSKESEVLIY ARRDSQCIDC FQAFLPVHCR YHRPHSEDGB 120 

ASIWNNPDL LMFCDQAGSR RMIRFRFDSF DKTIEFPILK CWAHSEVAAP CAIiENEDICQ 180 

WNKMKYKSVY KNVILQVPVG LTVHTSLVCS VTLLITILCS KKKKK 

Seq ID NO: 125 DNA sequence 

Nucleic Acid Accession #» NM_004994.1 

Coding sequence: *2p;.2143 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
. 900 
960 
1020' 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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AGACACCTCT- GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 60 

. GGGCTGCTGC. TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 120 

CCTGAGAACC- AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 180 

CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGGCCTGCGC TGCTGCTTCT 240 

CCAGAAGCAA CTGTCCCTGC CCGAGACCGG TGAGCTGGAT AGCGCCACGC TGAAGGCCAT, 300 

GCGAACCCCA CGGTGOGGGG TCCCAGACCT GGGCAGATTC CAAACCTTTG AGGGCGACCT 360 

CAAGTGGCAC CACCACAACA TCACCTATTG GATCCAAAAC TACTCGGAAG ACTTGCCGCG 420 

.. GGCGGTGATT GACGACGCCT TTGCCCGCGC CTTCGCACTG TGGAGCGCGG TGACGCCGCT " 480 

CACCTTCACT CGCGTGTACA GCCGGGAOGC AGACATOGTC ATCCAGTTTG GTGTCGOGGA 540 

GCACGGAGAC GGGTATCCCT TOGACGGGAA GGACGGGCTC CTGGCACACG CCTTTCCTCC 600 

TGGCCCCGGC ATTCAGGGAG ACGCCCATTT CGAOGATGAC GAGTTGTGGT CCCTGGGCAA 660 

GGGCGTCGTG GTTCCAACTC GGTTTGGAAA CGCAGATGGC GCGGCCTGCC ACTTCCCCTT 720 

CATCTTCGAG GGCCGCTCCT ACTCTGCCTG CACCACCGAC GGTCGCTCCG ACGGCTTGCC 780 

CTGGTGCAGT ACCACGGCCA ACTACGACAC CGACGACCGG TTTGGCTTCT GCCCCAGCGA 840 

GAGACTCTAC ACCCGGGACG GCAATGCTGA TGGGAAACCC TGCCAGTTTC CATTCATCTT 900 

CCAAGGCCAA TCCTACTCCG CCTGCACCAC GGAOGGTOGC TCCGACGGCT ACCGCTGGTG 960 

CGCCACCACC GCCAACTACG ACCGGGACAA GCTCTTCGGC TTCTGCCCGA CCCGAGCTGA 1020 

CTCGACGGTG ATGGGGGGCA ACTCGGOGGG GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080 

GGGTAAGGAG TACTOGACCT GTACCAGCGA GGGCCGCGGA GATGGGCGCC TCTGGTGCGC 1140 

TACCACCTCG AACTTTGACA GCGACAAGAA GTGGGGCTTC TGCCCGGACC AAGGATACAG 1200 

TTTGTTCCTC GTGGCGGCGC ATGAGTTCGG CCACGCGCTG GGCTTAGATC ATTCCTCAGT 1260 

GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG GGGCCCCCCT TGCATAAGGA 1320 

CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT GAACCTGAGC CACGGCCTCC 1380 

AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC TGCCCCACCG GACCCCCCAC 1440 

TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT CCCCCCTCAG CTGGCCCCAC 1500 

AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG CCTTTGAGTC CGGTGGACGA 1560 

TGCCTGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620 

CAAGGATGGG AAGTACTGGC GATTCTCTGA GGGCAGGGGG AGCCGGCCGC AGGGCCCCTT 1680 

CCTTATCGCC GACAAGTGGC CCGCGCTGCC CCGCAAGCTG GACTOGGTCT TTGAGGAGCC 1740 

GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG TGGGTGTACA CAGGCGCGTC 1800 

GGTGCTGGGC CCGAGGCGTC TGGACAAGCT GGGCCTGGGA GCOGACGTGG CCCAGGTGAC 186.0 

CGGGGCCCTC CGGAGTGGCA GGGGGAAGAT GCTGCTGTTC AGCGGGCGGC GCCTCTGGAG 1920 

GTTCGACGTG AAGGCGCAGA TGGTGGATCC CCGGAGCGCC AGOGAGGTGG ACCGGATGTT 1980 

CCCCGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC CGAGAGAAAG CCTATTTCTG' 2040 

CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG TTGAACCAGG TGGACCAAGT 2100 

GGGCTACGTG ACCTATGACA TCCTGCAGTG CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160 

GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA AGGAGOCAGT TTGCCGGATA 2220 

CAAACTGGTA TTCTGTTCTG GAGGAAAGGG AGGAGTGGAG GTGGGCTGGG CCCTCTCTTC 2280 
TCACCTTTGT . TTTTTGTTGG AGTGTTTCTA ATAAACTTGG ATTCTCTAAC CTTT 

Seg ID NO* 126. Protein sequence: 
protein Accession #: NP_004985.1 

1 11 21 31 41 51 

} I I I I I . 

MSLWQPLVLV LLVLGCCFAA PRQRQSTLVL FPGDLRTNLT DRQLAEEYLY RYGYTRVAEM 60 

RGESKSLGPA LLLLQKQLSL PETGELDSAT LKAMRTPRCG VPDLGRFQTF EGDLKWHHHN 120 

ITYWIQNYSE DLPRAVIDDA FARAFALWSA VTPLTPTRVY SRDADIVIQF GVAEHGDGYP 180 

FDGKDGLLAH AFPPGPGIQG DAHFDDDELW SLGKGVWPT RFGNADGAAC HFPFIFEGRS 240 

YSACTTDGRS DGIiPWCSTTA NYDTDDRFGF CPSERLYTRD GNADGKPCQF PFIFQGQSYS 300 

ACTTDGRSDG YRWCATTANY DRDKLFGFCP TRADSTVMGG NSAGKLCVFP FTFLGKEYST 360 

CTSEGRGDGR LWCATTSNFD SDKKWGFCPD QGYSLFLVAA HEFGHALGLD HSSVPEALMY 420 

PMYRFTEGPP LHKDDVNGIR HLYGPRPEPE PRPPTTTTPQ PTAPPTVCPT GPPTVHPSER 480 

PTAGPTGPPS AGPTGPPTAG PSTATTVPLS PVDDACNVNI FDAIAEIGNQ LYLFKDGKYW 540 

RFSEGRGSRP QGPFLIADKW PALPRKLDSV FKEPLSKKLF FFSGRQVWVY TGASVLGPRR 600 

LDKLGLGADV AQVTGALRSG RGKMLLFSGR RLWRFDVKAQ MVDPRSASEV DRMFPGVPLD .660 
THDVFQYREK AYFCQDRFYW RVSSRSELNQ VDQVGYVTYD ILQCPED 

Seq ID NO: 127 DNA sequence 
Nucleic Acid Accession #j NM_004181 
Coding sequence: 32-670 

I 11 21 31 41 51 

II i I I I 

GCAGAAATAG CCTAGGGAGA TCAACCCCGA GATGCTGAAC AAAGTGCTGT CCCGGCTGGG 60 

GGTCGCCGGC CAGTGGCGCT TCGTGGACGT GCTGGGGCTG GAAGAGGAGT CTCTGGGCTC 120 

GGTGCCAGCG CCTGCCTGCG CGCTGCTGCT GCTGTTTCCC CTCACGGCCC AGCATGAGAA 180 

CTTCAGGAAA AAGCAGATTG AAGAGCTGAA GGGACAAGAA GTTAGTCCTA AAGTGTACTT 240 

CATGAAGCAG ACCATTGGGA ATTCCTGTGG CACAATCGGA CTTATTCACG CAGTGGCCAA 300 

TAATCAAGAC AAACTGGGAT TTGAGGATGG ATCAGTTCTG AAACAGTTTC TTTCTGAAAC 360 

AGAGAAAATG TCCCCTGAAG ACAGAGCAAA ATGCTTTGAA AAGAATGAGG CCATACAGGC 420 

AGCCCATGAT GCCGTGGCAC AGGAAGGCCA ATGTCGGGTA GATGACAAGG TGAATTTCCA 480 

TTTTATTCTG TTTAACAACG TGGATGGCCA CCTCTATGAA CTTGATGGAC GAATGCCTTT 540 

TCCGGTGAAC CATGGCGCCA GTTCAGAGGA CACCCTGCTG AAGGACGCTG CCAAGGTGTG 600 

CAGAGAATTC ACCGAGCGTG AGCAAGGAGA AGTCCGCTTC TCTGCCGTGG CTCTCTGCAA 660 

GGCAGCCTAA TGCTCTGTGG GAGGGACTTT GCTGATTTCC CCTCTTCCCT TCAACATGAA 720 

AATATATACC CCCCATGCAG TCTAAAATGC TTCAGTACTT GTGAAACACA GCTGTTCTTC 780 

TGTTCTGCAG ACACGCCTTC CCCTCAGCCA CACCCAGGCA CTTAAGCACA AGCAGAGTGC 840 

ACAGCTGTCC ACTGGGCCAT TGTGGTGTGA GCTTCAGATG GTGAAGCATT CTCCCCAGTG 900 

TATGTCTTGT ATCCGATATC TAACGCTTTA AATGGCTACT TTGGTTTCTG TCTGTAAGTT 960 
AAGACCTTGG ATGTGGTTAT GTTGTCCTAA AGAATAAATT TTGCTGATAG TAGC 

Seq ID HOt 128 Protein sequences 
Protein Accession S: NP_004172 
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I I I I I I 

MLKKVLSRLG VAGQWRFVDV LGLEEESLGS VPAPACALU* LFPLTAQHEN FRKKQIEELK , 
GQSVSPKVYP MKQTIGNSCG TIGLIHAVAN KQDKLGFEDG SVLKQFLSET EKMSPEDRAX 
CFEKNEAIQA AHDAVAQEGQ CRVDDKVNFH FILFNNVDGH LYELDGKMPF PVNHGASSED 
TL2JKDAAKVC REFTEREQGS VRFSAVALCK AA 



Seq ID NO*. 129 DNA sequence - 
Nucleic Acid Accession § : NM_000213 
Coding sequence: 127-5385 

1 11 21 31 41 51 

I * I .\: I . I I I 

CGCCCGCGCG CTGCAGCCCC ATCTCCTAGC GGCAGCCCAG GCGCGGAGGG AGCGAGTCCG 
CCCCGAGGTA GGT CCAGGAC GGGCGCACAG CAGCAGCCGA GGCTGGCCGG GAGAGGGAGG 
AAGAGGATGG CAGGGCCACG CCCCAGCCCA TGGGCCAGGC TGCTCCTGGC AGCCTTGATC 
AGCGTCAGCC TCTCTGGGAC CTTGGCAAAC CGCTGCAAGA AGGCCCCAGT GAAGAGCTGC 
ACGGAGTGTG TCCGTGTGGA TAAGGACTGC GCCTACTGCA CAGACGAGAT GTTCAGGGAC 
CGGCGCTGCA ACACCCAGGC GGAGCTGCTG GCOGCGGGCT GCCAGCGGGA GAGCATCGTG 
GTCATGGAGA GCAGCTTCCA AATCACAGAG GAGACCCAGA TTGACACCAC CCTGCGGCGC 
AGCCAGATGT CCCCCCAAGG CCTGCGGGTC CGTCTGCGGC CCGGTGAGGA GCGGCATTTT 
GAGCTGGAGG TGTTTGAGCC ACTGGAGAGC CCCGTGGACC TGTACATCCT CATGGACTTC 
TCCAACTCCA TGTCCGATGA TCTGGACAAC CTCAAGAAGA TGGGGCAGAA CCTGGCTCGG 
GTCCTGAGCC AGCTCACCAG CGACTACACT ATTGGATTTG GCAAGTTTGT GGACAAAGTC 
AGCGTCCCGC AGACGGACAT GAGGCCTGAG AAGCTGAAGG AGCCCTGGCC CAACAGTGAC 
CCCCCCTTCT CCTTCAAGAA CGTCATCAGC CTGACAGAAG ATGTGGATGA GTTCCGGAAT 
AAACTGCAGG GAGAGCGGAT CTCAGGCAAC CTGGATGCTC CTGAGGGCGG CTTCGATGCC 
ATCCTGCAGA CAGCTGTGTG CACGAGGGAC ATTGGCTGGC GCCCGGACAG CAGCCACCTG 
CTGGTCTTCT CCACCGAGTC AGCCTTCCAC TATGAGGCTG ATGGCGCCAA CGTGCTGGCT 
GGCATCATGA GCCGCAACGA TGAACGGTGC CACCTGGACA CCACGGGCAC CTACACCCAG 
TACAGGACAC AGGACTACCC GTCGGTGCCC ACCCTGGTGC GCCTGCTCGC CAAGCACAAC 
ATCATCCCCA TCTTTGCTGT CACCAACTAC TCCTATAGCT ACTACGAGAA GCTTCACACC 
TATTTCCCTG TCTCCTCACT GGGGGTGCTG CAGGAGGACT CGTCCAACAT CGTGGAGCTG 
CTGGAGGAGG CCTTCAATCG GATCCGCTCC AACCTGGACA TCCGGGCCCT AGACAGCCCC 
CGAGGCCTTC GGACAGAGGT CACCTCCAAG ATGTTCCAGA AGACGAGGAC TGGGTCCTTT 
CACATCCGGC GGGGGGAAGT GGGTATATAC CAGGTGCAGC TGCGGGCCCT TGAGCACGTG 
GATGGGACGC ACGTGTGCCA GCTGCCGGAG GACCAGAAGG GCAACATCCA TCTGAAACCT 
TCCTTCTCCG ACGGCCTCAA GATGGACGCG GGCATCATCT GTGATGTGTG CACCTGCGAG 
CTGCAAAAAG AGGTGCGGTC AGCTCGCTGC AGCTTCAACG GAGACTTCGT GTGCGGACAG 
TGTGTGTGCA GCGAGGGCTG GAGTGGCCAG ACCTGCAACT GCTCCACCGG CTCTCTGAGT 
GACATTCAGC CCTGCCTGCG GGAGGGCGAG GACAAGCCGT GCTCCGGCCG TGGGGAGTGC 
CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCCGCTACG AGGGTCAGTT CTGCGAGTAT 
GACAACTTCC AGTGTCCCCG CACTTCCGGG TTCCTCTGCA ATGACCGAGG ACGCTGCTCC 
ATGGGCCAGT GTGTGTGTGA GCCTGGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC 
AATGCCACCT GCATCGACAG CAATGGGGGC ATCTGTAATG GACGTGGCCA CTGTGAGTGT 
GGCCGCTGCC ACTGCCACCA GCAGTCGCTC TACACGGACA CCATCTGCGA GATCAACTAC 
TCGGCGATCC ACCCGGGCCT CTGCGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 
GGCACCGGCG AGAAGAAGGG GCGCACGTGT GAGGAATGCA ACTTCAAGGT CAAGATGGTG 
GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGCGCTGCT CCTTCCGGGA CGAGGATGAC 
GACTGCACCT. ACAGCTACAC CATGGAAGGT GACGGCGCCC CTGGGCCCAA CAGCACTGTC 
CTGGTGCACA AGAAGAAGGA CTGCCCTCCG GGCTCCTTCT GGTGGCTCAT CCCCCTGCTC 
CTCCTCCTCC TGCCGCTCCT GGCCCTGCTA CTGCTGCTAT GCTGGAAGTA CTGTGCCTGC 
TGCAAGGCCT GCCTGGCACT TCTCCOGTGC TGCAACCGAG GTCACATGGT GGGCTTTAAG 
GAAGACCACT ACATGCTGCG GGAGAACCTG ATGGCCTCTG ACCACTTGGA CACGCCCATG 
CTGCGCAGCG GGAACCTCAA GGGCCGTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 
CAGCGGCCTG GCTTTGCCAC TCATGCCGCC AGCATCAACC CCACAGAGCT GGTGCCCTAC 
GGGCTGTCCT TGCGCCTGGC CCGCCTTTGC ACCGAGAACC TGCTGAAGCC TGACACTCGG 
GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA ACGAGGTCTA CAGGCAGATC 
TCCGGTGTAC ACAAGCTCCA GCAGACCAAG TTCCGGCAGC AGCCCAATGC CGGGAAAAAG 
CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCCAA GCCGGCCCTG 
CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCTTCC ACGACCTCAA GGTGGCCCCC 
GGCTACTACA CCCTCACTGC AGACCAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 
GTGGAGCTGG TGGACGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGACGAGAAG 
CAGCTGCTGG TGGAGGCCAT CGACGTGCCC GCAGGCACTG CCACCCTCGG CCGCCGCCTG 
GTAAACATCA CCATCATCAA GGAGCAAGCC AGAGACGTGG TGTCCTTTGA GCAGCCTGAG 
TTCTCGGTCA GCCGCGGGGA CCAGGTGGCC CGCATCCCTG TCATCCGGCG TGTCCTGGAC 
GGCGGGAAGT CCCAGGTCTC CTACCGCACA CAGGATGGCA CCGCGCAGGG CAACCGGGAC 
TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 
GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 
TTCCACGTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGCC CCACTCCACC 
ACCATCATCA TCAGGGACCC AGATGAACTG GACCGGAGCT TCACGAGTCA GATGTTGTCA 
TCACAGCCAC CCCCTCACGG CGACCTGGGC GCCCCGCAGA ACCCCAATGC TAAGGCCGCT 
GGGTCCAGGA AGATCCATTT CAACTGGCTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 
GTAAAGTACT GGATTCAGGG TGACTCCGAA TCCGAAGCCC ACCTGCTCGA CAGCAAGGTG 
CCCTCAGTGG AGCtCACCAA CCTGTACCCG TATTGCGACT ATGAGATGAA GGTGTGCGCC " 
TACGGGGCTC AGGGCGAGGG ACCCTACAGC TCCCTGGTGT CCTGCOGCAC CCACCAGGAA 
GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATGTCGTCT CCTCCACGGT GACCCAGCTG 
AGCTGGGCTG AGCCGGCTGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC 
CTGGTCAACG ATGACAACCG ACCTATTGGG CCCATGAAGA AAGTGCTGGT TGACAACCCT 
AAGAACCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGCCCTACCG CTACACGGTG 
AAGGCGCGCA ACGGGGCCGG CTGGGGGCCT GAGCGGGAGG CCATCATCAA CCTGGCCACC 
CAGCCCAAGA GGCCCATGTC CATCCCCATC ATCCCTGACA TCCCTATCGT GGACGCCCAG 
AGCGGGGAGG ACTACGACAG CTTCCTTATG TACAGCGATG ACGTTCTACG CTCTCCATCG 
GGCAGCCAGA GGCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC 
TTTGCCTTCC CGGGCAGCAC CAACTCCCTG CACAGGATGA CCACGACCAG TGCTGCTGCC 
TATGGCACCC ACCTGAGCCC ACACGTGCCC CACCGCGTGC TAAGCACATC CTCCACCCTC 
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ACACGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTOGACCAC ACTGCCGAGG 4440 

GACTACTCCA CCCTCACCTC .CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC 4500 

- GACAOGCCCA CCCGCCTGGT GTTCTCTGOC CTGGGGCGCA CATCTCTCAG AGTGAGCTGG 4560 

GAGGAGCCGC GGTGCGAGCG GCCGCTGCAG GGCTACAGTG TGGAGTACCA GCTGCTGAAC 4620 

5 GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AGACCTCGGT GGTGGTGGAA 4 6 BO 

GACCTCCTGC CCAACCACTC CTACGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGG 4740 

GGCCGAGAGC GTGAGGGTGT CATCACCATT GAATCCCAGG TGCACCCGCA GAGCCCACTG 4800 

TGTCCCCTGC CAGGCTCCGC CTTCACTTTG AGCACTCCCA GTGCCCCAGG CCCGCTGGTG 4860- 

TTCACTGCCC TGAGCCCAGA CTCGCTGCAG CTGAGCTGGG AGCGGCCACG GAGGCCCAAT 4920 

10 GGGGATATCG TCGGCTACCT GGTGACCTGT GAGATGGCCC AAGGAGGAGG GCCAGCCACC 4980 

GCATTCCGGG TGGATGGAGA CAGCCCOGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 5040 

AACGTGCCCT ACAAGTTCAA GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 5100 

GAGGGCATCA TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCGT 5160 

GCCGGGCrCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA GCATCACCAC CACCCACACC 5220 

15 AGCGCCACCG AGCCCTTCCT AGTGGATGGG CCGACCCTGG GGGCCCAGCA CCTGGAGGCA 5280 

GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCOGGACACT GACCACCAGC 5340 

GGAACCCTTA GCACCGACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA 5400 

CCCCCGCCAT GTCCCACTAG GCGTCCTCCC GACTCCTCTC CCGGAGCCTC CTCAGCTACT 5460 

' CCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 5520 

20 TCCTGGGAGG CATGAAGGGG GCAAGGTCCG TCCTCTGTGG GCCCAAACCT ATTTGTAACC 5580 

AAAGAGCTGG GAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 5640 
ACTG 
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Seq ID NO: 130 Protein sequence i 
Protein Accession #; NP 000204 



MAGPRPSPWA 
CNTQAELLAA 
EVFEPLESPV 
PQTDMRPEKL 
QTAVCTRDIG 
TQDYPSVPTL 
EAFNRIRSNL 
THVCQLPEDQ 
CSEGWSGQTC 
FQCPRTSGFL 
CHCHQQSLYT 
LXRAEEVWR 
LLPLLALLLL 
SGNLKGRDW 
AQLRQEVEEN 
LTEKQVEQRA 
LVEAIDVPAG 
KSQVSYRTQD 
VQLSNPKPGA 
RKIHFNWLPP 
AQGEGPYSSL 
NDDNRPIGPM 
KRPMSIPIIP 
FPGSTNSLHR 
STLTSVSSHD 
ELKRLNIPNP 
LPGSAFTLST 
RVDGDSPESR 
LFQHPLQSEY 
LSTHMDQQFF 



11 

I 

RLLLAALISV 
GOQRESXWM 
DIiYIliMDFSN 
KEPWPNSDPP 
WRPDSTHLLV 
VRLLAKHNII 
DXRALDSPRG 
KGNIHLKPSF 
NCSTGSLSDI 
CNDRGRCSMG 
DTICEINYSA 
CSFRDEDDDC 
LCWKYCACCK 
RWKVTNNMQR 
LNEVYRQISG 
FHDLKVAPGY 
TATLGRRLVN 
GTAQGNRDYI 
HLGQPHSTTI 
SGKPMGYRVK 
VSCRTHQEVP 
KKVLVDNPKN 
DIPIVDAQSG 
MTTTSAAAYG 
SRLTAGVPDT 
AQTSWVEDL 
PSAPGPLVFT 
LTVPGLSENV 
SSITTTHTSA 
QT 



21 
I 

SLSGTLANRC 
ESSFQITBET 
SMSDDLDNLK 
FSFKNVISLT 
FSTESAFHYE 
PIFAVTNYSY 
LRTEVTSKMF 
S0GLKMDAGI 
QPCLREGEDK 
QCVCEPGWTG 
IHPGLCEDLR 
TYSYTMEGDG 
ACLAXiLPCCN 
PGFATHAASI 
VHKLQQTKFR 
YTLTADQDAR 
ITIIKEQARD 
PVEGELLFQP 
IIRDPDELDR 
YWIQGDSESE 
SEPGRLAFNV 
RMLLIENLRE 
EDYDSFLMYS 
THLSPHVPHR 
PTRIjVFSALG 
IiPNHSYVFRV 
ALSPDSLQLS 
PYKFKVQART 
TEPFLVDGPT 



31 

I 

KKAPVKSCTE 
QIDTTLRRSQ 
KMGQNLARVL 
BDVDEFRNKIi 
ADGANVLAGI 
SYYEKLHTYF 
QKTRTGSFHI 
ICDVCTCELQ 
PCSGRGECQC 
PSCDCPLSNA 
SCVQCQAWGT 
APGPNSTVLV 
RGHMVGFKED 
NPTELVPYGL 
QQPNAGKKQD 
GMVEFQEGVE 
WSFEQPEFS 
GEAWKELQVK 
SFTSQMLSSQ 
AHLLDSKVPS 
VSSTVTQLSW 
SQPYRYTVKA 
DDVLRSPSGS 
VLSTSSTLTR 
PTSIiRVSWQE 
RAQSQEGWGR 
WERPRRPNGD 
TEGFGPEREG 
LGAQHLEAGG 



41 

I 

CVRVDKDCAY 
MSPQGLRVRL 
SQLTSDYTIG 
QGERISGNLD 



FVSSLGVLQE 
RRGEVGIYQV 
KEVRSARC5F 
GHCVCYGEGR 
TCIDSNGGIC 



HKKKDCPPGS 
HYMLRENLMA 
SLRLARLiCTB 
BTIVDTVLMA 
LVDVRVPLFI 
VSRGDQVARI 
LLELQEVDSL 
PPPHGDLGAP 
VELTNLYPYC 
AEPAETNGEI 
RNGAGWGPER 
QRPSVSDDTB 
DYNSLTRSEH 
PRCERPLQGY 
EREGVITIES 
IVGYLVTCEM 
IITIESQDGG 
SLTRHVTQEF 



51 
I 

CTDEMFRDRR 
RPGEERHFEL 
FGKFVDKVSV 
APEGGFDAIL 
DTTGTYTQYR 
DSSNIVELLE 
QLRALEHVDG 
NGDFVCGQCV 
YEGQFCEYDN 
NGRGHCECGR 
CNFKVKMVDE 
FWWLIPL1LL1L 
SDHLDTPMLR 
NLLKPDTREC 
PRSAKPALLK 
RPEDDDEKQL 
PVIRRVLDGG 
LRGRQVRRFH 
QNPNAKAAGS 
DYEMKVCAYG 
TAYEVCYGLV 
EAIINLATQP 
HLVNGRMDFA 
SHSTTLPRDY 
SVEYQLLNGG 
QVHPQSPLCP 
AQGGGPATAF 
PFPQLGSRAG 
VSRTLTTSGT 



Seq ID NO: 131 DNA sequence 
Nucleic Acid Accession #: BC004372 
Coding sequence: 132.. 2231 



CCTCGTGCCG 
GCCGGCCCCT 
CTCCGGACAC 
TGAGCCTGGC 
AGAAAAATGG 
ATAGCACCTT 
. GCAGGTATGG 
GTGCAGCAAA 
ATTGCTTCAA 
ATGCCTTTGA 
AGAAAGGAGA 
ACGTGAGCAG 
CCTTTTCTAC 
ACAGAATCCC 
AAGAAAATGA 
ATGAAGATTT 
AGAACCAGGA 
CAACCACAAG 
ACCCAGAAGC 
ATTCTACAAG. 
AGGAACAGTG 



11 
I 

CGGACCCCAG 
GCCCCGCGCC 
CATGGACAAG 
GCAGATCGAT 
TOGCTACAGC 
GCCCACAATG 
GTTCATAGAA 
CAACACAGGG 
TGCTTCAGCT 
TGGACCAATT 
ATACAGAACG 
CGGCTCCTCC 
TGTACACCCC 
TGCTACCAGT 
AGATGAAAGA 
TATCTCCAGC 
CTGGACCCAG 
GATGACTGAT 
ACACCCTCCC 
CACAATCCAG 
GTTTGGCAAC 



21 
I 

CCTCTGCCAG 
CAGGGATCCT 
TTTTGGTGGC 
TTGAATATAA 
ATCTCTCGGA 
GCCCAGATGG 
GGGCATGTGG 
GTGTACATCC 
CCACCTGAAG 
ACCATAACTA 
AATCCTGAAG 
AGTGAAAGGA 
ATCCCAGACG 
ACGTCTTCAA 
GACAGACACC 
ACCATTTCAA 
TGGAACCCAA 
GTAGACAGAA 
CTCATTCACC 
GCAACTCCTA 
AGATGGCATG 



31 , 
I 

GTTCGGTCCG 
CCAGCTCCTT 
ACGCAGCCTG 
CCTGCOGCTT 
CGGAGGCCGC 
AGAAAGCTCT 
TGATTCCCCG 
TCACATCCAA 
AAGATTGTAC 
TTGTTAACCG 
ACATCTACCC 
GCAGCACTTC 
AAGACAGTCC 
ATACCATCTC 
TCAGTTTTTC 
CCACACCACG 
GCCATTCAAA 
ATGGCACCAC 
ATGAGCATCA 
GTAGTACAAC 
AGGGATATCG 



41 
I 

CCATCCTCGT 
TCGCCCGOGC 
GGGACTCTGC 
TGCAGGTGTA 
TGACCTCTGC 
GAGCATCGGA 
GATCCACCCC 
CACCTCCCAG 
ATCAGTCACA 
TGATGGCACC 
CAGCAACCCT 
AGGAGGTTAC 
CTGGATCACC 
AGCAGGCTGG 
TGGATCAGGC 
GGCTTTTGAC 
TCCGGAAGTG 
TGCTTATGAA 
TGAGGAAGAA 
GGAAGAAACA 
CCAAACACCC 



51 
I 

CCCGTCCTCC 
CCTCCGTTCG 
CTCGTGCCGC 
TTCCACGTGG 
AAGGCTTTCA 
TTTGAGACCT 
AACTCCATCT 
TATGACACAT 
GACCTGCCCA 
CGCTATGTCC 
ACTGATGATG 
ATCTTTTACA 
GACAGCACAG 
GAGCCAAATG 
ATTGATGATG 
CACACAAAAC 
CTACTTCAGA 
GGAAACTGGA 
GAGACCCCAC 
GCTACCCAGA 
AGAGAAGACT 



60 
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1740 
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cccattcgac aacagggaca gctgcagcct cagctcatac 
ggacaacacc'aagcccagag gacagttcct ggactgattt 
ccatgggacg aggtcatcaa gcaggaagaa ggatggatat 
ogcttcagcc tactgcaaat ccaaacacag gtttggtgga 
ctctttcaat gacaacgcag cagagtaatt ctcagagctt 
tggaagaaga taaagaccat ccaacaactt ctactctgac 
tcacaggtgg aagaagagac ccaaatcatt ctgaaggctc 
at acctctca - ttacccacac acgaaggaaa gcaggacctt 

AGACTGGGTC CTTTGGAGTT ACTGCAGTTA CTGTTGGAGA 
GTTCCTTATC AGGAGACCAA GACACATTCC ACCCCAGTGG 
GATCTGAATC AGATGGACAC TCACATGGGA GTCAAGAAGG 
GTCCTATAAG GACACCCCAA ATTCCAGAAT GGCTGATCAT 
TGGCTTTGAT TCTTGCAGTT TGCATTGCAG TCAACAGTCG 
AAAAGCTAGT GATCAACAGT GGCAATGGAG CTGTGGAGGA 
ACGGAGAGGC CAGCAAGTCT CAGGAAATGG TGCATTTGGT 
CTCCAGACCA GTTTATGACA GCTGATGAGA CAAGGAACCT 
TTGGGGTGTA ACACCTACAC CATTATCTTG GAAAGAAACA 
TACAGGGAGC TGGGACACTT AACAGATGCA ATGTGCTACT 
TTTTTAGCAT AAAATTTTCT ACTCTTAAAA AAAAAAAAAA 



CAGCCATCCA 
CTTCAACCCA 
GGACTCCAGT 
AGATTTGGAC 
CTCTACATCA 
ATCAAGCAAT 
AACTACTTTA 
CATCCCAGTG 
TTCCAACTCT 
GGGGTCCCAT 
TGGAGCAAAC 
CTTGGCATCC 
AAGAAGGTGT 
CAGAAAGCCA 
GAACAAGGAG 
GCAGAATGTG 
ACCGTTGGAA 
GATTGTTTCA 



ATGCAAGGAA 

ATCTCACACC. 

CATAGTACAA 

AGGACAGGAC 

CATGAAGGCT 

AGGAATGATG: 

CTGGAAGGTT 

ACCTCAGCTA 

AATGTCAATC 

ACCACTCATG 

ACAACCTCTG. 

CTCTTGGCCT 

GGGCAGAAGA 

AGTGGACTCA 

TCGTCAGAAA 

GACATGAAGA 

ACATAACCAT 

TTGCGAATCT 
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Seq ID NO: 132 Protein sequence-. 
Protein Accession 8: AAH04372 



1 11 21 31 41 51 • 

t I i III 

MDKFWWHAAW GLCLVPI*SLA QIDLNITCRF AGVFHVEKNG RYSISRTEAA DLCKAFNSTL 60 

PTMAQMBKAL SIGFETCRYG FIEGHWIPR IHPNSICAAN NTGVYILTSN TSQYDTYCFN 120 

ASAPPEEDCT SVTDLPNAFD GPITITIVNR DGTRYVQKGB YRTNPSDIYP SNPTDDDVSS 180 

GSSSERSSTS GGYIPYTFST VHPIPDEDSP WITDSTDRIP ATSTSSNTIS AGWEPNEENE 240 

DERDRHLSFS GSGIDDDEDP ISSTISTTPR AFDHTKQNQD WTQWNPSHSN PEVLLQTTTR 300 

MTDVDRNGTT AYEGNWNPEA HPPLIHHEHH EEEETPHSTS TIQATPSSTT EETATQKEQW 360 

FGNRWHEGYR QTPREDSHST TGTAAASAHT SHPMQGRTTP SPEDSSWTDF FNPISHPMGR 420 

GHQAGRRMDM DSSHSTTLQP TANPNTGLVE DLDRTGPLSM TTQQSNSQSF STSHEGLEED 480 

KDHPTTSTLT" SSNRNDVTGG RRDPNHSEGS TTLLEGYTSH YPHTKESRTF IPVTSAKTGS 540 

FGVTAVTVGD SNSNVNRSLS GDQDTPHPSG GSHTTHGSES DGHSHGSQEG GANTTSGPIR 600 

TPQIPEWliII LASLLALALI LAVCIAVMSR RRCGQKKKLV INSGNGAVED RXPSGLKGEA 660 
SKSQEMVHLV NKESSETPDQ FMTADETRNL QNVDMKIGV 

Seq ID NO: 133 DNA sequence 
Nucleic Acid Accession #: NMJJ02882 
Coding sequence: 150-755 



1 - il 21 31 41 51 

I III I I 

CGAGGTTCGG GTCGTGGGGC GGAGGGAAGA GCGGGCGGGC GGGAGGCGCC GGCGCCAGAC 60 

GCGGAGGGAA GGAGCTACGA GTAGCCGCCG AGAGGCCGCG GAGCCAGCGA CGACCGACCC 120 

AGCCGAGCCG CCGCCGCCGC CGCX3CCCCCA TGGCGGCCGC CAAGGACACT CATGAGGACC 180 

ATGATACTTC CACTGAGAAT ACAGACGAGT CCAACCATGA CCCTCAGTTT GAGCCAATAG 240 

TTTCTCTTCC TGAGCAAGAA ATTAAAACAC TGGAAGAAGA TGAAGAGGAA CTTTTTAAAA 300 

TGCGGGCAAA ACTGTTCCGA TTTGCCTCTG AGAACGATCT CCCAGAATGG AAGGAGCGAG 360 

GCACTGGTGA OGTCAAGCTC CTGAAGCACA AGGAGAAAGG GGCCATCCGC CTCCTCATGC 420 

GGAGGGACAA GACCCTGAAG ATCTGTGCCA ACCACTACAT CACGCCXSATG ATGGAGCTGA 480 

AGCCCAACGC AGGTAGCGAC CGTGCCTGGG TCTGGAACAC CCACGCTGAC TTCGCCGACG 540 

AGTGCCCCAA GCCAGAGCTG CTGGCCATCC GCTTCCTGAA TGCTGAGAAT GCACAGAAAT 600 

TCAAAACAAA GTTTGAAGAA TGCAGGAAAG AGATCGAAGA GAGAGAAAAG AAAGCAGGAT 660 

CAGGCAAAAA TGATCATGCC GAAAAAGTGG CGGAAAAGCT AGAAGCTCTC TCGGTGAAGG 720 

AGGAGACCAA GGAGGATGCT GAGGAGAAGC AATAAATCGT CTTATTTTAT TTTCTTTTCC 780 

TCTCTTTCCT TTCC T TTT T T TAAAAAATTT TACCCTGCCC CTCTTTTTCG GTTTGTTTTT 840 
ATTCTTTCAT TTTTACAAGG GACGTTATAT AAAGAACTGA ACTC 



Seq ID NO: 134 Protein sequence: 
Protein Accession #: NP_002873 

1 .11 21. 31 41 51 

I | I I I I 

MAAAKDTHED HDTSTENTDE SNHDPQFBPI VSLPEQEIKT LEEDEEELFK MRAKLFRFAS 60 
ENDLPEWKER GTGDVKLLKH KEKGAIRLLM RRDKTLKICA NHYITPMMEL KPN AG SD RAW 120 
VWNTHADFAD ECPKPEliLAI RFLNAENAQK FKTKFEECRK EIEEREKKAG SGKNDHAEKV 180 
AEKLEALSVK EETKEDAEEK Q 

Seq ID NO: 135 DNA sequence 

Nucleic Acid Accession 8: NM_000077.2 

Coding sequence": 277-742 , . 



1 -". . 11' 21 31 41 51 

I I I I I I 

CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 60 

TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TCCCTCCAGA 120 

GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG .180 

GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GCGGAGAGGG 240 

GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGCGGGGA GCAGCATGGA ' 300 

GCCTTCGGCT GACTGGCTGG CCACGGCCGC GGCCCGGGGT CGGGTAGAGG AGGTGCGGGC ' 360 

GCTGCTGGAG GCGGGGGCGC TGCCCAACGC ACCGAATAGT TACGGTCGGA GGCCGATCCA 420 

GGTCATGATG ATGGGCAGCG CCCGAGTGGC GGAGCTGCTG CTGCTCCACG GCGCGGAGCC . 480 



239 



WO 02/086443 ; / . . • 

* CAACTGOGCC GACCCCGCCA CTCTCACCCG ACCCGTGCAC GACGCTGCCC GGGAGGGCTT 540 
" CCTGGACACG CTGGTGGTGC TGCACCGGGC CGGGGCGCGG CTGGACGTGC GCGATGCCTG _ 600 ; 
GGGCCGTCTG . CCCGTGGACC TGGCTGAGGA GCTGGGCCAT CGCGATGTCG CAOGGTACCT . . 660 
GCGCGCGGCT GCGGGGGGCA CCAGAGGCAG TAACCATGCC CGCATAGATG CCGCGGAAGG • 720 
TCCCTCAGAC ATCCCCGATT-: GAAAGAACCA GAGAGGCTCT GAGAAACCTC GGGAAACTTA 780 
GATCATCAGT CACCGAAGGT CCTACAGGGC CACAACTGCC CCCGCCACAA CCCACCCCGC . 840 
TTTCGTAGTT TTCATTTAGA AAATAGAGCT TTTAAAAATG TCCTGCCTTT TAACGTAGAT , 900 
ATATGCCTTC CCCCACTACC GTAAATGTCC ATTTATATCA TTTTTTATAT ATTCTTATAA 960 
AAATGTAAAA AAGAAAAACA CCGCTTCTGC CTTTTCACTG TGTTGGAGTT TTCTGGAGTG 1020 
•AGCACTCACG CCCTAAGCGC ACATTCATGT GGGCATTTCT TGCGAGCCTC GCAGCCTCCG 1080 
GAAGCTGTCG ACTTCATGAC AAGCATTTTG TGAACTAGGG AAGCTCAGGG GGGTTACTGG 1140 
CTTCTCTTGA GTCACACTGC TAGCAAATGG CAGAACCAAA GCTCAAATAA AAATAAAATA 1200 
ATTTTCATTC ATTCACTC " 

Seq ID NO: 136 Protein sequence: 
Protein Accession #: NP_000068.1 

1 11 21 31 41 51 

I - I I I I . I 

MEPAAGSSME PSADWLATAA ARGRVEEVRA LLEAGALPNA PNSYGRRPIQ VMMMGSARVA 60 
ELLIiLHGAEP NCADPATLTR PVHDAAREGF LDTLWLHRA GARLDVRDAW GRLPVDLAEE 120 
LGHRDVARYL RAAAGGTRGS NHARIDAAEG PSDIPD 



Seq ID NO: 137 DNA sequence 

Nucleic Acid Accession 8: NM_058196.1 

Coding sequence: 104-421 

1 11 21 31 41 . 51 

I I - I I I I 

TGTGTGGGGG TCTGCTTGGC GGTGAGGGGG CTCTACACAA GCTTCCTTTC CGTCATGCCG 60 

GCCCCCACCC TGGCTCTGAC CATTCTGTTC TCTCTGGCAG GTCATGATGA TGGGCAGCGC 120 

CCGAGTGGCG GAGCTGCTGC TGCTCCACGG CGCGGAGCCC AACTGCGCCG ACCCCGCCAC 180 

TCTCACCCGA CCCGTGCAOG ACGCTGCCCG GGAGGGCTTC CTGGACACGC TGGTGGTGCT 240 

GCACCGGGCC GGGGCGCGGC TGGACGTGCG CGATGCCTGG GGCCGTCTGC CCGTGGACCT 300 

GGCTGAGGAG CTGGGCCATC GCGATGTOGC ACGGTACCTG CGCGCGGCTG CGGGGGGCAC 360 

CAGAGGCAGT" AACCATGCCC GCATAGATGC CGCGGAAGGT CCCTCAGACA TCCCCGATTG 420 

AAAGAACCAG AGAGGCTCTG AGAAACCTCG GGAAACTTAG ATCATCAGTC ACCGAAGGTC 480 

CTACAGGGCC ACAACTGCCC CCGCCACAAC CCACCCCGCT TTCGTAGTTT TCATTTAGAA 540 

AATAGAGCTT TTAAAAATGT CCTGCCTTTT AACGTAGATA TAAGCCTTCC CCCACTACCG 600 

TAAATGTCCA TTTATATCAT TTTTTATATA TTCTTATAAA AATGTAAAAA AGAAAAACAC 660 

CGCTTCTGCC TTTTCACTGT GTTGGAGTTT TCTGGAGTGA GCACTCACGC CCTAAGCGCA 720 

CATTCATGTG GGCATTTCTT GCGAGCCTCG CAGCCTCCGG AAGCTGTCGA CTTCATGACA 7 BO 

AGCATTTTGT GAACTAGGGA AGCTCAGGGG GGTTACTGGC TTCTCTTGAG TCACACTGCT 840 
AGCAAATGGC AGAACCAAAG CTCAAATAAA AATAAAATAA TTTTCATTCA TTCACTC 



Seq ID NO: 138 Protein sequence: 
Protein Accession NP_476103 . 1 

1 ." 11 " 21 31 ' 41 51 

I I I I I I 

MMMGSARVAB LLLLHGAEPN CADPATLTRP VHDAAREGPL DTLWLHRAG ARLDVHDAWG 60 
RXiPVD LABEL GHRDVARYLR AAAGGTRGSN HARIDAAEGP SDIPD 

Seq ID NO: 139 DNA sequence 

Nucleic Acid Accession NM_058197.1 

Coding sequence: 272-684 

1 11 - .; 21 31 41 " 51 

f I 1 1 I I 

CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 60 

TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TCCCTCCAGA 120 

GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG 180 

GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GCGGAGAGGG 240 

GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGCGGGGA GCAGCATGGA 300 

GCCGGCGGCG GGGAGCAGCA TGGAGCCTTC GGCTGACTGG CTGGCCACGG CCGCGGCCCG 360 

GGGTCGGGTA GAGGAGGTGC. GGGCGCTGCT GGAGGCGGGG GCGCTGCCCA AOGCACCGAA'. 420 

TAGTTACGGT CGGAGGCCGA TCCAGGTGGG TAGAAGGTCT GCAGCGGGAG CAGGGGATGG 480 

CGGGGGACTC. TGGAGGACGA AGTTTGCAGG GGAATTGGAA TCAGGTAGCG CTTCGATTCT .540 

CCGGAAAAAG GGGAGGCTTC CTGGGGAGTT TTCAGAAGGG GTTTGTAATC ACAGACCTCC 600 

TCCTGGCGAC GCCCTGGGGG CTTGGGAAAC CAAGGAAGAG GAATGAGGAG CCACGCGCGT. 660 

ACAGATCTCT CGAATGCTGA GAAGATCTGA AGGGGGGAAC ATATTTGTAT TAGATGGAAG ' 720 

TCATGATGAT GGGCAGCGCC CGAGTGGCGG AGCTGCTGCT GCTCCACGGC GCGGAGCCCA 780 

ACTGCGCCGA CCCCGCCACT CTCACCCGAC CCGTGCACGA CGCTGCCCGG GAGGGCTTCC 840 

TGGACACGCT GGTGGTGCTG CACCGGGCCG GGGCGCGGCT GGACGTGCGC GATGCCTGGG ' 900 

GCCGTCTGCC CGTGGACCTG GCTGAGGAGC TGGGCCATCG CGATGTCGCA CGGTACCTGC . 960 

GCGCGGCTGC GGGGGGCACC AGAGGCAGTA ACCATGCCCG CATAGATGCC GCGGAAGGTC 1020 

CCTCAGACAT CCCCGATTGA AAGAACCAGA GAGGCTCTGA GAAACCTCGG GAACTTAGAT 1080 

CATCAGTCAC CGAAGGTCCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT" 1140 

CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 1200 

TGCCTTCCCC CACTACCGTA AATGTCCATT TATATCATTT TTTATATATT CTTATAAAAA 1260 

TGTAAAAAAG AAAAACACCG CTTCTGCCTT TTCACTGTGT TGGA GTTTT C TGGAGTGAGC 1320 

ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 1380 

GCTGTCGACT TCATGACAAG CATTTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 1440 

CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT . 1500 ' 
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TTCATTCAXTJ. CACTC ."■ . -« . . - " . - 

Seq ID NO: .140 Protein sequence: 
Protein Accession §: NP_478 104.1 

1. * 11 . - " :21 31 41 • 51 ' 

I I- . I I I I 

MEPAAGSSME PAAGSSMEPS ADWLATAAAR GRVEEVRALIi EAGALPNAPN SYGRRPIQVG 
RRSAAGAGDG GRLWRTKFAG ELBSGSASIL RKKGRLPGEP SEGVCNHRPP PGDALGAWBT. 



PCT/US02/12476 1 



Seg ID NO: 141 DMA ' sequence 

Nucleic Acid Accession 8: NM_058195.1 

Coding sequence: 163-684 



1 
) 

CCTCCCTACG 
GATCTTGGAG 
GGGCGCGCTC 
GTGGGTCCCA 
GGCGGCGCAG 
TTGGTGACCC 
CACATCCCGC 
GTGCTGATGC 
CATGATGATG 
CTGCGCCGAC 
GGACACGCTG 
CCGTCTGCCC 
CGCGGCTGCG 
CTCAGACATC 
CATCAGTCAC 
CGTAGTTTTC' 
TGCCTTCCCC 
TGTAAAAAAG 
ACTCACGCCC 
GCTGTCGACT 
CTCTTGAGTC 
TTCATTCATT 



11 
I . 

GGCGCCTCCG 
GTCCGGGTGG 
AGGGAAGGOG 
GTCTGCAGTT 
CGGCTGCCGA 
TCCGGATTCG 
GGCTCACGGG 
TACTGAGGAG 
GGCAGCGCCC 
CCCGCCACTC 
GTGGTGCTGC 
GTGGACCTGG 
GGGGGCACCA 
CCCGATTGAA 
CGAAGGTCCT 
ATTTAGAAAA 
CACTACCGTA 
AAAAACACCG 
TAAGCGCACA 
TCATGACAAG 
ACACTGCTAG 
CACTC. 



21 
I 

GCAGCCCTTC 
GAGTGGGGGT 
GGTGCGCGCC 
AAGGGGGCAG 
GCTCGGCCCT 
GCGCGCGTGC 
GGAGTGGGCA 
CCAGCGTCTA 
GAGTGGCGGA 
TCACCCGACC 
ACCGGGCCGG 
CTGAGGAGCT 
GAGGCAGTAA 
AGAACCAGAG 
ACAGGGCCAC 
TAGAGCTTTT 
AATGTCCATT 
CTTCTGCCTT 
TTCATGTGGG 
CATTTTGTGA 
CAAATGGCAG 



31 
I 

CCGCGTGCGC 
GGGGTGGGGG 
TGCGGGGOGG 
GAGTGGCGCT 
GGAGGCGGCG 
GGCCCGCCGC 
GCGCCAGGGG 
GGGCAGCAGC 
GCTGCTGCTG 
CGTGCACGAC 
GGCGCGG CTG 
GGGCCATCGC 
CCATGCCCGC 
AGGCTCTGAG 
AACTGCCCCC 
AAAAATGTCC 
TATATCATTT 
TTCACTGTGT 
CATTTCTTGC 
ACTAGGGAAG 
AACCAAAGCT 



41 
I 

AGGGCTCAGA 
TGGGGGTGAA 
AGATGGGCAG 
GCTCACCTCT 
AGAACATGGT 
GAGTGAGGGT 
CGCCCGCCGC 
CGCTTCCTAG 
CTCCACGGCG 
GCTGCCCGGG 
GACGTGCGCG 
GATGTCGCAC 
ATAGATGCCG 
AAACCTCGGG 
GCCACAACCC 
TGCCTTTTAA 
TTTATATATT 
TGGAGTTTTC 
GAGCCTCGCA 
CTCAGGGGGG 
CAAATAAAAA 



51 
I 

GCCGTTCCGA 

GGTGGGGGGC 

GGGGCGGTGC 

GGTGCCAAAG 

GCGCAGGTTC 

TTTCGTGGTT 

TGTGGCCCTC 

AAGACCAGGT 

CGGAGCCCAA . 

AGGGCTTCCT 

ATGCCTGGGG 

GGTACCTGCG 

CGGAAGGTCC 

AAACTTAGAT 

ACCC06CTTT 

CGTAGATATA 

CTTATAAAAA 

TGGAGTGAGC 

GCCTCCGGAA 

TTACTGGCTT 

TAAAATAATT 



Seq ID NO: 142 Protein sequence: 
Protein. Accession #: NP_478102.1 



1 11 21 31 41 51 

III I I I 

MGRGRCVGPS LQLRGQEWRC SPLVPKGGAA AAELGPGGGE NMVRRFLVTL RIRRACGPPR 
VRVFWHIPR LTGEWAAPGA PAAVALVLML LRSQRIX3QQP LPRRPGHDDG QRPSGGAAAA 
PRRGAQLRRP RHSHPTRARR CPGGLPGHAG GAAPGRGAAG RARCLGPSAR GPG 

Seq ID NO: 143 DNA sequence 
Nucleic Acid Accession #: NM_018131 
Coding sequence i 412.. 1107 



1 
I 

GAAATTGCAC 
GATAAAGAGA 
AATGCTTATC 
GCCAGATATA 
AGGAGGGAGC 
TCTGCTGCAA 
CAGACTGTGG 
CAGCTGAAAG 
GTCTATGTAA 
GCTCATTCAC 
AAGCAGAAAT 
CAAACCATAA 
CAAAAAGAAG 
CATCTGGAAG 
ATTGCTAGGG 
CAGTCTCTTT 
GAACAACAGA. 
GTGCAGCATC 
GTTGGAATCC 
AGGAGAGACT 
CAATGGAAGC 
CGATCTGCTT 
TTAAAAGATT' 
CATTTTGCAT 
CTTTTAGGCT 
GGCTGCAATG . 
AAAATACTTG 
GCAAAATATT 
ACTGTGTTCG 
ATTGTTAAGA 



11 
I 

ACTTAAAGAC 
GACAGAGACT 
AACTCACAGA 
GTACTACCGC 
AGGTGTTGAA 
CCTCACGAAT 
CTCCAAACTG 
ATGCTCTGGA 
AAGGACTTTT 
TCCCACAGCA 
GTTACAACGA 
CTCAGCTGAG 
TTCACAATTT 
ATGATAGGCA 
GAAAACTTGA 
ACACATCTCT 
TGCAGGCATG 
AATTGCATGT 
TTGAAACAGC 
GAAAACAGAG 
CTGGTGGAAT 
GTCCATGTGG 
CAATACTGTA 
AAAACTGCCT 
GCTGTGCATT 
ACAGAATGTG 
GTGAGGAAAA 
TTATGTTTCG 
CATTTTTTAT 
GGTGGTGATA 



21 

I 

ATCAGTGGAT 
TTTGGAGAAA 
GAAGGACAAA 
ATTGCTTGAA 
AGCCTTATCT 
TGCTGAACTT 
CTTCAACTCA 
GAAAAATCAG 
AGCAAAGATC 
GACAAAAAAG 
TCTCTTGGCA 
TTTTGAACTG 
AAATCAGCTG 
TAAAACAGAG 
AGAAGAGAAG 
GCTAAAGCAG 
TACTTTAGAC 
AATTCTTAAG 
TTCATGAGTT 
AAAAAGTTGC 
GTCCCAAGTG 
AATACTGTTC 
TTTTCTGTTA 
ATCTACCTTT 
TCTCTTGGCA 
GTGAGCAGCG 
GATAGCTCAG 
GGGGTTTTGA 
CCAAGCACTT 
GATACTATTT 



31 
I 

GAAATCACAA 
ATTCGAGTCC 
GAAATACAGC 
CAGCTGGAAG 
GAAGAGAAAG 
GAAAGCAAAA 
TCAATAAATA 
CAGTGGCTCG 
TTTGAGTTGG 
CCTGAATCAG 
AGTGCAAAAA 
AGTGAATTTC 
TTGTATTCAC 
AAGATACAAA 
AAGAGATCCG 
CAAGAAGAAC 
TTTGAAAATG 
GAGCTCCGAA 
TGCCATCACA 
CGCCTCACCA 
CAATATACAG 
AAAGTAGCAA 
GCTTGTGGGC 
GACACTCCAG 
GTGATACCTC 
TCTACTGAGA 
GTTATTGCTA 
AAAATCAAAG 
AGAAAACCTA 
TTTTTTCATA 



41 

I 

GTGGGAAAGG 
TTGAGGCTGA 
GACTGAGAGA 
AGACAACGAG 
ACGTATTGAA 
CCAATACACT 
ATATTCATGA 
TGTATGATCA 
AAAAGAAAAC 
AAGGTTATCT 
AAGATCTTGA 
GAAGAAAATA 
AAAGAAGGGC 
AACTCAGGGA 
AAGAGCTCTT 
AAACAAGGGT 
AAAAACTCGA 
AAGCAAGAAA 
GAGCCATTAG 
AAAAGTCCCA 
TATCCAGCCA 
AATAAGTATT 
ATTTTGAATT 
CATGCTAGTG 
CCTGACATGG 
TACTAACATT 
ATGGGTTAAT 
ATAATTAACC 
CAATCCTAAT 
TT6TATAG06 



51 
I 

AAAGCTGACT 
GAAGGAGAAG 
CCAACTGAAG 
AGAAGGAGAA 
ACAACAGTTG 
CCGTTTATCA 
AATGGAAATA 
GCAGCGGGAA 
GGAAACAGCT 
TCAAGAAGAG 
GGTTGAACGA 
TGAAGAAACC 
AGATGTGCAA 
AGAGAATGAT 
ATCTCAGGTC 
AGCTCTGTTG 
CCGTCAACAT 
AAATAACACA 
TCACTTTCCA 
CTGCTGCACT 
CTGAGCATCG 
TGTTTTGATA 
ATATATTTCA 
AATCATGTAT 
TTCATCATCA 
TTGCACTGTC 
GCACCAGCAA 
AAGGATCTTA 
TTTGATGTCC 
GTTATTAGAA 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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AAGTTGGGGA TTTTCTTGAT CTTTATTGCT GCTTACCATT GAAACTTAAC CCAGCTGTGT* 1860 

TCCCCAACTC TGTTCTGCGC -ACGAAACAGT ATCTGTTTGA GGCATAATCT TAAGTGGCCA 1920 

CACACAATGT TTTCTCTTAT GTTATCTGGC AGTAACTGTA ACTTGAATTA CATTAGCACA 1980 
TTCTGCTTAG CTAAAATTGT TAAAATAAAC TTTAATAAAC CCATGTAGCC CTCTCATTTG '2040 

ATTGACAGTA TTTTAGTTAT TTTTGGCATT CTTAAAGCTG GGCAATGTAA TGATCAGATC 2100 

TTTGTTTGTC TGAACAGGTA TTTTTATACA TGCTTTTTGT AAACCAAAAA CTTTTAAATT 2160 

TCTTCAGGTT TTCTAACATG CTTACCACTG GGCTACTGTA AATGAGAAAA GAATAAAATT 2220 
ATTTAATGTT TT 



Seq ID NO: 144 Protein sequence: 
Protein Accession #: NP_060601 



1 11 21" 31 41 51 

I I I I I 1 

MEIQLKDALE KNQQWliVYDQ QREVYVKGLL AKIFBLEKKT ETAAHSLPQQ TKKPESEGYL 60 

QEEKQKCYND LLASAKKDLE VERQTITQLS FELSEFRRKY EETQKEVHNL NQLLYSQRRA 120 

DVQHLEDDRH KTEKIQKLRE ENDIARGXLE EEKKRSEELL SQVQSLYTSL LXQQEEQTRV- 180 
ALLEQQMQAC TLDFENEKLD RQHVQHQLHV ILKELRKARK NNTVGIbETA S 

Seq ID NO: 145 DNA sequence 
Nucleic Acid Accession ft: NM_001168 
Coding sequence: 50.. 478 

1 ■ 11 21 31 41 51 

I I I I I I 

CCGCCAGATT TGAATCGCGG GACCGGTTGG CAGAGGTGGC GGCGGCGGCA TGGGTGCCCC 60 

GACGTTGCCC CCTGCCTGGC AGCCCTTTCT CAAGGACCAC CGCATCTCTA CATTCAAGAA 120 

CTGGCCCTTC TTGGAGGGCT GCGCCTGCAC CCCGGAGCGG ATGGCCGAGG CTGGCTTCAT 180 

CCACTGCCCC ACTGAGAACG AGCCAGACTT GGCCCAGTGT TTCTTCTGCT TCAAGGAGCT 240 

GGAAGGCTGG GAGCCAGATG ACGACCCCAT AGAGGAACAT AAAAAGCATT CGTCCGGTTG 300 

CGCTTTCCTT TCTGTCAAGA AGCAGTTTGA AGAATTAACC CTTGGTGAAT TTTTGAAACT 360 

GGACAGAGAA AGAGCCAAGA ACAAAATTGC AAAGGAAACC AACAATAAGA AGAAAGAATT 420 

TGAGGAAACT GCGAAGAAAG TGCGCCGTGC CATCGAGCAG CTGGCTGCCA TGGATTGAGG 480 

CCTCTGGCCG GAGCTGCCTG GTCCCAGAGT GGCTGCACCA CTTCCAGGGT TTATTCCCTG 540 

GTGCCACCAG CCTTCCTGTG GGCCCCTTAG CAATGTCTTA GGAAAGGAGA TCAACATTTT 600 

CAAATTAGAT GTTTCAACTG TGCTCCTGTT TTGTCTTGAA AGTGGCACCA GAGGTGCTTC 660 

TGCCTGTGCA GCGGGTGCTG CTGGTAACAG TGGCTGCTTC TCTCTCTCTC TCTCTTTTTT 720 

GGGGGCTCAT TTTTGCTGTT TTGATTCCCG GGCTTACCAG GTGAGAAGTG AGGGAGGAAG 780 

AAGGCAGTGT CCCTTTTGCT AGAGCTGACA GCTTTGTTCG CGTGGGCAGA GCCTTCCACA 840 

GTGAATGTGT CTGGACCTCA TGTTGTTGAG GCTGTCACAG TCCTGAGTGT GGACTTGGCA 900 

GGTGCCTGTT GAATCTGAGC' TGCAGGTTCC TTATCTGTCA CACCTGTGCC TCCTCAGAGG 960 

ACAGTTTTTT TGTTGTTGTG TTTTTTTGTT TTTTTTTTTT GGTAGATGCA TGACTTGTGT 1020 

GTGATGAGAG AATGGAGACA GAGTCCCTGG CTCCTCTACT GTTTAACAAC ATGGCTTTCT 1080 

TATTTTGTTT GAATTGTTAA TTCACAGAAT AGCACAAACT ACAATTAAAA CTAAGCACAA 1140 

AGCCATTCTA AGTCATTGGG GAAACGGGGT GAACTTCAGG TGGATGAGGA GACAGAATAG 1200 

AGTGATAGGA AGCGTCTGGC AGATACTCCT TTTGCCACTG CTGTGTGATT AGACAGGCCC 1260 

AGTGAGCCGC GGGGCACATG CTGGCCGCTC CTCCCTCAGA AAAAGGCAGT GGCCTAAATC 1320 

CTTTTTAAAT GACTTGGCTC GATGCTGTGG GGGACTGGCT GGGCTGCTGC AGGCCGTGTG 1380 

TCTGTCAGCC CAACCTTCAC ATCTGTCACG TTCTCCACAC GGGGGAGAGA CGCAGTCCGC 1440 

CCAGGTCCCC GCTTTCTTTG GAGGCAGCAG CTCCCGCAGG GCTGAAGTCT GGCGTAAGAT 1500 

GATGGATTTG ATTCGCCCTC CTCCCTGTCA TAGAGCTGCA GGGTGGATTG TTACAGCTTC 1560 
GCTGGAAACC TCTGGAGGTC ATCTCGGCTG TTCCTGAGAA ATAAAAAGCC TGTCATTTC 

Seq ID NO: 146 Protein sequence: 
Protein Accession #> NP_001159 



1 11 ■ 21 31 41 51 

I I I I 1 I 

MGAPTLPPAW QPPLXDHRIS TPKNWPPLEG CACTPERMAE AGFIHCPTEN EPDLAQCFPC 60 

FKELEGWEPD DDPIEEHKKH SSGCAFLSVK KQFEELTLGE PLKLDRERAK NKIAKETNNK 120 
KKEFEETAKK VRRAIEQLAA MD 

Seq ID NO: 147 DNA sequence 

Nucleic Acid Accession #: NM_014i76.1 

Coding sequence: 127-720 

1 11 21 31 41 51 • 

I ' I I I II 

GCGCGCAGCG CTGGTACCCC GTTGGTCCGC GCGTTGCTGC GTTGTGAGGG GTGTCAGCTC 60 

AGTGCATCCC AGGCAGCTCT TAGTGTGGAG CAGTGAACTG TGTGTGGTTC CTTCTACTTG 120 

GGGATCATGC AGAGAGCTTC ACGTCTGAAG AGAGAGCTGC ACATGTTAGC CACAGAGCCA 180 

CCCCCAGGCA TCACATGTTG GCAAGATAAA GACCAAATGG ATGACCTGCG AGCTCAAATA 240 

TTAGGTGGAG CCAACACACC TTATGAGAAA GGTGTTTTTA AGCTAGAAGT TATCATTCCT 300 

GAGAGGTACC CATTTGAACC TCCTCAGATC CGATTTCTCA CTCCAATTTA TCATCCAAAC 360 

ATTGATTCTG CTGGAAGGAT TTGTCTGGAT GTTCTCAAAT TGCCACCAAA AGGTGCTTGG 420 

AGACCATCCC TCAACATCGC AACTGTGTTG ACCTCTATTC AGCTGCTCAT GTCAGAACCC 480 

AACCCTGATG ACCCGCTCAT GGCTGACATA TCCTCAGAAT TTAAATATAA TAAGCCAGCC 540 

TTCCTCAAGA ATGCCAGACA GTGGACAGAG AAGCATGCAA GACAGAAACA AAAGGCTGAT 600 

GAGGAAGAGA ■ TGCTTGATAA TCTACCAGAG GCTGGTGACT CCAGAGTACA CAACTCAACA 660 

CAGAAAAGGA AGGCCAGTCA GCTAGTAGGC ATAGAAAAGA AATTTCATCC TGATGTTTAG 720 

GGGACTTGTC CTGGTTCATC TTAGTTAATG TGTTCTTTGC CAAGGTGATC TAAGTTGCCT 780 

ACCTTGAATT. TTTTTTTAAA TATATTTGAT GACATAATTT TTGTGTAGTT TATTTATCTT 840 

GTACATATGT ATTTTGAAAT CTTTTAAACC TGAAAAATAA ATAGTCATTT AATGTTGAAA 900 
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AAAAAAAAAA.AAAAAAAAAA AAAAAAAA 

Seq ID NO: 148 Protein sequence t 
Protein Accession £:" NP .054895.1 



PCT/US02/12476 



xi 
I 



21 



31 



41 



51 



MQRASRLKRE LHMLATBPPP GITCWQDKDQ MDDLRAQILG GANTPYEKGV FKLEVIIPER 
YPFEPPQIRP . LTPIYHFNID SAGRICLDVL KLPPXGAWRP SLNIATVLTS IQL.LMSEPNP 
DDPLMADISS EFKYNKPAFL KNARQWTEKH ARQKQKADEE EMLDNLPEAG DSRVHNSTQK 
RKASQLVGIE KKFHPDV 

Seq ID NO: 149 DNA sequence 
Nucleic Acid Accession #: NM_003812 
Coding sequence: 224-2722 



11 



21 31 41 51 

I I . I I I 

TCCTCTGCGT CCCGCCCCGG GAGTGGCTGC GAGGCTAGGC GAGCC6G6AA AGGGGGCGCC 
GCCCAGCCCC GAGCCCCGCG CCCCGTGCCC CGAGCCCGGA GCCCCCTGCC CGCGGCGGCA 
CCATGCGCGC CGAGCCGGCG TGACCGGCTC CGCCCGCGGC CGCCCCGCAG CTAGCCCGGC 
GCTCTCGCCG GCCACACGGA GCGGCGCCCG GGAGCTATGA GCCATGAAGC CGCCCGGCAG 
CAGCTCGCGG CAGCCGCCCC TGGCGGGCTG CAGCCTTGCC GGCGCTTCCT GCGGCCCCCA 
ACGCGGCCCC GCCGGCTCGG TGCCTGCCAG CGCCCCGGCC CGCACGCCGC CCTGCCGCCT 
GCTTCTCGTC CTTCTCCTGC TGCCTCCGCT CGCOGCCTCG TCCCGGCCCC GCGCCTGGGG 
GGCTGCTGCG CCCAGCGCTC CGCATTGGAA TGAAACTGCA GAAAAAAATT TGGGAGTCCT 
GGCAGATGAA GACAATACAT TGCAACAGAA TAGCAGCAGT AATATCAGTT ACAGCAATGC 
AATGCAGAAA GAAATCACAC TGCCTTCAAG ACTCATATAT TACATCAACC AAGACTCGGA - 
AAGCCCTTAT . CACGTTCTTG ACACAAAGGC AAGACACCAG CAAAAACATA ATAAGGCTGT 
CCATCTGGCC CAGGCAAGCT TCCAGATTGA AGCCTTCGGC TCCAAATTCA TTCTTGACCT 
CATACTGAAC AATGGTTTGT TGTCTTCTGA TTATGTGGAG ATTCACTACG AAAATGGGAA 
ACCACAGTAC TCTAAGGGTG GAGAGCACTG TTACTACCAT GGAAGCATCA GAGGCGTCAA 
AGACTCCAAG GTGGCTCTGT CAACCTGCAA TGGACTTCAT GGCATGTTTG AAGATGATAC 
CTTCGTGTAT ATGATAGAGC CACTAGAGCT GGTTCATGAT GAGAAAAGCA' CAGGTCGACC " 
ACATATAATC CAGAAAACCT TGGCAGGACA GTATTCTAAG CAAATGAAGA ATCTCACTAT 
GGAAAGAGGT GACCAGTGGC CCTTTCTCTC TGAATTACAG TGGTTGAAAA GAAGGAAGAG 
AGCAGTGAAT CCATCACGTG GTATATTTGA AGAAATGAAA TATTTGGAAC TTATGATTGT 
TAATGATCAC AAAACGTATA AGAAGCATCG CTCTTCTCAT GCACATACCA ACAACTTTGC 
AAAGTCCGTG GTCAACCTTG TGGATTCTAT TTACAAGGAG CAGCTCAACA CCAGGGTTGT 
CCTGGTGGCT GTAGAGACCT GGACTGAGAA GGATCAGATT GACATCACCA CCAACCCTGT 
* GCAGAtGCTC CATGAGTTCT CAAAATACCG GCAGCGCATT AAGCAGCATG CTGATGCTGT 
GCACCTCATC TCGCGGGTGA CATTTCACTA TAAGAGAAGC AGTCTGAGTT ACTTTGGAGG 
TGTCTGTTCT CGCACAAGAG GAGTTGGTGT GAATGAGTAT GGTCTTCCAA TGGCAGTGGC 
ACAAGTATTA- TCGCAGAGCC TGGCTCAAAA CCTTGGAATC CAATGGGAAC CTTCTAGCAG 
AAAGCCAAAA TGTGACTGCA CAGAATCCTG GGGTGGCTGC ATCATGGAGG AAACAGGGGT 
GTCCCATTCT CGAAAATTTT CAAAGTGCAG CATTTTGGAG TATAGAGACT TTTTACAGAG 
AGGAGGTGGA GCCTGCCTTT TCAACAGGCC AACAAAGCTA TTTGAGCCCA CGGAATGTGG 
AAATGGATAC GTGGAAGCTG .GGGAGGAGTG TGATTGTGGT TTTCATGTGG AATGCTATGG' 
ATTATGCTGT AAGAAATGTT CCCTCTCCAA CGGGGCTCAC TGCAGCGACG GGCCCTGCTG 
TAACAATACC TCATGTCTTT TTCAGCCACG AGGGTATGAA TGCCGGGATG CTGTGAACGA 
GTGTGATATT ACTGAATATT GTACTGGAGA CTCTGGTCAG TGCCCACCAA ATCTTCATAA 
GCAAGACGGA TATGCATGCA ATCAAAATCA GGGCCGCTGC TACAATGGCG AGTGCAAGAC 
CAGAGACAAC CAGTGTCAGT ACATCTGGGG AACAAAGGCT GCAGGGTCTG ACAAGTTCTG- 
CTATGAAAAG CTGAATACAG AAGGCACTGA GAAGGGAAAC TGCGGGAAGG ATGGAGACCG 
GTGGATTCAG TGCAGCAAAC ATGATGTGTT CTGTGGATTC TTACTCTGTA CCAATCTTAC 
TCGAGCTCCA CGTATTGGTC AACTTCAGGG TGAGATCATT CCAACTTCCT TCTACCATCA' 
AGGCCGGGTG ATTGACTGCA GTGGTGCCCA TGTAGTTTTA GATGATGATA CGGATGTGGG 
CTATGTAGAA GATGGAACGC CATGTGGCCC GTCTATGATG. TGTTTAGATC GGAAGTGCCT 
ACAAATTCAA GCCCTAAATA TGAGCAGCTG TCCACTCGAT TCCAAGGGTA AAGTCTGTTC 
. GGGCCATGGG GTGTGTAGTA ATGAAGCCAC CTGCATTTGT GATTTCACCT GGGCAGGGAC 
AGATTGCAGT ATCCGGGATC CAGTTAGGAA CCTTCACCCC CCCAAGGATG AAGGACCCAA 
GGGTCCTAGT. GCCACCAATC TCATAATAGG CTCCATCGCT GGTGCCATCC TGGTAGCAGC 
TATTGTCCTT" GGGGGCACAG GCTGGGGATT TAAAAATGTC AAGAAGAGAA GGTTCGATCC 
TACTCAGCAA GGCCCCATCT GAATCAGCTG CGCTGGATGG ACACCGCCTT GCACTGTTGG 
ATTCTGGGTA TGACATACTC GCAGCAGTGT TACTGGAACT ATTAAGTTTG TAAACAAAAC 
CTTTGGGTGG TAATGACTAC GGAGCTAAAG TTGGGGTGAC AAGGATGGGG TAAAAGAAAA 
CTGTCTCTTT TGGAAATAAT GTCAAAGAAC ACCTTTCACC ACCTGTCAGT AAAGGGGGGA 
GGGGGCAAAA GACCATGCTA TAAAAAGAAC TGTTCCAGAA TCTTTTTTTT fCCCTAATGG 
ACGAAGGAAC AACACACACA CAAAAATTAA ATGCAATAAA GGAATCATTA AAAA 



Seq ID NO: 150 Protein sequence: 
Protein Accession NP_003803 . 



1 „ ~. 
I 

MKPPGSSSRQ 
RPRAWGAAAP 
INQDSESPYH 
HYENGKPQYS 
KSTGRPHIIQ 
LELMIVNDHK 
ITTNPVQMLH 
LPMAVAQVLS 
RDFLQRGGGA 
SDGPCCNNTS 
NGECKTRDNQ 



11 
I 

PPLAGCSLAG 
SAPKHNETAE 
VLDTKARHQQ 
KGGEHCYYHG 
KTLAGOYSKQ 
TYKKHRSSHA 
EPSKYRQRIK 
QSLAQNLGIQ 
CLFNRPTKLF 
CLFQPRGYEC 
CQYIWGTKAA 



21 
I 

ASCGPQRGPA 
KNLGVLADED 
KHNKAVHLAQ 
SIRGVKDSKV 
MKNLTMERGD 
HTNNFAKSW 
QHADAVHLIS 
WEPSSRKPKC 
EPTECGNGYV 
RDAVNECDIT 
GSDKFCYEKL 



31 
I 

GSVPASAPAR 
NTLQQNSSSN 
ASFQIEAFGS 
ALSTCNGLHG 
QWPPLSELQW 
NLiVDSIYKEQ 
RVTFHYICRSS 
DCTESWGGCI 
EAGEECPCGF 
EYCTGDSGQC 
NTEGTEKGNC 



41 
I 

TPPCRLLLVL 
ISYSNAMQKE 
KFILDLILNN 
MFEDDTFVYM 
LKRRKRAVNP 
LNTRWLVAV 
LSYFGGVCSR 
MEETGVSHSR 
HVECYGLCCK 
PPNLHKQDGY 
GKDGDRWIQC 



51 
I 

LLLPPLAASS 
ITLPSRLIYY 
GLLSSDYVEI 
IEPIiELVHDE 
SRGIPEEMKY 
ETWTEKDQID 
TRGVGVNEYG 
KPSKCSILEY 
RCSLSNGAHC 
ACNQNOGRCY 
SKHDVFCGFL 



60 
120 

180' 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 • 
840 
90 0 
960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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LCTNLTRAPR IGQDQGEIIP TSFYHQGRVI DCSGAHWU) DDTDVGYVED GTPCGPSMMC 720 

LDRKCLQIQA LNMSSCPLDS KGKVCSGHGV CSNEATCTCD FTWAGTDCSI RDPVRNLHPP 780 
KDSGPKGPSA TNLIIGSIAG AILVAAIVLG GTGWGFKNVK KRRFDPTQOG PI 

Seq ID NO: 151 DNA . sequence 
Nucleic Acid Accession 8 : NM_023915 
Coding sequence:" 250-1326 

1 . « 11. 21 31 41 

I I I • I I I 

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT - 60 

TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG 120 

GTGAATGGAC AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGAATAAC CTATGCTGAA 180 

CCCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 240 

AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 300. 

CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 3 SO 

AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 420 

TTGCTGAATG GTTTAGCAGT GTGGATCTTC. TTCCACATTA GGAATAAAAC CAGCTTCATA 480 

TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TCCATTTCGA 540 

ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 600 

TCAGTTTTGT. TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAGCATT 660 

GATCGCTATC TGAAGGTGGT' CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 720 

ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 780 

ATCCTGACAA ATGGTCAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 840 

CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 900 

GTGCTGGTGA TTCTGATCGG ATGTT ACATA . GCCATATCCA- GGTACATCCA CAAATCCAGC 960 

AGGCAATTCA TAAGTCAGTC AAGCCGAAAG CGAAAACATA ACCAGAGCAT- CAGGGTTGTT 1020 

GTGGCTGTGT TTTTTACCTG- CTTTCTACCA TATCACTTGT GCAGAATTCC- TTTTACTTTT 1080 

AGTCACTTAG ACAGGCTTTT AGATGAATCT GCACAAAAAA TCCTATATTA CTGCAAAGAA 1140 

ATTACACTTT TCTTGTCTGC GTGTAATGTT TGCCTGGATC CAATAATTTA CTTTTTCATG 1200 

TGTAGGTCAT TTTCAAGAAG GCTGTTCAAA AAATCAAATA TCAGAACCAG GAGTGAAAGC 1260 

ATCAGATCAC TGCAAAGTGT GAGAAGATCG. GAAGTTCGCA TATATTATGA TTACACTGAT 1320 

GTGTAGGCCT TTTATTGTTT GTTGGAATCG ATATGTACAA AGTGTAAATA AATGTTTCTT 1380 
TTCATTATCC TTAAAAAAAA AA 

Seq ID NO: 152 Protein sequence: 
Protein Accession #: NP_076404 

1 . U 21 31 41 51 

MGFNLTLAKL PNNBLHGQ2S HNSGNRSDGP GKNTTLHNKF DTIVLPVLYL IIFVASIbLN 60 

GLAVWIFPHI RNKTSFIFYIf KNIWADLIM^ TLTFPFRIVH DAGFGPWYFK FILCRYTSVL 120 

FYANMYTSIV FLGLISIDRY LKVVKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 180 

NGQPTEDNIH- DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 240 

ISQSSRKRKH NQSIRVWAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK IIjYYCKEITL 300 
FLSACNVCLD PIIYFFMCR9 FSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV 

Seq ID NOr 153 DNA sequence 
Nucleic Acid Accession ih D80008.1 
coding sequence: 149-739 

1 .11 21 31 41 51 

GTTCGGCGCC AAAGCGCGGA GCGGAGGCCG AGGCGAGAGC CTGGCGCTGT AGGACTAGAA 60 

CGAAAGGAGT GAGGCGCCGA GAGCCCAGAT ACCATTTTGG CGTGAGAGCT GGTGGTTGGC 120 

AAGGCCGCGG GAGTGGGAAG CGTCCGCCAT GTTCTGCGAA AAAGCCATGG AACTGATCCG 180 

CGAGCTGCAT CGCGCGCCCG AAGGGCAACT GCCTGCCTTC AACGAGGATG GACTCAGACA 240 

AGTTCTGGAG GAGATGAAAG CTTTGTATGA ACAAAACCAG TCTGATGTGA ATGAAGCAAA 300 

GTCAGGTGGA CGAAGTGATT TGATACCAAC TATCAAATTT CGACACTGTT CTCTGTTAAG 350 

AAATCGACGC TGCACTGTAG CATACCTGTA TGACCGCTTG CTTCGGATCA GAGCACTCAG 420 

AIGGGAATAT GGTAGCGTCT TGCCAAATGC ATTACGATTT CACATGGCTG CTGAAGAAAT 480 

GGAGTGGTTT AATAATTATA AAAGATCTCT TGCTACTTAT ATGAGGTCAC TGGGAGGAGA. 540 

TGAAGGTTTG GACATTACAC AGGATATGAA ACCACCAAAA AGCCTATATA TTGAAGTCCG • 600 

GTGTCTAAAA GACTATGGAG" AATTTGAAGT TGATGATGGC ACTTCAGTCC TATTAAAAAA 660 

AAATAGCCAG CACTTTTTAC CTCGATGGAA ATGTGAGCAG CTGATCAGAC AAGGAGTCCT 720 

GGAGCACATC CTGTCATGAC CATGCGCCGA GGCACTTCCA GGCTTCACTC AACTCATGGA 780 

CTCCTCTGTA CTCACTCTCT CCACCACTCC CTTCACCTCC CTCTTTGATT TTAGAAGCTA 840 

TAGACATTGT TTAAGATAAC TAAGAATACT TGGCTAAGAA' GTA TAATT TG CTAACTATTA 900 

AGGACTTTCT TTTTTTAATG TTGTACACTA TTCTTCCTAC TCTTTTTTGG TTTTGGTTTT 960 

GTTTTGTAGA GACTGTCTCA CTATGTTGCC CAAGCTGGTC TCAAACTCCT GGCCTCAAGC 1020 

AGTCCTCCCA CCTTAGCTTC TCAAAGTGTT GAGATCACAG GCGTGAGCCA CTGCACCCGG 1080 

CCCCTACTCC TTTTTCTAAT AAGCTGTATC TGTAATCACA GCATTCCTAC AGTTGTTACA 1140 

GTGTGTTTTT TAAATGAAAG TAAACATGGT TACATTTGAA TCTCTTAAAT AAGCAGTCAC 1200 

TTGGCTGGAC AGGAAGAAGG TAGATCCTGT GTGTCTTGTT TTCTGGTCAT GTGTATTGTA 1260 

CAAGCTAGAG AGCTGAATTT CTGAGATACA CATTTTCAAA TCACATGCAA GTGAAGATGA 1320 

TGGTCTGTAG AAATTTTCAG TATATATAAT GTTTAATGAC ATACTAATTT ATCATCTGGC 1380 

TATTTGGGAA GGAAGGACAC "ACATGGATTT TGCACATTTC CACCATGGTG GCTGGTGTGG - 1440 

CTTGTGGCTA TGGGGTGATC ACCAGTATCA CCACTTTGGA AGGGGACAGT GAAATTGGGG v < 1500 

CTAGAGAAGG AACTTTGTAC AGTTTTCCCT GAGATTCAGA TTGACTGAAA AGTCACATGA 1560 

AGAGTTGATT GTCTTTTAAT GGTATGTTTT AAACAGCTGA CATTTTAAAT T TTGAT GAAA 1620 

TCCAGTTTAT TCGTTTGTTC TTTTATGCTT TGGGTGTTGC ATCCGAGAAA TCTTTTCCCA 1680 

TCCCAAGATC ACAATTTTTT TTCCTTTTTA CTTCTAGAAG TGTTATAATT TTAAGCTTTA 1740 

TACTTTGGTC TATGACCCGT TlTri 'l 1 1 ' TT GTTTTGTTTT GTTTTTTCGT TTGTTTCTTT 1800 

GTTTTGAGAT GGAGTCTTGT TCTGTCACCC AGGCTGGGGT GCAGTGGCGT GATCTTGGCT 1860 

CACTGCAATC TCTATCCCCT GGGTTCAAGT GATT CTCT TG T CTCAGC CTC CCAAGTAGCT 1920 

GGGATTACAG GCACAGGCCG CCACGCCTGG CTAATTTTTG TATTTTTAGT AGAGACAGAG 1980 
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TTTTACCATG TTGGCCAGGC TGGTTTCAAA CTCCTGACCT CAAGTGACCC ACCTTGGCCT 2040 

CCCAAAGTTT TGGGATTACA AGTGTGGGCC ACCGCGGCCA GCCTATGATC CATTTTGAAT 2100 

GA AT TTT T TA TATGGTGCAA GGTGTCAATC CACCTTCACT TTTTCTTGGG AATATAGATA 2160 

TCCAGCTGTT TCACTACCAT TTTTTGAAAG GACTGCCCTT. TGCTCTATCA CCTTTGCATT 2220 

TTTGTTAAAA AGTAGTTGTC AATGTATATG TGGGTTTATT TCAGGACTCT GTTTTGTTCC 2280 

ATTGACCTGT TTTTCTCTCC TGAATGCCAA TACCATATTT GTATGTAGTG TATGTAATTT 2340 

TCTAATAATT CTTGAAACAG ATAGTATTAA TGTGTCATAT- TTTTGCTGTT. GTTTGTATTT 2400 

TTTGTAGAGA TGGGGTTTCA CCGTGTTGGC CAGGCTGTGT TGAACTCCTG AGCTAAAGCA 2460. 

ATACACTTGC CTCGTCCTCC CCATGTGCTG GGATTACAGG CGTGAGCCTT GGTGCTGGCC 2520 

CAGTGTACCA CATTTCTTTT TGAGATTTGT TTTGGCTATG TTAAGTCCTT TGCTTTTGAT 2580 

GTGAAATTTG GGAACAGGCA GGGTGTGGTG GCTTATGCCT GTAATCCTAG. AACTTTGGGA 2640 

GGCCTAGATG GGTGGATCAC TTGAGCTCAG GAGTTCCAGA- CCAGCCCGGG CCTATGGCAA 2700 

AACTCCGTCT CTACAAAAAA TAGAAAAAAT TAGCCAGGTG- TGGTGGTGCA TGCCTGTAGT 2760 

CACAGTTACA CGGCAGGCTG AGGTGGGAGG ATCACTTGAA CCCCAGAGGT CAAGACTGCA 2820 

GTGAGCTGAG ATCACACCAC TGTACTCCAG CCTGGGTGAC AAAGTGAGAC TCTATCTCAA 2880 

AAAGAAATTA' GGATCAATTT GTCAATTTCT ACAACAACAA CAACAAAAAC CCCTGTTGGG 2940 

CACCTTGATT GAGATTGCAT TGAATTTATA TAAAACTGTT GGGAGAATTG ACATCTTAAT 3000 

AATATTGAGT CTTCTGGCCT ATAAACAAGG TCTGTCTTCC TAGGTATTAA TGTTTTGTCT 3060 

TCTATTTCTC . TTAATAATCT TTTGTAGTTT TCAGTGTACA GGTCTACCAT GTCAGCATTT 3120 

CATAGTTTTG ATGCTAAATG GTATTTTAAA ATTTCAAATT CTAACCACTT GTTGCTAGTA 3180 

AATAGAAATA CAATTGATGT TGAACTTGTA TCCTTCAGCC TTGCTAAACT GTGAGTTCTC 3240 

ATGGTGTTTT TGTAAATTAC ATCAACAGTC ATGTGTTCTA TGAATAAAGA GTTTTACTCC 3300 
TTC 

; Seq ZD NO: 154 Protein sequence: 
Protein Accession #: BAA11503.1 



1 ' 11 21 31 41 51 

I I I I I I 

MFCEKAMELI RELHRAPEGQ LPAFNEDGIiR QVLEEMKALY EQNQSDVNEA KSGGRSDLIP 
TIKFRHCSL1* RNRRCTVAYL YDRLLRIRAL RWEYGSVLPN ALRFHMAAEE MEWFNNYKRS 
LATYMRSLGG DEGLDITQDM KPPKSLYIEV RCLKDYGEFE VDDGTSVLLK KNSQHPLPRW 
KCBQLIRQGV IiEHILS 



60 
120 
180 



Seq ID NO; 155 DNA sequence 

Nucleic Aci<5 Accession #: Eos sequence 

Coding sequence: 149-709 



GTTCGGCGCC 
CGAAAGGAGT 
AAGGCCGCGG. 
CGAGCTGCAT 
AGTTCTGGAG 
GTCAGGTGGA 
AAATCGACGC. 
ATGGGAATAT 
GGAGTGGTTT 
TGAAGGTTTG 
ATGCAGTGGC 
CAACCTCCAC 
GCACTTCAGr 
AGCTGATCAG 
CAGGCTTCAC 
CCCTCTTTGA 
AAGTATAATT 
ACTCTTTTTT ' 
TCTCAAACTC 
AGGCGTGAGC 
CAGCATTCCT 
AATCTCTTAA 
TTTTCTGGTC 
AATCACATGC 
ACATACTAAT 
TCCACCATGG 
GAAGGGGACA 
GATTGACTGA 
GACATTTTAA 
GCATCCGAGA 
AGTGTTATAA 
TTGTTTTTTC 
GTGCAGTGGC 
TGTCTCAGCC 
rGTATTTTTA 
CTCAAGTGAC 
CAGCCTATGA 
CTTTTTCTTG 
rTTGCTCTAT 
TTTCAGGACT 
TTGTATGTAG 
ATTTTTGCTG 
GTTGAACTCC 
GGCGTGAGCC 
TGTTAAGTCC 
CTGTAATCCT 
GACCAGCCCG 



11 

I 

AAAGCGCGGA 
GAGGCGCCGA 
GAGTGGGAAG 
CGCGCGCCCG 
GAGATGAAAG 
CGAAGTGATT 
TGCACTGTAG 
GGTAGCGTCT 
AATAATTATA 
GACATTACAC 
GCGATCTCGG 
CTCCCAGGTC 
CCTATTAAAA 
ACAAGGAGTC 
TCAACTCATG 
TTTTAGAAGC 
TGCTAACTAT 
GGTTTTGGTT 
CTGGCCTCAA 
CACTGCACCC 
ACAGTTGTTA 
ATAAGCAGTC. 
ATGTGTATTG 
AAGTGAAGAT 
TTATCATCTG. 
TGGCTGGTGT 
GTGAAATTGG 
AAAGTCACAT 
ATTTTGATGA 
AATCTTTTCC 
TTTTAAGCTT 
STTTGTTTCT 
GTGATCTTGG 
TCCCAAGTAG 
GTAGAGACAG 
CCACCTTGGC 
TCCATTTTGA 
GGAATATAGA- 
CACCTTTGCA 
CTGTTTTGTT 
TGTATGTAAT 
TTGTTTGTAT 
TGAGCTAAAG 
TTGGTGCTGG 
TTTGCTTTTG 
AGAACTTTGG 
GGCCTATGGC 



21 
I 

GCGGAGGCCG 
GAGCCCAGAT 
CGTCCGCCAT 
AAGGGCAACT 
CTTTGTATGA 
TGATACCAAC 
CATACCTGTA 
TGCCAAATGC 
AAAGATCTCT 
AGGATATGAA 
CTCAACCTGC 
CGGTGTCTAA 
AAAAATAGCC 
CTGGAGCACA 
GACTCCTCTG 
TATAGACATT 
TAAGGACTTT 
TTGTTTTGTA 
GCAGTCCTCC 
GGCCCCTACT 
CAGTGTGTTT 
ACTTGGCTGG 
TACAAGCTAG 
GATGGTCTGT 
GCTATTTGGG 
GGCTTGTGGC 
GGCTAGAGAA 
GAAGAGTTGA 
AATCCAGTTT 
CATCCCAAGA 
TATACTTTGG 
TTGTTTTGAG. 
CTCACTGCAA 
CTGGGATTAC 
AGTTTTACCA 
CTCCCAAAGT 
ATGAATTTTT 
TATCCAGCTG 
TTTTTGTT AA ' 
CCATTGACCT 
TTTCTAATAA 
TTTTTGTAGA 
CAATACACTT 
CCCAGTGTAC 
ATGTGAAATT 
GAGGCCTAGA 
AAAACTCCGT 




TTTAAATGAA 
ACAGGAAGAA 
AGAGCTGAAT 
AGAAATTTTC 
AAGGAAGGAC 
TATGGGGTGA 
GGAACTTTGT 
TTGTCTTTTA 
ATTCGTTTGT 
TCACAATTTT 
TCTATGACCC 
ATGGAGTCTT 
TCTCTATCCC 
AGGCACAGGC 
TGTTGGCCAG 
TTTGGGATTA 
TATATGGTGC 
TTTCACTACC 
AAAGTAGTTG 
GTTTTTCTCT 
TTCTTGAAAC 
GATGGGGTTT 
GCCTCGTCCT 
CACATTTCTT 
TGGGAACAGG 
TGGGTGGATC 
CTCTACAAAA 



41 
I 

CTGGCGCTGT 
; CGTGAGAGCT 
• AAAGCCATGG 
AACGAGGATG 
TCTGATGTGA 
CGACACTGTT 
CTTCGGATCA 
CACATGGCTG 
ATGAGGTCAC 
AGCCTATATA 
TCCCAGGTTC 
AGAATTTGAA 
ACCTCGATGG 
ACCATGCGCC 
CTCCACCACT 
ACTAAGAATA 
TGTTGTACAC 
CACTATGTTG 
TCTCAAAGTG 
ATAAGCTGTA 
AGTAAACATG 
GGTAGATCCT 
TTCTGAGATA 
AGTATATATA 
ACACATGGAT 
TCACCAGTAT 
ACAGTTTTCC 
ATGGTATGTT 
TCTTTTATGC 
TTTTCCTTTT 
GTTTTTTTTT 
GTTCTGTCAC 
CTGGGTTCAA 
CGCCACGCCT 
GCTGGTTTCA 
CAAGTGTGGG 
AAGGTGTCAA 
ATTTTTTGAA 
TCAATGTATA 
CCTGAATGCC 
AGATAGTATT 
CAO0QIGTT0 
CCCCATGTGC 
TTTGAGATTT 
CAGGGTGTGG 
ACTTGAGCTC 
AATAGAAAAA 



51 
I 

AGGACTAGAA 

GGTGGTTGGC 

AACTGATCCG 

GACTCAGACA 

ATGAAGCAAA 

CTCTGTTAAG 

GAGCACTCAG 

CTGAAGAAAT" 

TGGGAGGAGA • 

TTGAAGCTGG 

ACCTCAACTG 

GTTGATGATG 

AAATGTGAGC 

GAGGCACTTC 

CCCTTCACCT -■ 

CTTGGCTAAG. 

TATTCTTCCT 

CCCAAGCTGG 

TTGAGATCAC 

TCTGTAATCA 

GTTACATTTG 

GTGTGTCTTG 

CACATTTTCA 

ATGTTTAATG 

TTTGCAGATT 

CACCACTTTG 

CTGAGATTCA 

TTAAACAGCT 

TTTGGGTGTT 

TACTTCTAGA 

TTGTTTTGTT 

CCAGGCTGGG 

GTGATTCTCT 

GGCTAATTTT 

AACTCCTGAC 

CCACCGCGGC 

TCCACCTTCA 

AGGACTGCCC 

TGTGGGTTTA 

AATACCATAT 

AATGTGTCAT 

GCCAGGCTGT 

TGGGATTACA 

GTTTTGGCTA 

TGGCTTATGC 

AGGAGTTCCA 

ATTAGCCAGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
25B0 
2640 
2700 
2760 
2820 
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TGTGGTGGTG CATGCCTGTA GTCACAGTTA CACGGCAGGC TGAGGTGGGA GGATCACTTG 2880 

AACCCCAGAG GTCAAGACTG CAGTGAGCTG AGATCACACC ACTGTACTCC AGCCTGGGTG 2940 

ACAAAGTGAG • ACTCTATCTC AAAAAGAAAT TAGGATCAAT TTGTCAATTT CTACAACAAC "3000 

AACAACAAAA ACCCCTGTTG GGCACCTTGA TTGAGATTGC ATTGAATTTA TATAAAACTG. 3060 

TTGGGAGAAT TGACATCTTA ATAATATTGA GTCTTCTGGC CTATAAACAA GGTCTGTCTT 3120 
CCTAGGTATT AATGTTTTGT CTTCTATTTC TCTTAATAAT CTTTTGTAGT TTTCAGTGTA -3180 

CAGGTCTACC ATGTCAGCAT TTCATAGTTT TGATGCTAAA TGGTATTTTA AAATTTCAAA 3240 

TTCTAACCAC TTGTTGCTAG. TAAATAGAAA TACAATTGAT GTTGAACTTG TATCCTTCAG 3300 

CCTTGCTAAA CTGTGAGTTC TCATGGTGTT TTTGTAAATT ACATCAACAG TCATGTGTTC 3360 
TATGAATAAA GAGTTTTACT CGTTC . 

Seq ID -NO: 156 Protein sequence: 
Protein. Accession #: Eos sequence 

i 11 21 31 41 51 

I I I i I I 

MPCEKAMELI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY EQNQSDVNEA KSGGRSDLIP 60 

TIKFRHCSIiL. RNRRCTVAYL YDRLLRI RAL RWEYGSVLPN ALRFHMAAEE MSWFNNYKRS 120 

LATYMRSLGG DEGLDITQDM KPPKSLYIEA GCSGAISAQP ATSTSQVHLN CNLHLPGPVS 180 
KRLWRI 

Seq ID NO: 157 DNA Bequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 148-621 

1 11 21 31 41 51 

TTCGGCGCCA AAGCGCGGAG CGGAGGCCGA GGCGAGAGCC TGGCGCTGTA GGACTAGAAC ■ 60 

GAAAGGAGTG AGGCGCCGAG • AGCCCAGATA CCATTTTGGC GTGAGAGCTG GTGGTTGGCA 120 

AGGCCGCGGG AGTGGGAAGC GTCCGCCATG TTCTGCGAAA AAGCCATGGA ACTGATCCGC 180 

GAGCTGCATC GCGCGCCCGA AGGGCAACTG CCTGCGTTCA ACGAGGATGG ACTCAGACAA 240 

GTTCTGGAGG : AGATGAAAGC . TTTGTATGAA ■ CAAAACCAGr CTGATGTGAA TGAAGCAAAG 300 

TCAGGTGGAC GAAGTGATTT GATACCAACT ATCAAATTTC GACACTGTTC TCTGTTAAGA 360 

AATCGACGCT GCACTGTAGC ATACCTGTAT GACCGCTTGC TTCGGATCAG AGCACTCAGA 420 

TGGGAATATG GTAGCGTCTT GCCAAATGCA TTACGATTTC ACATGGCTGC TGAAGAAGTC 480 

CGGTGTCTAA AAGACTATGG AGAATTTGAA GTTGATGATG GCACTTCAGT CCTATTAAAA 540 

AAAAATAGCC AGCACTTTTT ACCTCGATGG AAATGTGAGC AGCTGATCAG ACAAGGAGTC 600 

CTGGAGCACA TCCTGTCATG ACCATGCGCC GAGGCACTTC CAGGCTTCAC TCAACTCATG 660 

GACTCCTCTG TACTCACTCT CTCGACCACT CCCTTCACCT CCCTCTTTGA TTTTAGAAGC 720 

TATAGAGATT GTTTAAGATA ACTAAGAATA CTTGGCTAAG AAG TATAATT TGCTAACTAT 780 

TAAGGACTTT CTTTTTTTAA- TGTTGTACAC TATTCTTCCT ACTCTTTTTT GGTTTTGGTT 840 

TTGTTTTGTA GAGACTGTCT- CACTATGTTG CCCAAGCTGG TCTCAAACTC CTGGCCTCAA 900 

GCAGTCCTCC CACCTTAGCT TCTCAAAGTG TTGAGATCAC AGGCGTGAGC CACTGCACCC 960 

GGCCCCTACT CCTTTTTCTA ATAAGCTGTA TCTGTAATCA' CAGCATTCCT ACAGTTGTTA 1020 

CAGTGTGTTT TTTAAATGAA AGTAAACATG GTTACATTTG AATCTCTTAA ATAAGCAGTC 1080 

ACTTGGCTGG 1 ACAGGAAGAA GGTAGATCCT GTGTGTCTTG' TTTTCTGGTC ATGTGTATTG 1140 

TACAAGCTAG AGAGCTGAAT " TTCTGAGATA CACATTTTCA AATCACATGC AAGTGAAGAT 1200 

GATGGTCTGT AGAAATTTTC AGTATATATA ATGTTTAATG ACATACTAAT TTATCATCTG 1260 

GCTATTTGGG AAGGAAGGAC ACACATGGAT TTTGCACATT TCCACCATGG TGGCTGGTGT 1320 

GGCTTGTGGC TATGGGGTGA TCACCAGTAT CACCACTTTG GAAGGGGACA GTGAAATTGG 1380 

GGCTAGAGAA GGAACTTTGT ACAGTTTTCC CTGAGATTCA GATTGACTGA AAAGT CACAT 1440 

GAAGAGTTGA TTGTCTTTTA ATGGTATGTT TTAAACAGCT GACATTTTAA ATTTTGATGA 1500 

AATCCAGTTT ATTCGTTTGT TCTTTTATGC TTTGGGTGTT- GCATCCGAGA AATCTTTTCC 1560 

CATCCCAAGA TCACAATTTT TTTTCCTTTT TACTTCTAGA AGTGT TATA A TTTTAAGCTT 1620 

TATACTTTGG TCTATGACCC GTTTTTTTTT TTGTTTTGTT. TTGTTTTTTC GTTTGTTTCT 1630 

TTGTTTTGAG ATGGAGTCTT GTTCTGTCAC CCAGGCTGGG GTGCAGTGGC GTGATCTTGG 1740 

CTCACTGCAA TCTCTATCCC CTGGGTTCAA GTGATTCTCT TGTCTCAGCC • TCCCAAGTAG 1800 

CTGGGATTAC AGGCACAGGC CGCCACGCCT GGCTAATTTT TGTATTTTTA : GTAGAGACAG 1860 

AGTTTTACCA TGTTGGCCAG GCTGGTTTCA AACTCCTGAC CTCAAGTGAC CCACCTTGGC 1920 

CTCCCAAAGT TTTGGGATTA CAAGTGTGGG CCACCGCGGC CAGCCTATGA TCCATTTTGA 1980 

ATGAATTTTT TATATGGTGC AAGGTGTCAA" TCCACCTTCA CTTTTTCTTG GGAATATAGA 2040 

TATCCAGCTG TTTCACTACC ATTTTTTGAA AGGACTGCCC TTTGCTCTAT CACCTTTGCA 2100 

TTTTTGTTAA AAAGTAGTTG" TCAATGTATA TGTGGGTTTA TTTCAGGACT CTGTTTTGTT 2160 

CCATTGACCT GTTTTTCTCT CCTGAATGCC AATACCATAT T TGTA TGTAG TGTATGTAAT 2220 

TTTCTAATAA" TTCTTGAAAC AGATAGTATT AATGTGTCAT ATTTTTGCTG TTGTTTGTAT 2280 

TTTTTGTAGA* GATGGGGTTT CACCGTGTTG GCCAGGCTGT GTTGAACTCC TGAGCTAAAG 2340 

CAAT ACACTT GCCTCGTCCT CCCCATGTGC TGGGATTACA GGCGTGAGCC TTGGTGCTGG 2400 

CCCAGTGTAC CACATTTCTT TTTGAGATTT GTTTTGGCTA TGTTAAGTCC TTTGCTTTTG 2460 

ATGTGAAATT TGGGAACAGG CAGGGTGTGG TGGCTTATGC CTGTAATCCT AGAACTTTGG 2520 

GAGGCCTAGA- TGGGTGGATC ACTTGAGCTC AGGAGTTCCA GACCAGCCCG GGCCTATGGC 2580 

AAAACTCCGT CTCTACAAAA AATAGAAAAA ATTAGCCAGG TGTGGTGGTG CATGCCTGTA 2640 

GTCACAGTTA CACGGCAGGC TGAGGTGGGA GGATCACTTG AACCCCAGAG GTCAAGACTG 2700 

CAGTGAGCTG AGATCACACC ACTGTACTCC AGCCTGGGTG ACAAAGTGAG ACTCTATCTC 2760 

AAAAAGAAAT TAGGATCAAT TTGTCAATTT CTACAACAAC AACAACAAAA ACCCCTGTTG 2820 

GGCACCTTGA TTGAGATTGC ATTGAATTTA TATAAAACTG TTGGGAGAAT TGACATCTTA . 2880 

ATAATATTGA GTCTTCTGGC' CTATAAACAA GGTCTGTCTT CCTAGGTATT .AATGTTTTGT 2940 

CTTCTATTTC TCTTAATAAT CTTTTGTAGT. TTTCAGTGTA CAGGTCTACC ATGTCAGCAT 3000 

TTCATAGTTT TGATGCTAAA TGGTATTTTA AAATTTCAAA TTCTAACCAC TTGTTGCTAG 3060 

TAAATAGAAA- TACAATTGAT GTTGAACTTG TATCCTTCAG CCTTGCTAAA CT GTGAGT TC 3120 

TCATGGTGTT TTTGTAAATT ACATCAACAG TCATGTGTTC TATGAATAAA GAGTTTTACT 3180 
CCTTC . . • 

Seq ID NO; 158 Protein sequence: 
Protein Accession U: Eos sequence 

1 ~il 21 31 41 51 . 

! |- | I t I 
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■ MFCEKAMSLI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY EQNQSDVNEA KSGGRSDLIP 60 
TIKFRHCSLL RNRRCTVAYL YDRLLRIRAL RWEYGSVLPN ALRFHMAAEE VRCLKDYGEF 120 
EVDDGTSVLI* KKNSQHFLPR WKCEQLIRQG VLBHILS ■ 

Seq ID NO.: 159 DNA sequence 
Nucleic. Acid Accession 8: Eos sequence 
Coding sequence: 149-229 

V 11 21 • 31- 41 51 

I 1 I I I- I 

GTTCGGCGCC AAAGCGCGGA GCGGAGGCCG AGGCGAGAGC CTGGOGCTGT AGGACTAGAA 

CGAAAGGAGT GAGGCGCCGA GAGCCCAGAT ACCATTTTGG OGTGAGAGCT GGTGGTTGGC 120 

AAGGCCGCGG GAGTGGGAAG CGTCCGCCAT GTTCTGCGAA AAAGCCATGG AACTGATCCG 1B0 

CGAGCTGCAT CGCGCGCCCG AAGGGCAACT- GCCTGCCTTC AACAATTAGC TGGGTGTGGT 240 

GGCACACACC TGTAGTCCCA GCAACTTAGG AGGCTGAAGT GAGAGGATTG CATGGCTCCA 300 

GGAAGTTGAA ACTGCAGTGA ACTGTGGTCA CGCTATTACA CTCCAGCCTG GGTGACAGAC 360 

TGAATCCCTG TCTCAAAAAG GAAAAGGAGG ATGGACTCAG ACAAGTTCTG GAGGAGATGA 420 

AAGCTTTGTA TGAACAAAAC CAGTCTGATG TGTTCTCTGT -TAAGAAATCG ACGCTGCACT 480 
GTAGCATACC TGTATGACCG CTTGCTTCGG ATCAGAGCAC TCAGATGG 



Seq ID NO: 160 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

ATGTTCTGCG AAAAAGCCAT GGAACTGATC CGCGAGCTGC ATCGCGCGCC CGAAGGGCAA 60 
CTGCCTGCCT TCAACAATTA G 

Seq ID NO: 161 DNA sequence 
Nucleic Acid Accession fh U10694. 
Coding sequence: 1333-2280 . 

1 11 21 3i: 41 51 

GGATCCGGCC GGATCTCAGG GAGGTGAGGA CTTTGTTCTC AGAGGGTGTG TGTGGACAAA 60 

ACAGGGAGGC CCTGTGTTCG ACAGACACAG TGGTCCCAGG ■ ATTGGAGAGC AGTCCAGGTG 120 

AGGAACCTAA GGGAGGATCG . AGGGTACCTC CAGGCCAGAG AAACTCTCAG, ATCAAGAGAG 180 

TTTGCCCTGC CCCTACTGTC ACCCCAGAGA GCCCGGGCAG. GGCTQTCTGC TGAGGTCCCT 240 

CCTTTATCCT GGGATCACTG GTGTCGGGGA GGGCTGGCCT TGGTCTGAGG GGGCTGCACT 300 

CACGTCAGCA GAGGGAGGGT CCCAGGCCCT. GCCAGGAGTC CAGGTGCAGA CTGAGGGGAC 360 

CCCACTCACC AAACACAGAG GACCTAGCCC CACCCTGCCC CTTGTGTCAG CTGAGGGAAG. 420 

CCGCTGGGTG. GATGGACTCC CCTCACTTCC TCTTCAGGTG TCTCCTGGAG ATAGGGCCTC 480 

AGGTCAACAG AGGGAGGGTT CCAGACCCTG ' CAGGCATCAA GATGAGGACC AGGCAGTATC ' 540 

CTCACCCCAG GACACATGGA CCCCATTGAA TTTAGACATC TCTTACTGTA CTTCCGAGGA 600 

AACCCTGGGC AGGTGTGGGC AGATGTTGGT TGGGGCATGT: CCTTCTGTTC CATATCAGGG 660 

ATGTCAGCTC CTGATCTGAG AGACTCTCAG. GCAAGTAGAG; GAGTAGAGTC CAGTCCCTGC 720 

CAGGAGAAAG GTCAGGGCCC .TGAGTGAGCG CAGAGGGGAC CATCCACCCC AAAAGTGTGT 780 
AG AACTCAAG " AGTGTCCAGC CCGCCCTCTT GACAGCACTG" 1 AGGGACCGGG GCTCTGCCTG . 940 

CAGTCTGCAG CCTAAGGGCC CCTCGATTCC TCTTCCAGGA GCTCCAGGAA GCAGGCAGGC 900 

CTTGGTCTGA GACAGTGTCC TCAGGTCGCA- GAGCAGAGGA GACCCAGGCA GTGTCAGCAG • - 960 

TGAAGGTGAA GTGTTCACCC TGAATGTGCA CCAAGGGCCC CACCTGCCCC AGCACACATG: 1020 

GGACCCCATA GCACCTGGCC CCATTCCCCC TACTGTCACT CATAGAGCCT TGATCTCTGC 1080 

AGGCTAGCTG CACGCTGAGT- AGCCCTCTCA CTTCCTCCCT CAGGTTCTCG GGACAGGCTA 1140 

ACCAGGAGGA CAGGAGCCCC AAGAGGCCCC AGAGCAGCAC. TGACGAAGAC GTGTAAGTCA 1200 

GCCTTTGTTA GAACCTCCAA GGTTCGGTTC TCAGCTGAAG TCTCTCACAC ACTCCCTCTC 1260 

TCCCCAGGCC TGTGGGTCTC CATCGCCCAG CTCCTGCCCA CGCTCCTGAC TGCTGCCCTG 132 0 

ACCAGAGTCA TCATGTCTCT CGAGCAGAGG AGTCCGCACT GCAAGCCTGA TGAAGACCTT 1380 

GAAGCCCAAG GAGAGGACTT. GGGCCTGATG GGTGCACAGG AACCCACAGG CGAGGAGGAG 1440 
GAGACTACCT CCTCCTCTGA CAGCAAGGAG GAGGAGGTGT CTGCTGCTGG GTCATCAAGT 1500 

. CCTCCCCAGA GTCCTCAGGG AGGCGCTTCC TCCTCCATTT CCGTCTACTA CACTTTATGG 1560 

AGCCAATTCG ATGAGGGCTC CAGCAGTCAA GAAGAGGAAG AGCCAAGCTC CTCGGTCGAC 1620 

CCAGCTCAGC TGGAGTTCAT GTTCCAAGAA GCACTGAAAT TGAAGGTGGC TGAGTTGGTT 1680 . 

CATTTCCTGC TCCACAAATA" TCGAGTCAAG GAGCCGGTCA CAAAGGCAGA AATGCTGGAG 1740 

AGCGTCATCA AAAATTACAA GCGCTACTTT CCTGTGATCT TCGGCAAAGC CTCCGAGTTC 1800 

ATGCAGGTGA TCTTTGGCAC TGATGTGAAG GAGGTGGACC CCGCCGGCCA CTCCTACATC I860 

CTTGTCACTG CTCTTGGCCT- CTCGTGCGAT- AGCATGCTGG GTGATGGTCA TAGCATGCCC 1920 

AAGGCCGCCC TCCTGATCAT TGTCCTGGGT GTGATCCTAA CCAAAGACAA CTGCGCCCCT 1980 

GAAGAGGTTA TCTGGGAAGC GTTGAGTGTG ATGGGGGTGT ATGTTGGGAA GGAGCACATG 2040 

TTCTACGGGG AGCCCAGGAA GCTGCTCACC CAAGATTGGG TGCAGGAAAA CTACCTGGAG 2100 

TACCGGCAGG TGCCCGGCAG TGATCCTGCG CACTACGAGT TCCTGTGGGG TTCCAAGGCC 2160 

CACGCTGAAA CCAGCTATGA GAAGGTCATA AATTATTTGG. TCATGCTCAA TGCAAGAGAG 2220 

CCCATCTGCT" ACCCATCCCT TTATGAAGAG GTTTTGGGAG' AGGAGCAAGA GGGAGTCTGA 2280 

GCACCAGCCG CAGCCGGGGC CAAAGTTTGT GGGGTCAGGG CCCCATCCAG CAGCTGCCCT 2340 

GCCCCATGTG ACATGAGGCC CATTCTTCGC TCTGTGTTTG AAGAGAGCAA TCAGTGTTCT 2400 

CAGTGGCAGT GGGTGGAAGT GAGCACACTG TATGTCATCT CTGGGTTCCT TGTCTATTGG 2460 

GTGATTTGGA GATTTATCCT TGCTCCCTTT TGGAATTGTT CAAATGTTCT TTTAATGGTC 2520 

AGTTTAATGA ACTTCACCAT- CGAAGTTAAT GAATGACAGT AGTCACACAT ATTGCTGTTT 2580 

ATGTTATTTA GGAGTAAGAT TCTTGCTTTT GAGTCACATG GGGAAATCCC TGTTATTTTG 2640 

TGAATTGGGA CAAGATAACA TAGCAGAGGA ATTAATAATT TTTTTGAAAC TTGAACTTAG 2700 

CAGCAAAATA GAGCTCATAA AGAAATAGTG AAATGAAAAT. GTAGTTAATT CTTGCCTTAT 2760 

ACCTCTTTCT CTCTCCTGTA AAATTAAAAC ATATACATGT ATACCTGGAT TTGCTTGGCT 2820 

TCTTTGAGCA TGTAAGAGAA ATAAAAATTG AAAGAATAAT TTTTCCTGTT CACTGGCTCA 2880 
TTTTTTCTTC AGACACGCAC TGAACATCTG TTATTCGGAA CACCCTGGGT T 



Seq ID NO: 162 Protein sequence: 
Protein Accession 8: AAA68877.1 
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MSLEQHSPHC KPDEDLEAQG EDLGLMGAQH 
PQGGASSSIS VYYTLWSQFD EGSSSQEEEE 
HKYRVKEPVT KAEMLESVIK NYKRYFPVIF 
LGLSCDSMLG' DGHSMPKAMi LIIVI/3VILT 
PRKLLTQDWV QENYLEYRQV PGSDPAEYEP 
PSLYEEVLGE EQEGV 



31 .41 51 

I I I . 

PTGEESETTS SSDSKEEEVS AAGSSSPPQS 60 

PSSSVDPAQL EFMFQEAIiKIj KVAELVHFLL 120 

GKASEFMQVI- FGTDVKEVDP AGHSYILVTA 180 

KDNCAPBEVI WEALSVMGVY VGKEHMFYGE 240 
LWGSKAHAET SYEKVINYLV MLNAREPICY " 300 



Seq ID NO: 163 DNA sequence 
Nucleic" Acid Accession ft: AF292100 
Coding sequence: 30-809 
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GGGGGGGGAG 
. AAGTTCGTCA 
CTCAAAATGA 
ATATACGAGA 
GATACAAAGA- 
ATGACCTGGC 
CAGCAACACA 
GTGACAGCAT 
CAGGACGATT 
AAGGATTAGA 
AATTCTTAGA 
ACACTTGGAA 
ATGAAGAAGG 
TTGCTGGGAC- 
CTGTACAATA' 
AGATCAATCC 

ccatatcgtA- 

TAAACATTTA 
ATTCTCCCAT 
AAGCACAATG 
CAAAACAATT 
AATAGCACTT 
TTTGGGGGGA 
ATGAAAAAAC 
CTGAAAAATG 
GCTTACCTTA 
CTGCATAATG 
TGCTTTTGAA 
. TTTTGGATTG 
' TTTTACTTTT- 
AAGATAGTAG 
AATATAGGTA 
AATTACTTTA 
CAAACAAGCA 
TCACTCAGAT 
GTGTGTGTGT 
TTTTTTTTTT 
ATTTCCCTGT 
TAAGGCAGGC 
TAGATTCTGC 
AGGAATGTAC 
ATGACATATC 
GACATATTAA 
TTTGAAAATT 
ATCTTGCTGC 
GAAAAATTTT 
TGTCTTGAAG 
GTTTTCTTTA 
TTTTAAATGC 
TATAATTACT 
TAAAAGTTAA . 
CTGAAAAGAG 
AGGGAATATT 
CTGTGAGAGA 
CCCCTTGAAT 
. TGATTCTTTG 
GGTACAAATT 
CACATATAAA 
GTAAGTTCAA 
GCAGTGAAAT 
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AGGCCTGGAG 
GTTTATGATC 
CTGGAAGTTA 
GAGTGTAAAA 
CCCTCAAGAT 
ACTCGATCCA 
GTGCGAGTTC 
AGAACAACTA 
TAAGGATTTT 
TCTAGAAATG 
CTTATGGAAT 
TCTTCTTTTA 
AGCATGGCCT 
AAAAAGTACA 
AATACAACAG 
TCACAATTCA 
TTTCATTCTT 
TAATTCCAAC 
AAGTCTTAGA 
CTGTCTTCAT 
TATAAAAACT- 
CCAGTGAGAG - 
ATTTTTTTTT 
AATGTCAGAA 
TATTGTGTTT 
CCCATGTTTA 7 
TTAGATTCTT 
ATGCTTATCT. 
CTACACAATA 
TTTTTGTCAT 
GAGTTAAATT 
CTGCTCTTTC 
ACTGCTAATC 
TACACGATGC 
TAGCTCAGTT 
GTGTGTGTGT 
TTTTTTTTAA 
TCACTTACTG 
TAGAAATGTA. 
TTGGGATATA- 
TTAGAATAGC 
AAAACTGCTT 
TATTCCAATC 
TCTATTCAGG 
TTTCAGTATT' 
TAAAAGATAT 
ATTTTGAGTT 
AAAGTCATAT 
AAAACCTTTT 
GTGTAATTAT* 
GGTTGTTCAC 
AAGACTGATT 
TTGTACTAAA 
GCCAGAGAGA 
TGCTTTAACT 
TTTTAAACTG 
TTAAAATGTA 
AAATACTTGA 
TTGTGTTTCT 
TG 
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GACACCAACA 
TTCACACAAT 
GATGTTGCAA: 
GGATCATTGG 
GAGAATAAAA 
GCCAGCATTA 
TCCAAACAGG 
AAGGCCCAGA 
TACCAGTTTA 
GCCATTGCCT 
AAATTTTTGT 
GACTTCAGTA 
GTTCTTATTG 
ACAGTGTAGC 
AAAATTGCAC 
GACTGAGGGT 
CTAATGGTGG 
ATTGTGGATT 
AGCTTTATGG - 
CAGAAAACAG ; 
TCTTAAATAT. 
TGATATATTT 
TTTCCTGGCA 
TAACCGACAT 
TAGACTCTTG: 
TATATAGCTG 
CTCCTGTCTA 
GCCTGTCACA' 

caat attct a " 
tgcactatgg. 
ttgaaaatgg: 
agacaaacag ■ 
actgtggttg 
aaaataccag. 
ttctcatcaa 

GTATGTGTGT 
ATTACAAAAG' 
ACATGTGAAG 
ATACTTCAAA 
TATCCAGGCA 
AGTACATTTT 
TTTACATGAT 
ATAACTTTAA> 
CTACTGGAAT 
TCCTGATTTT. 
ATATATCTAT 
ATGGTTATTG 
GGGTTCGTGG 
CAACAAAATA 
AAACAAAAAT 
TATGATGGCA 
TAATGTGGTG 
AAATGATTAC 
GTGAGAGAGA 
CCTAAGTGTT 
AACCATAGGT 
ATGCTTTTTA 
GATGAAGATT 
TACAGTCCCT 



31 
I 

TGAACAAGTT 
CTAGTGAAAA 
CAGATAATTT 
ACAGGAAGAA 
TTGGAATAGA 
GTGTGTTGAT 
AGTTCATGGA 
TACCCAAGAT, 
CTTTTAATTT 
ACTGGAACTT 
TGGAACATCA 
CGATGATTGC 
ATGACTTTGT 
ACTAAAGGAA 
AGTCAATTTC 
TGAGACAAAA 
TTTGGGCTTG 
TCATCTTATA 
TGATTATTTT 
TTGGCATAAG 
ACGCTTTGGG 
AAGTGTACTG 
AATCACATAT- 
GAAAATTTTTn 
ATTTCAAAAG. 
TCCTACAGGG 
CATTATGCAC 
TAAGTTAAAC 
AATTTAGGCA 
AACACAAATG' 
TTGTGATGAG 
TCCATTTTTA 
CCAAATATTT- 
TCTGGCTTCT 
AGCAGAATGC 
ATATATATAT 
CCATGAGCTG: 
AAGGGTTTCT 
TGTTTGATGA 
CTCTCTAAGG 
ATGCAAATAT 
TTTGAAATAG 
TTCAAGAATG 
TGGTTATTAA 7 
TTTGTAAATA 
ATATCTATAT 
GTTTCAGATT. 
CCTAATGCCT 
GTGTTTGTCA 
ACATAAAGCT. 
TCTTAGAATT 
TGATT ATTCT 
ACATATGGCT 
TTGACAGAGA 
TCAGTCCTCA 
ACAGTTTCCT 
AATAGAAAAA 
ATCl'TTAGTG 
GTCATATTAC 




CTTTAAGGAT 
TCTTCTAGTC 
TCTGTGGACC 
GAGGTTTTCA 
AATTAAACAT 
CTAGTTGCAA 
GATCTGGAAT 
GTTGTTGATG 
TAGGATAACT. 
GTTCCACAGA 
AGCTTTTATT 
TACATAATTG 
TATTTAATTT. 
TGAGGGTTTT 
AAATTCTCTT 
CCACGAAATT 
ATGACTTCTT 
ACTTCAGAAG 
AGTCTATTTA 
TATCTTGCGT 
ATATATATAT 
CTTTTATGCT 
TGCTTTCTTA 
TTATGGTCTT" 
TCTAGGGTTG 
GGAAATTATT 
ACTAGAAAGC 
CAGTTTTACC" 
AAGAAAAAGG; 
TAAAGAGGAA 
ATATGTACTG 
GATTAATTCA 
TGGATTTTAC 
TCAGGTTGGT 
TTGAATATAA 
AAACAAAACT 
GAAGATAAAT 
GTGTGTGTTT 
AAGGGAGAGA 
TTCCGGTAAA 
TTTTGCCAAA- 
TGTATAAAAT 
AATATCATCT 
CAACAGAGGC 
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CAGAAGGATA 
AGTTGTCTTT 
CCTGAACTTT 
CTGTACAATA 
CAGTTCTGTG 
AAGTTCAGAG 

' GAATTAGGAT 
TTGAAAGAAC 
CCAGGACAAA 
GGAAGATTTA- 
ATACCAAAAG 
TCTAATTATG 

. CGCCCTCAAA 
TGTACATAGT 
ACTGAACTGA 
ACATCTTGGA 
TGGGCCGCTC 
ATCCTAGTTT 
TTCTCGCATA 
ATGAACATCA - 
AGACTATGCT 
GGTGTTTTGG 
TGAGTATCTG 
TGGTGCCTAC 
ACTAGTCTGC 
TAGAAAATGT 
GACTTCATTA 
GTTTTGAATG 
TTTGTTTTAT 
AATTTATAAG 
CAATCTTTAT 
ATTTTGTTGA 
CAAAGATTTT 
CTGTTTTGTT 

ATATATATTT 
GAAAATGGTC 
AACATTTCCG 
TTGATAGGAA 
ATATTAACAA 
TTAAGAAACA 
TTTCCCTATA 
AAAAGAAAAA 
AAAAAGAAGA 
CTTCAATTAT 
TTTTGTTTCC 
CATATGCTGT- 
ATATTTTTCT 
ACTAAACATT 
TTATGTAGCA 
TTTACTAGGG 
GTCTGGCTAC 
GAGTCTGTGT 
CACACACACG 
CTCCCCATGC 
TGTCAAAACA. 
TAGAAGTGCC 
GCATATCTCT 
AATAAAAGCT 



60 
120 
180' 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 ' 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 



Seq ID NO: 164 Protein . sequence : 
Protein Accession AAG00606 



51 



. 1 .11 21 31 41 

I I I I I I 

MNKLKSSQKD KVRQFMIFTQ SSEKTAVSCL SQNDWKLDVA TDNFFQNPEL YIRESVKGSL 
DRKKLEQLYN RYKDPQDENK IGIDGIQQPC DDLALDPASI SVLIIAWKFR AATQCEFSKQ 
EFMDGMTELG CDSIEQLKAQ IPKMEQELKE PGRFKDPYQF TFNPAKNPGQ KGLDLEMAIA 
YWNLVLNGRF KFLDLWNKFL LEHHKRSIPK DTWNLLLDFS TMIADDMSNY DEEGAWPVLI 
DDPVEFARPO IAGTKSTTV 



60 
120 
180 
240 



248 
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Seq ID NOt 165 DMA sequence 
Nucleic Acid Accession #: AF256215 
Coding sequences 220-2029 

1 ,11 21. 31 41 .51 • ; 

- I I 'I I 1 I • 

CTCCAGTCCG CATGCTCAGT AGCTGCTGCC G6CCGGGCTG CGGGGCGGCQ TCCGCTGCGC SO, 
GCCTACGGGC TGCGGTGGCG GCCGCCGCGG CACCCGGCAG GGCCCGCCAG TCCCCGCTTC 120 
OCTGCTCCAG AGCCGCCGCC TGGGCCGGGG CAGGGCGGGC CCGGGGCTCC TCCATGCTGC 180 
CAGCCGCCGG GCTGCGGAGC CGACCAAGTG GCTCCTGCGA TGGCGGOGGA AGAGGAGGCT • 240 - 
GCGGCGGGAG GTAAAGTGTT. GAGAGAGGAG AACCAGTGCA TTGCTCCTGT GGTTTCCAGC 300. 
CGCGTGAGTC CAGGGACAAG. ACCAACAGCT ATGGGGTCTT TCAGCTCACA CATGACAGAG 360 
TTTCCACGAA' AACGCAAAGG AAGTGATTCA GACCCATCCC AAGTGGAAGA TGGTGAACAC 420 
CAAGTTAAAA TGAAGGCCTT- CAGAGAAGCT. CATAGCCAAA- CTGAAAAGCG GAGGAGAGAT 480 
AAAATGAATA ACCTGATTGA AGAACTGTCT GCAATGATCC. CTCAGTGCAA CC CCAT GGCG 540 
CGTAAACTGG ACAAACTTAC AGTTTTAAGA ATGGCTGTTC, AACACTTGAG ATCTTTAAAA. 600 
GGCTTGACAA' ATTCTTATGT GGGAAGTAAT TATAGACCAT- CATTTCTTCA GGATAATGAG • 660 
CTCAGACATT TAATCCTTAA GACTGCAGAA -GGCTTCTTAT TTGTGGTTGG ATGTGAAAGA 720 
GGAAAAATTC TCTTCGTTTC TAAGTCAGTC." TCCAAAATAC ' TTAATTATGA TCAGGCTAGT 780 
TTGACTGGAC AAAGCTTATT TGACTTCTTA CATCCAAAAG ATGTTGCCAA AGTAAAGGAA 840 
CAACTTTCTT CTTTTGATAT TTCACCAAGA GAAAAGCTAA TAGATGCCAA" AACTGGTTTG 900 
CAAGTTCACA GTAATCTCCA CGCTGGAAGG- ACACGTGTGT ATTCTGGCTC AAGACGATCT. 950 

TTTTTCTGTC GGATAAAGAG TTGTAAAATC TCTGTCAAAG AAGAGCATGG ATGCTTACCC 1020 

AACTCAAAGA- AGAAAGAGCA CAGAAAATTC TATACTATCC ATTGCACTGG TTACTTGAGA 1080 

AGCTGGCCTC. CAAATATTGT . TGGAATGGAA GAAGAAAGGA : ACAGTAAGAA AGACAACAGT 1140 

AATTTTACCT GCCTTGTGGC CATTGGAAGA TTACAGCCAT ATATTGTTCC ACAGAACAGT 1200 

. GGAGAGATTA ATGTGAAACC AACTGAATTT, AT AACCCGGT TTGCAGTGAA TGGAAAATTT 1260 

GTCTATGTAG ATCAAAGGGC AACAG CGATT TTAGGATATC TGCCTCAGGA ACTTTTGGGA 1320 

ACTTCTTGTT ATGAATATTT TCATCAAGAT GACCACAATA ATTTGACTGA - CAAGCACAAA 1380 

GCAGTTCTAC AGAGTAAGGA GAAAATACTT; ACAGATTCCT ACAAATTCAG AGCAAAAGAT- 1440 

GGCTCTTTTG TAACTTTAAA AAGCCAATGG TTTAGTTTCA CAAATCCTTG " GACAAAAGAA 1500 

CTGGAATATA TTGTATCTGT CAACACTTTA GTTTTGGGAC ATAGTGAGCC TGGAGAAGCA 1560 

TCATTTTTAC CTTGTAGCTC . TCAATCATCA GAAGAATCCT CTAGACAGTC CTGTATGAGT' 1620 

GTACCTGGAA. TGTCTACTGG AACAGTACTT GGTGCTGGTA GTATTGGAAC AGATATTGCA' 1680 

AATGAAATT C • TGGATTTACA GAGGTTACAG TCTTCTTCAT ACCTTGATGA TTCGAGTCCA . 1740 

ACAGGTTTAA TGAAAGATAC TCATACTGTA AACTGCAGGA GTATGTCAAA. TAAGGAGTTG 1800 

TTTCGACCAA GTCCTTCTGA AATGGGGGAG- CTAGAGGCTA CCAGGCAAAA CCAGAGTACT 1860 

GTTGCTGTGC ACAGCCATGA GCCACTCCTC. AGTGATGGTG. CACAGTTGGA ■ TTTCGATGCC 1920 

CTATGTGACA- ATGATGACAC AGCCATGGCT. : GCATTTATGA : ATTACTTAGA AGCAGAGGGG 19B0 

GGCCTGGGAG ACCCTGGGGA CTTCAGTGAC ATCCAGTGGA CCCTCTAGCC TTTGATTTTT 2040 

AACTCCAAAA ATGAGAAACA" TTTTAAAGCA ■, TTATTTACGA AAAAACTGTC- TCAACTATTC 2100 

TTAAGTACTG TATTGATATT GTTTGTATCT TTTATTAATG TTCTACCACT TTTTATAGAT 2160 

TTGCATCTTC CTGTCACAGG GATGTGGGGA' AATACGTT7T CCTCCCAAGA GAACCAAGTT 2220 

TATTATAGAC TCCTTTATTC. AGTGAAATGG CTTATAATCC ACTAGTTGCC ATATTTTTGC 2280 

TAAAATATTT, CTAACCAAGA ' ATACTACTTA CATATTGTTT TGGCTTTGTT TTATTTTTGA - 2340 

TGCAGTTTTT- TTTAGTTGAG GTAATGTAAT ATATTGATGT TTTCCTTTGT GTCTAAGATT 2400 

GATTTATAAT aGTAGGTTTG TATAATTTGG AACATTTTCC ATGCCTTGCG AATTTCCTTA 2460 

ATTGAGGATA GGGCTTACAC ACTTTAAGAA AACAG TGAGT ACTTGAACAT TTAAAGGGAC 2520 

AGTGCAATTT ATAGTCATAA TCACATTGAA -TACTGTATTT GATCTTTGGA GACTTAGGCA 2580 

AGCACAGAGC TGGGATATTT- ATGCTCAGTT GAGCACTTTA AGATGAATTT. TAAGTGAGAT 264 0 * 

GATTTCTTGC TTAAAACTCA GAAAGTCAAA AGAGTTTCAG CTTTCCTTAC AGAAAAGGAA 2700 
GGATCTTGGG CCCTAGATCT TGGGGATTAA CCTCTGCATA, TAAGATTTAC TCTTAATAGG .2760 

CCAGACGTGG TGCTCACGCC TGTAATCCCA GTACTTTGGG AGGCTGAGAC GGGCAGATCA- 2820 

CTTGAGGTCA GGAGTTCAAG ACCAGCCTGG CCAATATGGT GAAACCCCGT TTCTACTAAA • • 2880 

AATACAAAAA AAATTACCCA GGCACTCACT; CTTGAGGTAA CTAACCAACT CCCACGATAA- 2940 

TGACAGTCCA- TTCATGAGCG- CAAAGGCCTC ATGACCTAAT . GGCACACACC TGTAATCCCA 3000 

ACTGCTTGGG AGGCTGAGGC GAGAGGATTG, CTTGAACCTG GGAGGCAGAG GTTGCAGTGA 3060 

GCCGAGATCG CACCACTGCA- CTCCAGTCTG GGCAACAGAG . TGAGACTTCA ■ TCTCAAAAAA 3120 

AGTAAAAAAA AAGATTTAAT ATAATCACTG AAGATCTCTA TTATAGATAG ATTAGGTTTT 3180 

TGACATTGGA AACATACTTA GGGATAGATT TGTCCTAAAG GAAAAAAGTA GGCCCGGGCA 3240 

GA1TAAATGT CTTGTGTAAA GTCACACATT AAATTCAGTC ACACATTAAA TTCATAGAGT 3300 

TTTAAATGTT TAATGTATAT AAACCAGTTT CTTTATACAC ATTTGGGAAA ACATTGGTCT 3360 

CACAGATTAA ATGATT AACT AACTGACCCA GGAACT AGTT . GTAGCTTTCT AAGTAATTAG 3420 

GCAATTACAG TTATTGCCTG TAACCAAAGG TAATAAAACA AAATGACAAG TACATGTTTA 3480 

AAATTATGAG GCAATGAGAA ATAATTTAAA AACCAATTTT CTAGTTATAA TTTAAAATTT 3540 

GGAGAGCATT TTTAACAGTA ATTAATCCAG AGGTGGCTCA AATTGAGTAT A AGAA TTAAG 3600 

ATTATTTAAA ATACTGCATG TCTACCTTCT CGGGGATCAT ACTTTATAAC ACTTTCTGCT 3660 . 

TCAGTAGCTC TTCATAGCTT GCCAAGTATG CTCCCATATT TTCTCTCTCG TGCCTCGCAA ■ 3720 

ATGAAAGTCA GATAGGCTGG GAACTCATGG GGCAGCCCTC AGACTTCAAT GTGGGCTTCA 3780 

AATCCAGTTT CCTGTTCTAT ATGGTGCTAC ATCTTTCCAG AAAATTTCCC TCAGAGCCCC 3840 

TCGCCAAAAC AAAGCATTAT TTTGACCCTG' CATGCTATTT CTTTAGCTGT AGGTGATAGA 3900 

TTAGAACTTC TGTCAGACAT GTTAATGACA AACATACCAA- CAGACAATAA CCAAAGCAAA 3960 

TGTTTCCTTC AAGTGTGAAA TGTGCAGGGG CTCGTGGGCA: AGGATGTATT GGCACACTGT 4020 

CCTCTTGAAC TGATAGTGTC CCAGCAATGT TGGAGGTTGG- CACCATTCCT GGTCCGACAC 4080 , 

TTGAGGACCT GAGAGACATC AGGTTTAGAA TGAGCCAAAG AAATCCTACA AGATGGGGAG. 4140 

AATTGGTGTG CAGCAGCCTA AGTGTTATAG TTAAGTCTAA AGAAGTATGA AAGATCCCCT 4200 

GTGTTCTCTA AATTGAGCAG AGGGGCCTGC CTACCAATAT" CACTTTTTAG GGGACTGAAC 4260 • 

CATTGCAGGT TAGACTTGGC TTCCAAAGAG TCTGCCTAAG CCAGGGGTGG CAGGGTAGGC " 4320. 

CATCATAGCT GGATGGCCTC AAAAGCAGAT GGGGGCAGAC TTGCCCTCGT- GATGCCAGGA 43 80 

TTTGAGAGGC AGAGTTTCTA GAGGGAGACC AGTGCTGCCT CTCACAGTGG CAGTTTTTTC 4440 

TCTTTGCAAG AGGAGGGGCT GTTCAATTCC ATAGACCAGT GGGCAGATAG CCAGTTGAAT 4500 . 

ACTCTGTGCA TGGTTTGATC CTTTATTAGT TCGCTCTAAT ATTTTTCTGT AGATCCTTTT 4560 

GTCCTGGACT CAAAATCTAA TCCATGCATT GTATGATACC GTAGCTCTCC TAAGGTTTGT. 4620 

GTTTCCTTCA AAATGTTTTA GTTTTCTTCA ACTAAATTTG ATTTTTGCTG TTAGAAGTGA 4680 

CATATTTTTA TGGTATACAC TATGTTCCTT TTTTCTACTG CGAGTCAATT. TTTTGAATTT 4740 

TCGTGAGAAA GAATATATCT ACAAATTGCA CGAAAGTATC ATAAAAACAG TACT CT AG AG 4 800 
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WO 02/086443 

CAGCGCTGTC CAATAGAAAT ATAATCTGAG 
ACATTAAAGA AGTAAAAAGA TACAAGTAGA 
CAAAATATCA TTTGAACATG TAATTAATAT 
TTGGTAATAC TAGTCTTCAA AATCTGGTAT 
GTACTAGCCA CATTGCAAGT GCTCAGTAGC 
AGCACAGTTC TAGGTTCCAC CCTAACACCC 
AGCTGGAAGT AAACATAGAG ATCAAACCTC 
. GTTTAAATGG CTTGCATGAG GTCATACAGC 
CCAGGCACTC TTCCCACTCC ACTACATTAC 
TGTAGAGTAG GCCGGGCGCA GTGGCTCATG. 
GTGGGOGGAT. CACGAGGTCA GGAGATCGAC 

• CTCTACTGAA AATACAAAGC AAAATTAGCC 
CTGCTCTGGA GGCTGAGGpfr GAATGGCGTG 
AGATCGCGCC ACTGCACCCC AGCCTGGGCG 
AAAAAAAAAA AAGAAAAGAA ; AAGAAAAGTC 

. GAAGTAGACC AAAGTTTATA- CCATAAGGAT 
AATTATTT AT . TGATCCTTGA- ATCTGTAAGA 

• AATTTAACCT: TTTGAAAATA ATAAACTTTA 
. CTTGGAATCA AGTGAAATGA GTTATATGGT. 

CAAAGACAAA CAGGAAAGTA CAGAATAGAG 
AAGTGTTTAT TTACAGTGTC ACGACAGAAA- 
GATGTCCGTA AAATCTGGGG CACAGGTACA- 
TGTTTAGTAG- TGTCCAGTTT CAGATCATGC 
CATCACTGAG CCCTGG AATT . GGAGACTCAT 
GGCCGACATC TATGATTAGC: TAGAAGCCAT 
CCACTTTTCT GTTTTTGTAA TGCTTTCATT 
GCCTATGAGA GGCATTTATG ATTTTTGTGC 
TGTTTTATGA GAAATGCTTT CCAAGGGAGG 
GGCTTAGAGA GCTTTCCAGG TGTAGTGCCA' 
CAGCACGGAA 'CATGCTTTCT GAACTCACTT 
TATTCTTGAG TTTAGATTTG ■ TCTTTTATAC 
»■ CTCGTCTGTA TATTGGTATT TTTAAATTTT 
ATTTTATAAT TACTCATTTG TAGTTTTTTT 
GCTCCCTTAA AA 



Seq ID NO: 166 Protein sequence j 
Protein Accession .AAG34652 



PCTAJS02/12476 



CCACATGTAT 
ACTAATTTTA 
AAAATTATTA 
GTATCTTACA 
CACATGTGGC 
AAGTCCTGTG 
CTTTTAAAAA 
TAAATTCAGC 
TGTAGTGGTA 
CCTGTAATCC" 
ACCATCCTGG 
AGGTGTGGTG 
AACCCAGGAG. 
ACAGAGCGAG 
TAGAGAACAT 
ATTTTTCCTT 
TCAAATAACA 
AAATATCAGA 
CATCACTAAA 
ACTTTTAGTA' 
AGGATGTCTT 
AGAAA7AGCC 
TGCCAAGAGG" 
ACTTGCCCAG 
AAAGAAAAGC 
AGCAGATCTT 
CTACAATAAG 
TCTAGGAAGA' 
ATAAAAACTG 
GAGAGTGTAT 
AATTTTTAGC 
TGTGGTAAAT 
TTTTAATTTA 



AATTTTATTT 
ATGTTTTAAT 
ATGTGATATT 
TTGATAGCAC 
TAGTGGCTAC 
GATTAGAATC 
TGAGGACGCT 
CTCAACAGGG 
ATTCTTAGGG 
CAGCACTTTG 
CCAACATGGT 
GCGGGCGCCT 
GCAGAGATGG 
ACTCCATCTC 
TATATTAAGT 
AAATACCATG 
AGTCTCTATC 
TGTGTTATTA 
TTTAGAAATC 
AATAAATGGA 
TGTTGTCATA 
AATATTTAGT 
TATCTCCCCC 
CACAATGTTA 
TGCTAAGTGG 
TTTTTTCCAA 
TCAGCCTGTC 
TCCTGACACA 
ACCTGGAAAG 
GGTGTATGTC 
TCTTTTCCAG 
AATGAAAAGA 
ATAAACTTCC 



TCTTCTAGCC 

TCAGTATATC 

TTACATTCTT 

ATCTCACTTT 

TGCACTGGAC 

CCAGAATCAG 

GAGGCACAGA 

TCTTCTGATT 

TTAAAAAAAG 

GGAGGCCGAA 

GAAACCCCGT 

GTGGTCCCAG . 

CAGTGAGCCA 

AAAAAAAAAA' 

GGTTATTATT 

TTTGAAGAAC 

CATGTTACCA 

CAGGATGATA; 

TATTGTGAAA 

ATTTAAAAGA 

GTCTTTGAGG 

TCCCAGACCA 

TCAGGTGGGT 

CGGGCAGACA 

CCACTAGGTG 

GCTCCATGGG 

TGGTGTGAGT 

TAAGAACTTT 

AAAACCTGCC 

ACTTCTCATAl 

TTCACTTGTG 

GTGAAATTAT 

TCCAAAAAGT 



4660 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

54 00 

5460 

5520. 

5580 . 

5640 

5700 

5760 

5820 

5860' 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 



MAAEEEAAAG 
QVEDGEHQVK 
QHLRSLKGLT 
LNYDQASLTG 
YSGSRRSFFC 
NSKKDNSNFT 
LPQELLGTSC 
TNPWTKELEY 
SIGTDIANEI 
TRQNQSTVAV. 



11 . 

I 

GKVLREENQC 
MKAFREAHSO; 
NSYVGSNYRP. 
QSLFDFLHPK 
RIKSCKISVK; 
CLVAIGRLQP- 
YEYFHQDDHK 
IVSVNTLVLG 
LDLQRLQSSS 
HSHBPLLSDG 



21 



IAPWSSRVS 
TEKRRRDKMN 



DVAKVKEQLiS 
EEHGCLPNSK.' 
YIVPQNSGEr 
NLTDKHKAVI* 
HSEPGEASFL 
YUDDSSPTGL 
AQLDFDALCD 



31 

PGTRPTAMGS 
NL I EEL SAM I 
LILKTAEGFL 
SFDISPREKL 
KKEHRKFYTP 
NVKPTEFITR 
QSKEKII.TbS : 



41 

I. 

FSSHMTEPPR 
PQCNPMARKL 
FWGCERGKI 
IDAKTGI/QVH 
HCTGYLKSWP 
FAVNGKFVYV 
YKFRAKDGSF 
SRQSCMSVPG 



MKDTHTVNCR 
NDDTAMAAFM 



NYLEAEGGLG 



Seq ID NO: 167 DNA sequence 
Nucleic Acid Accession ft * nm_014400 
Coding sequence: 86-1126 



l 

I 

GGTTACTCAT 
GACGCCAAGG 
GATCTGGACT 
GTGCTACAGC 
GAAGTGCGCG 
CGGACAATTC 
CGGCCTGGATr 
CTGCAACGCC 
ATACCCGCCC 
GGGTACATCG 
CTTCGACGGC 
CTGTGTCCAG 
TGGCTCCTGT. 
CCCTCGAATC 
CACATCTGTC 
GCCAGCGCCA 
GGAGCCCAGG 
TCCTGCAAAA. 
ATTGGCAGCC. 
AAATTTCCCT 
CCCACCACTG 
. CTTCTGCTGC 
GGGTGTTCTA 
TCCTCTTGTG 
AGGATGCTAA. 
GGTGGGACAA 
. ATCGGTTCCC 
. CTTATGTCTG 



11 

I. 

CCTGGGCTCA 
GAGCAGGACG 
GCAGGCTGGC 
TGCGTGCAGA' 
CCGGGCGTGG 
TCGCTGGCAG 
CTTCACGGGC 
AAGCTCAACC 
AACGGCGTGG 
CCGCCGGTCG 
AACGTCACCT- 
GATGAATTCT. 
TGCCAGGGGT 
CCACCOCTTG 
ACCACTTCTA 
ACCAGTCAGA 
TTGACTGGAG' 
GGGGGGCCCC. 
CTTCTGTTGG 
CTCACCTACT 
GACTGGGCTG 
GCTGGTTTGC. 
GCTTTTTGAG 
ATGTT AGGAC * 
GCTTCCTACT 
TGGCTCCCCA 
CATATGTCTT 
TGTGTGATCA 



21 : 
I- 

GGTAAGAGGG 
GAGCCATGGA 
TGCTGGTGCT 
AAGCAGATGA 
ACGTCTGCAC 
TGCSGGGTTG 
TTCTGGCGTT 
TCACCTCGCG 
AGTGCTACAG 
TGAGCTGCTA 
TGACGGCAGC 
GCACTCGGGA 
CCCGCTGTAA 
TCCGGCTGCC 
CCTCGGCCCC 
CTCCGAGACA 
GCGCCGCTGG 
AGCAGCCCCA 
CCGTGGCTGC 
TCTCTGGCCC 
GCCCAGCCCC 
GGCTTTGGGA 
GACAGCTCCT 
AGAGTGAGAG 
CACTTTCTCC 
CTCTAAGCAC 
CCTTACTAGA 
GTTTCTGGCA 



31 
I 

CCCGAGCTCG 
CCCCGCCAGG 
GCTGCTTCGC 
CGGATGCTCC 
CGAGGCCGTG 
CGGTTCGGGA 
CATCCAGCTG 
GGCGCTCGAC 
CTGTGTGGGC 
CAACGCCAGC 
TAATGTGACT 
TGGAGTAACA 
CTCTGACCTC 
CCCTCCAGAG 
AGTGAGACCC 
GGGAGTAGAA 
CCACCAGGAC 
TAATAAAGGC 
TGGTGTCCTA 
TGGGTACCCC 
TGTTTTTCCA 
AATAAAATAC 
GTATCCTTCT 
AAGTCAGCTG 
TAGCCAGCCT 
TGCCTCCCCT 
CTGTGAGCTC 
CATAAATGCC 



41 

I 

GAGGCGGCAC 
AAAGCAGGTG 
GGAGGAGCGC 
CCGAACAAGA 
GGGGCGGTGG 
CTCCCCGGCA 
CAGCAATGCG 
CCGGCAGGTA 
CTGAGCCGGG 
GATCATGTCT 
GTGTCCTTGC 
GGCCCAGGGT 
CGCAACAAGA 
CCCACGACTG 
ACATCCACCA 
CACGAGGCCT 
CGCAGCAATT 
TGTGTGGCTC 
CTGTGAGCTT 
TCTTCTCATC 
ACATTCCCCA 
CGTTGTATAT 
CATCCTTGTC 
TCACGGGGAA 
GGACTTTGGA 
ACTCCCCGCA 
CTCGAGGGCA 
TCAATAAAGA 



51 

r 

KRKGSDSDPS 

DKLTVLRMAV 

LFVSKSVSKI 

SNLHAGRTRV . 

PNIVGMEEER 

DQRATAILGY 

VTLKSQWPSP . 

MSTGTVLGAG 

SPSEMGELEA 

DPGDFSDIQW 



51 
I 

ACCCAGGGGG 
CCCAGGCCAT 
AGGCCCTGGA 
TGAAGACAGT 
AGACCATCCA- 
AGAATGACCG 
CTCAGGATCG 
ATGAGAGTGC 
AGGCGTGCCA 
ACAAGGGCTG 
CTGTCCGGGG 
TCACGCTCAG 
CCTACTTCTC 
TGGCCTCAAC. 
CCAAACCCAT 
CCCGGGATGA 
CAGGGCAGTA 
CCACAGCTGG 
CTCCACCTGG 
ACTTCCTGTT 
GTATCCCCAG 
ATTCTGGCAG 
TCTCCGCTTG 
GGTGAGAGAG 
GCGTGGGGTG 
TCTTTGGGGA 
GGGACCGTGC 
TTTAATTACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
.1080 
1140 
1200 
1260 
1320. 
1380 
1440 
1500 
1560 
1620 
1680 
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WQ 02/086443 

TTGTATAGTG AAAAAAAA - 

Seq id NO: 168 "Proteiri.seque'ncei ! 
Protein Accession *ftV.NP_0552iS . 



PCT/US02/12476 



I 

MDPARKAGAQ 
CTEAVGAVET 
SRALDPAGNB 
AANVtVSLPV 
LPPPEPTTVA 1 
AGHQDRSNSG 



11 . . 
I - 

AMIWTAGWLL 
IHGQFSLAVX 
SAYPPNGVEC 
RGCVQDEFCT 
STTSVTTSTS 
QYPAKGGPQQ 



21 
I 

LLLLRGGAQA 
GCGSGLPGKN- 
YSCVGIiSREA 
RDGVTGPGFT 
APVRPTSTTK 
PHNKGCVAPT 



31 
I 

LECYSCVQKA 
DRGLDLHGLL 
I OQGTSPPWS 
LSGSCCQGSR 
PMPAPTSQTP 
AGLAALLIAV 



41 51 

I I : 

DDGCSPNKMK TVKCAPGVDV 
AFIQLQQCAQ DRCNAKLNLT 
CYNASDHVYK GCFDGNVTLT 
CNSDLRNKTY. FSPRIPPLVR 
RQGVEHEASR DEEPRLTGGA 
AAGVLL 



60 
120 
180" 
240 
300 



Seq ID NO.- 169 DNA* sequence 
Nucleic Acid Accession #: NMJ)06875 
Coding sequence: 186-1190 



GAATTCGGCA 
CACCAGTTTC 
CCCGGGCGTC 
CCTCCATGTT: 
CGCCAGGAGG 
GTAAGGGGGG: 
CCATCAAAGT- 
CATGCCCACT. 
TGATCCGCCT; 
CTTTGCCCGC- 
CAAGCCGCTG 
TTGTCCATCG. 
AACTCATTGA 
GGACAAGGGT 
CCACTGTCTG 
AGAGGGACCA 
GCTGTGCCCT 
AGATCCTGCT 
AAAGGAGGCC 
TGGCCCCCAA 
GTTGACTTGG 
ATTGAGGATC 
CAAAGGAGCC 
CTCATTTTGC 
TTATTTTGAT 
TCATATGCTT 
AGTAAAGGGA 
TCAGCCCAGG' 
TTTTTTTTTT 
CTGGTGAGAA 
CACCACCAGA 
GCTTGCTGTT 
CTGAGCCGGG 
TCCAAGTGTG 
CCACTATTTA 



- 11 
I 

CGAGCGCGCG 
TCTGCTTTCC 
CACGCCCTGC 
GACCAAGCCT 
CAAGGATCGG 
CTTTGGCACC 
GATTCCCCGG 
CGAAGTCGCA 
GCTTGACTGG 
CCAGGATCTC 
CTTCTTTGGC 
TGACATCAAG 
TTTTGGTTCT: 

gtacagcccc 
gtcactgggc. 
ggagattctg; 
aatccgccgg. 
ggacccctgg 
ctgccccttt 
tggtcagaac 
ttttacaggt 
aggggttaga 
ttcctcccag 
taaggaagtt 
gatgtgtcac 
ttacttgggc 
ccctttcccc 
attttttatt 
tttttttttg 
gaaccttaat 
caataggatg 
tgttttcctg 
attgtccaat 
ccctcctttt 

ATAAAAGTAft 



21 . 
I 

GCGAATCTCA 
ACCCTGGCGC 
GGGCTTAGCG 

ctacaggggc 
gaagcgttcg; 

GTCTTCGCAG 
AATCGTGTGC 
CTGCTATGGA- 
TTTGAGACAC 
TTTGACTATA 
CAAGTAGTGG. 
GATGAGAACA 
GGTGCCCTGC 
CCAGAGTGGA- 
ATCCTCCTCT 
GAAGCTGAGC 
TGCCTGGCCC 
ATGCAAACAC 
GGCCTGGTCC. 
AGCCATCCCA 
CATTACCAGT 
AGACATAAAC - 
AACCTGTGGT 
TATTTTGGTG'. 
CCCACATTGG: 
AAGGGTGCTT 
TAGCCTAGGG 
TTGGGGGAGG 
GGTGAGGGGA 
TCCATAATTT 
GGATGGATGG- 
GGGCGCTCCC 
TACTAAAATG 
TTTTCCTGCX 
TAGAATCAGA 



31 

.1- 

ACGCTGCGCC 
CCCCCAGCCC 
GGTTCAGTGG 
CTCCCGCGCC 
AGGCCGAGTA 
. GACACCGCCT 
TGGGCTGGTC 
AAGTGGGTGC 
AGGAAGGCTT 
TCACAGAGAA 
CAGCCATCCA 
TCCTGATAGA 
TTCATGATGA 
TCTCTCGACA 
ATGACATGGT 
TCCACTTCCC 
CCAAACCTTC 
CAGCCGAGGA 
TTGCTACCCT 
TGGCCATGTC 
CATTAAAGTC 
CAAGTTTGCC 
CCCTGATTTT'. 
AAGTTGTTCC 
CACCTCCTAC 
TCCTTCCAAT 
TCCCATATTG 
TAATGCCCTG 
CCCTACTTTG'; 
GGGAAGGAAT 
TTTTTTGGGG 
TCCAATTTTG 
TAAATAATCA 
TGGATTATTT- 
AAAAAAAAAA 



41 
I 

GTCTGOGGGC 
TGGCTCCCCA 
GCTCAATCTG 
CCCCGGGACC 
TCGACTCGGC 
CACAGATCGA- 
CCCCTTGTCA 
AGGTGGTGGG 
CATGCTGGTC 
GGGCCCACTG 
GCACTGCCAT 
CCTACGCCGT 
ACCCTACACT 
CCAGTACCAT 
GTGTGGGGAC 
AGOCCATGTC 
TTCCCGACCC 
TGTTACCCCT 
AAGCCTGGCC 
ACAGGGAT AG - 
CAGTATTACT 
CAGTTCCCTT 
GGAGGGGGAA 
CATTTTGAGC 
TACCACCACA 
ACCCCAGTAG 
GGTCAAGCTG 
TTGTTACCCC 
TTATCCCAAG 
GGAAGATGGA 
GATGGGCTAG 
CAGATTTTTG 
CGTATTGTGG 
AAAAAGCCAT 



51 
I 

GCTTCCGGGC 

GCTGCGCTGC 

CGCAGCGCCA 

CCCACGCCGC 

CCCCTCCTGG 

CTCCAGGTGG 

GACTCAGTCA 

CACCCTGGCG 

CTCGAGCGGC 

GGTGAAGGCC 

TCCCGTGGAG 

GGCTGTGCCA 

GACTTTGATG 

GCACTCCCGG 

ATTCCCTTTG 

TCCCCAGACT 

TCACTGGAAG- 

CAACCCCTCC 

TGGCCTGGCC 

ATGGACATTT 

AAGGTAAGGG 

CCCAATCCTA 

CTTCTTGCTT 

CCCGGGACTC 

CAAACTTAGT 

CTTTTATTTT 

CTTACCTGCC 

AAGGCTTCTT 

TGCTCTTATT 

CACCACOGGA 

GGGAAATAAG 

CAACCTCCTC 

GGAGGGGAGT 

GTGTGGAAAC 



60 
120* 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



Seq ID NO: 170 Protein sequence: 
Protein Accession *h NP_006866 



I 

MLTKPLQGPP 
KVIPRNRVLG 
PAQDLFDYIT 
IDFGSGALLH 
DQEILEAELK 
RPCPFGLVLA 



11 

r 

APPGTPTPPP 
WSPLSDSVTC 
EKGPLGEGPS 
DEPYTDFDGT 
FPAHVSPDCC 
TLSLAWPGLA 



21 
I 

GGKDREAFEA 
PLEVALLWKV 
RCFFGQWAA 
RVYSPPEWIS 
ALIRRCLAPK 
PNGQKSHPMA 



31 



41 

I 



EYRLGPLLGK GGFGTVFAGH 
. GAGGGKPGVI RLLDWFETQE 
IQHCHSRGVV HRDIKDENIL 
RHQYHALPAT VWSLGILLYD 
PSSRPSLEEI LLDPWMQTPA 
MSQG 



51 
I 

RLTDRLQVAI 
GFMLVLERPL 
IDLRRGCAKL - 
MVCGDIPFER 
EDVTPQPLQR 



60 
120 
180 
240 
300 



Seq ID NO: 171. DNA sequence 
Nucleic Acid Accession NM_003646 
Coding sequence: 89. .2875 ' 



1 
I 

GCGGCGCGGA 
GCCCTCCGCC 
GAGCAGCGAC 
CGAGCCGGAC 
CGGGCACAGG 
CCCTGGGGCC 
GTCAGCGACA 
CGTTGGGGAG 
AGCCTGCAAG 
CTGTAAGCCG 
GCACCACTGG 
CCAGCAGAAG 



11 

I • ' 

GCGGGCGTGC 
GGCCGGGGCT 
TCCGAGTCGG 
AAGGCGCCGC 
-AAAGCCATCA 
CCGTGCAGCG 
TATGGGGAGC 
CAGTACTGTG 
ATTGTGGTGC 
TCCTTCCGTG 
GTACACAGAC 
TTCACCTTCC 



21 - 

I ; 

TGAGCCCCGG 
AGGGCCGGAT 
CTTCCGCCTC 
GGCGACTCAA- 
CCAAGTCGGG. 
AGTCAGAGCG 
ACATCTGGTT 
TAGCCAGGAT 
ACACGCCCTG 
AATCAGGCTC 
GACGCCAGGA 
ACAGCAAGGA 



31 

;i 

CCGCCGGCCC 
GGAGCCGCGG 
GTCCAGCGGC 
.CAAGCGGCGC 
CCTCCAGCAC 
GCAGATCCGG 
CGAGACCAAC 
GCTGAAGTCA 
.CATCGAGCAG 
CAGGAATGTC 
..CGGCAAGTGT 
GATTGTGGCC 



41 

I 

GGCATGGGCG 
GACGGTAGCC 
TCCGAGCGCG 
TTCCCGGGGC 
CTGGCCCCCC 
AGTACAGTGG 
GTGTCCGGGG 
GTGTCTCGAA 
CTGGAGAAGA 
CGCGAGCCAA 
CGGCACTGTG 
ATCAGCTGCT 



51 
I 

TCTCCCGCGG 

CCGAGGCCCG 

ACGCCGGTCC 

TGCGGCTCTT 

CTCCGCCCAC 

ACTGGAGCGA 

ACTTCTGCTA ' 

GAAAGTGCGC 

TAAATTTCCG 

CCTTTGTACG 

GGAAGGGATT" 

CGTGGTGCAA 



60' 
120 
180 
240 
300 
360 
420 
480 
540- 
600 
660 ' 
720^ 



251 



WO 02/086443 " ' : " . 

GCAGGCATAC CACAGCAAGG TGTCCTGCTT CATGCTGCAG CAGATCGAGG AGCCGTGCTC 780 
GCTGGGGGTC CACGCAGCCG TGGTCATCCC GCCCACCTGG ATCCTCCGCG CCCGGAGGCC 840 
CCAGAATACT CTGAAAGCAA GCAAGAAGAA . GAAGAGGGCA .TCCTTCAAGA GGAAGTCCAG 900 
CAAGAAAGGG CCTGAGGAGG GCCGCTGGAG ACCCTTCATC ATCAGGCCCA CCCCCTCCCC . 960 
5 GCTCATGAAG CCCCTGCTGG TGTTTGTGAA CCCCAAGAGT GGGGGCAACC AGGGTGCAAA - 1020 
GATCATCCAG TCTTTCCTCT GGTATCTCAA TCCCCGACAA GTCTTCGACC TGAGCCAGGG 1080 
AGGGCCCAAG GAGGCGCTGG AGATGTACCG CAAAGTGCAC AACCTGCGGA TCCTGGCGTG 1140 
' CGGGGGCGAC GGCACGGTGG GCTGGATCCT CTCCACCCTG- GACCAGCTAC GCCTGAAGCC ■ 1200 
• * • GCCACCCCCT GTTGCCATCC TGCCCCTGGG TACTGGCAAC GACTTGGCCC GAACCCTCAA 1260 
10 ■ ' CTGGGGTGGG' GGCTACACAG ATGAGCCTGT GTCCAAGATC. CTCTCCCACG. TGGAGGAGGG 132a 
GAACGTGGTA* CAGCTGGACC GCTGGGACCT* CCACGCTGAG CCCAACCCCG AGGCAGGGCC 1380 
' TGAGGACCGAr GATGAAGGCG: CCACCGACCG GTTGCCCCTG GATGTCTTCA - ACAACTACTT 1440 
CAGCCTGGGC TTTGACGCCC ACGTCACCCT GGAGTTCCAC GAGTCTCGAG AGGCCAACCC . 1500 
AGAGAAATTC AACAGCOGCT TTCGGAATAA GATGTTCTAC GCCGGGACAG CTTTCTCTGA 1560 . 

15 CTTCCTGATG GGCAGCTCCA- AGGACCTGGC CAAGCACATC CGAGTGGTGT- GTGATGGAAT 1620 
GGACTTGACT CCCAAGATCC AGGACCTGAA ACCCCAGTGT GTTGTTTTCC TGAACATCCC 1680 
CAGGTACTGT. GCGGGCACCA- TGCCCTGGGG CCACCCTGGG GAGCACCACG ACTTTGAGCC " 1740 
CCAGCGGCAT GACGACGGCT ACCTCGAGGT CATTGGCTTC ACCATGACGT CGTTGGCCGC 1800 
GCTGCAGGTG GGCGGACACG. GCGAGCGGCT GACGCAGTGT CGCGAGGTGG TGCTCACCAC I860 

20 ATCCAAGGCC ATCCCGGTGC AGGTGGATGG CGAGCCCTGC AAGCTTGCAG CCTCACGCAT 1920 
CCGCATCGCC CTGCGCAACC AGGCCACCAT GGTGCAGAAG- GCCAAGCGGC GGAGCGCCGC 1980 
CCCCCTGCAC AGCGACCAGC AGCCGGTGCC AGAGCAGTTG CGCATCCAGG, TGAGTCGCGT 2040' 
CAGCATGCAC GACTATGAGG CCCTGCACTA CGACAAGGAG CAGCTCAAGG. AGGCCTCTGT 2100 
GCCGCTGGGC ACTGTGGTGG. TCCCAGGAGA CAGTGACCTA GAGCTCTGCC GTGCCCACAT 2160 • 

25 • TGAGAGACTC CAGCAGGAGC CCGATGGTGC TGGAGCCAAGi TCCCCGACAT- GCCAGAAACT 2220 
GTCCCCCAAG; TGGTGCTTCC.. TGGACGCCAC CACTGCCAGC CGCTTCTACA GGATCGACCG 2280 
AGCCCAGGAG, CACCTCAACT:" ATGTGACTGA GATCGCACAG GATGAGATTT ATATCCTGG A : 2340 
CCCTGAGCTG CTGGGGGCAT- CGGCCCGGCC TGACCTCCCA ACCCCCACTT CCCCTCTCCC - 2400 
CACCTCACCC TGCTCACCCA CGCCCCGGTC' ACTGCAAGGG GATGCTGCAC CCCCTCAAGG? 2460 

30 - TGAAGAGCTG ATTGAGGCTG CCAAGAGGAA CGACTTCTGT AAGCTCCAGG AGCTGCACCG . ■ 2520 
AGCTGGGGGC GACCTCATGC ACCGAGACGA GCAGAGTCGC ACGCTCCTGC ACCACGCAGT 2580 
CAGCACTGGC AGCAAGGATG TGGTCCGCTA CCTGCTGGAC CACGCCCCCC CAGAGATCCT 2640 
TGATGCGGTG*. GAGGAAAACG' GGGAGACCTG TTTGCACCAA GCAGCGGCCC TGGGCCAGCG 2700 
. CACCATCTGC CACTACATCG TGGAGGCCGG GGCCTCGCTC ATGAAGACAG ACCAGCAGGG 2760 

35 CGACACTCCC CGGCAGCGGG'- CTGAGAAGGC TCAGGACACC GAGCTGGCCG CCTACCTGGA. 2820 
GAACCGGCAG CACTACCAGA TGATCCAGCG GGAGGACCAG GAGACGGCTG TGTAGCGGGC 2880 



PCT/US02/12476 



40 
45 
50 
55 
60 
65 
70 
75 
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85 



Seq ID NO: 172 Protein sequence: 
Protein Accession #: NP 003637 



MEPRDGSPEA 
GIjQHLAPPPP 

- MIjKSVSRRKC 
DGKCRHCGKG 
PPTWILRARR 
NPKSGGNQGA 
LSTLDQItRLK 
LHAEPNPEAG 
KMFYAGTAPS 
GHPGEHHDFE 
GBPCKLAASR 

■ YDKEQLKEAS 
TTASRFYRID 
SLQGDAAPPQ 
YLLDHAPPEI 
AQDTBLAAYL 



11 
I 

RSSDSESASA 
TPGAPCSESE 
AACKIWHTP. 
FQQKPTFHSK 
PQNTLKASKK 
FCIIQSFLWYIi 
PPPPVAILPIi 
PEDRDEGATD 
DFLMGSSKDL 
PQRHDDGYLE 
IRIALRNQAT 
VPLGTVWPG 
RAQEHLNYVT. 
GEELIEAAKR 
LDAVEENGET 
ENRQHYQMIQ 



21 
1 

SSSGSERDAG 
RQIRSTVDWS 
CIEQLEKINF 
EIVAISCSWC 
KKRASFKRKS 
NPRQVFDLSQ 
GTGNDLARTL 
RLPLDVFNNY 
AKHIRWCDG 
VIGFTMTSLA 
MVQKAKRRSA 
DSDLEIiCRAH 
EIAQDEIYIL 
NDFCKLQELH 
CLHQAAALGQ. 
REDQETAV 



31 
I 

PEPDKAPRRL 
ESATYGEHIW 
RCKPSFRESG 
KQAYHSKVSC 
SKKGPEEGRW 
GGPKEALEMY- 
NWGGGYTDEP 
FSLGFDAHVT 
MDLTPKIQDL 
ALQVGGHGER 
APLHSDQQPV- 
IERLQQEPDG 
DPELLGASAR 
RAGGDLMHRD 
RTICHYIVEA 



41 

I 

NKRRPPGLRL 
FETNVSGDFC 
SRKVREPTPV 
PMLQQIEEPC 
RPPIIRPTPS 
RKVHNLRItA 
VSKILSHVEE 
LEFHESREAN 
KPQCWFLNI 
LTQCREWLT 
PEQLRIQVSR 
AGAKSPTCQK 
PDLPTPTSPL 
EQSRTIiJjHKA 
GASLMKTDQQ 



51 

I - 

FGHRKAITKS 
YVGEQYCVAR 
RHHWVHRRRQ. 
SLGVHAAWI- 
PLMKPLLVPV- 
CGGDGTVGWI - 
GNWQLDRWD. 
PEKFNSRFRN' 
PRYCAGTMPW 
TSKAIPVQVD 
VSMHDYEALH 
LSPKWCFLDA 
PTSPCSPTPR 
VSTGSKDWR 
GDTPRQRAEK 



Seq ID NO: 173 DNA sequence 
Nucleic Acid Accession 9: AF232772 
Coding sequence: 1-1662 



1 
I 

ATGCCGGTGC 
GTGCTGGGTG 
CACTACCTGT 
CTTTTTGCCT 
TCCCCGCGGC 
TTGCGCAAGT 
GTGGTGGATG 
GGCGGCACCG 
GGTGAGACGG 
AGCACCTTCT 
TTCAAGGCCC 
GATCCAGCCT 
GTCGGGGGAG 
' GTGCGGTACT 
CAGTGTATTA 
GACTGGTACC 
ACCAACCGAG 
ACAGAGACCC 
TACTTCOGGG 
TACGAGTCAG 
TTCTACCGGG 
ATTATCAAGG 



11 
I 

AGCTGACGAC 
GCATCCTGGC 
CCTTOGGCCT 
TCCTGGAGCA 
GGGGCTCGGT 
GCCTGCGCTC 
GCAACCGCCA 
AGCAGGCCGG 
AGGCCAGCCT 
CGTGCATCAT 
TCGGCGATTC 
GCACCATCGA 
ATGTCCAGAT 
GGATGGCCTT 
GTGGGCCCTT 
ATCAGAAGTT 
TCCTGAGCCT 
CCACTAAGTA 
AGTGGCTCTA 
TGGTCACGGG 
GCCGCATCTG 
CCACCTACGC 



21 
I 

AGCCCTGCGT 
AGCCTATGTG 
GTACGGCGCC 
CCGGCGCATG 
GGCACTGTGC 
GGCCCAGCGC 
GGAGGACGCC 
CTTCTTTGTG 
GCAGGAGGGC 
GCAGAAGTGG 
GGTGGACTAC 
GATGCTTCGA 
CCTCAACAAG 
CAACGTGGAG 
GGGCATGTAC 
CCTAGGCAGC 
TGGCTACCGA 
CCTCCGGTGG 
CAACTCTCTG 
TTTCTTCCCC 
GAACATTCTC 
CTGCTTCCTT 



31 
I 

GTGGTGGGCA 
ACGGGCTACC 
ATCCTGGGCC 
CGACGTGCCG. 
ATTGCCGCAT 
ATCTOCTTCC 
TACATGCTGG 
TGGCGCAGCA 
ATGGACCGTG 
GGAGGCAAGC 
ATCCAGGTGT 
GTCCTGGAGG 
TACGACTCAT 
CGGGCCTGCC 
CGCAACAGCC 
AAGTGCAGCT 
ACTAAGTATA- 
CTCAACCAGC 
TGGTTCCATA 
TTCTTCCTCA 
CTCTTCCTGC 
CGGGGCAATG 



41 

I 

CCAGCCTGTT 
AGTTCATCCA 
TGCACCTGCT 
GCCAGGCCCT 
ACCAGGAGGA 
CTGACCTCAA 
ACATCTTCCA 
ACTTCCATGA 
TGCGGGATGT 
GCGAGGTCAT 
GCGACTCTGA' 
AGGATCCCCA 
GGATTTCCTT 
AGTCCTACTT 
TCCTCCAGCA 
TCGGGGATGA 
CCGCGCGCTC 
AAACCCGCTG 
AGCACCACCT 
TTGCCACGGT 
TGACGGTGCA 
CAGAGATGAT 



51 
I 

TGCCCTGGCA 
CACGGAAAAG 
CATTCAGAGC 
GAAGCTGCCC 
CCCTGACTAC 
GGTGGTCATG 
CGAGGTGCTG 
GGCAGGCGAG 
GGTGCGGGCC 
GTACACGGCC 
CACTGTGCTG 
AGTAGGGGGA 
CCTGAGCAGC 
TGGCTGTGTG 
GTTCCTGGAG 
CCGGCACCTC 
CAAGTGCCTC 
GAGCAAGTCT 
CTGGATGACC 
TATACAGCTT 
GCTGGTGGGC 
CTTCATGTCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



60 
120 
180 
240 
300 
360 
' 420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



252 



5- 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

. CTCTACTCCC TCCTCTATAT 
ATCAACAAAT " CTGGCTGGGG 
CTCATTCCTG TGTCCATCTG 
TGCCAGGACC TGTTCAGTGA 
GGCTGCTACT GGGTGGCCCT 
AAGAAGCCGG AGCAGTACAG 
OGGGTAAAGT GCAATGGGTA 
GGGAGGAGGG AGTGCTGTGT 
AAGAACGGTG. ATGTAGTATG 
AGATGCAGGG CTGCAGGGGA 
TAGGCAGTAG CCTCCTCCTG 
AGTCCCATTC AGTGGCAACT 
TTCTCCCAGC CCATCTGAAC 
GGCCGGTTAG TGTATGTCAC, 
GCGCGTGAGA TACAAGGCCC 
ATGGTGCTTT ATGTGAGATA 
CAGGGAGTTA GCACTGAACT 
GGAACAAAGA GATTGTGGTG 
TTCCACCTGG AAACTGCTCA 
GGGGAGATAA AAAGATTAAG 
TAACCTGTAT ACTCTTGAAT 
CCCCACTTCA CTTTCTTCAA 
CCCTCATCAT CATAGGTAAG 
TAGAAGCTTC. AGCAGGAGGC 
TGATCAAATT GGCTACAATC 
ATGTGTGACT. TTCAGGCTAC 
TCOCAAAGTG AACTCTCAAA 
GGCTTCTCCA GGGAATTCTT 
TCCAGAAACC- AAACTAGGAG 
CTCATGCCTC CTGAGGCTGT 
GAAGCCATTT. TCCAAGTGAC 
TGGGATCCTG ACTAAGCCTT 
TCAAAGGGGC CAACTAACCC 
AAGCCTCTAA TGTACCAAGT 
GGGCAAACCC TGGTGCTTTC 
AACAGTTGCC* ACATCACACA 
CCAACCCATT GAAGGCTGGA . 
GTTCTGCTGG CCAAGAAGTT 
CAAGTGCAGA GTTCAGACTT 
AATGTAGGAT GACAGGCTCT 
GTCAACTTTC CTCAAATCAA 
TCGACTGGTT TTTCTAAGTT 
CAGCTTTATC CCCGTTTCTT 
TTTCCTCATT GAGAATTCAA 
TTTATTTTTA CCTGCTTTTT 
CTTATTTGTC AAAATAAAGA 



GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTGCTACC 
CACCTCTGGC GGAAAAACCA TTGTGGTGAA CTTCATTGGC 
GGTGGCAGTT CTCCTGGAGG GGCTGGCCTA* CACAGCTTAT 
GACAGAGCTA GCCTTCCTTG -TCTCTGGGGC TATACTGTAT 
CCTCATGCTA TATCTGGCCA TCATCGCCCG. GCGATGTGGG 
CTTGGCTTTT • GCTGAGGTGT GACATGGCCC CCAAGCAGAG 
AGGGAGGGAA GGGGAATGGA AGAGAAAAGA CAGGGTGGGA 
TTTAGTCTCT TAATGGTCCA AAGGACAAAT CTAAAATGCA 
GCCTGACAGC TCTGTTT AGA ." GGAGGCAACA CTGATCCCCC 
TTCTGTGTTT TCAGACTGCC TGTCTGCTTG CATCTGCACA 
GGCTCCAGAG GGCACTCAGA' AGTTGTGCTA AACCAAGTTA 
TGTGATAGGT ACCTGAGTGA- CGGCAACCTG CGGAAGGAGG 
ACAACCAGAG GTGGCAGGAG.' AATTTCTACT GAGCGAGGTG 
CCCCACCCCA CCCATAAGTA GTCATCAATG CAATAAGATT 
AGAAGCCTGA TCTTTGGGCA TCAGAAAACA GGGTCCAGGA 
CCCCACTCCA CATCAACATT CCAGGGATGA GCCAAACCAG 
GCTTTTAAAA GTGCACATTA- AAAAGGAAAG TTTGCCAGGA 
GTGCTAAAGG AGGCCATAAG CTACACAGAG GCCTTGGGTG 
GACGTCTAGA TGGGTTCTTA GCTTGTCTGT GATCTCTGCT 
CCCCAACATG TTCAGAAAAG AAGTGAAGTC TTGGGTATTT 
TCCTCTCAAA TTCAGCTCTG. ATCTGAGGCT AAGACACACT • 
AGCCACATTT TTTGAGGTAT CACTGCAGTC ACCTCTTCTA - 
GTTTTCAAGG TGGCAATTGG GGCGGAGCCC CGGCTTCTTA 
AAGCGTGTTC TCAGCACATA TGGGAACTAT GAGGAGCCTC 
TTGGAGCTGC TTGGACGGAT: TCCTTGGCAG CCGGGTTAGC 

tgttcttgac aatcatctcc aatggaaagc ttttcagtgt 
tccaaaatgg: ttatctttga gaccatccat tctcctcagt 
acagccaagt tgtgacagtc actgcatttg cc tgct tctt 
ATGAAACTGG ttcctacatc ctaaggttct. TGCTTTCTCT. 
TTTTGGCTGT TTTCCCTCTG CTGCTTTTGG GGAATGAGGG. 

TTGCAATCCA ggctgttctc agcgttttga gtttaaaacc 

TGACTTAAGG GTTGCTTGCT TGCCCTCCAA- ATGTCCTTTC 
GTGCAGAACC AGCACTAAGG TGGACAGCAG ACAAGAGGGC 
GCTTCCTACA AAGACGCAAG GTGTGCTCCG AACCACAGAT • 
CTTCATCTCC CACGAACTCA AGGGTTTTCC AAGTGTAGCT 
GACCTCCAGT. TTCTGGTAAG ACTGCTGGTT GACATCAGAC 
AGGCAGCAGG CATTTGCTAA GGCAGCTGAT CCAGGCAATC 
AAACTATTTT GAGCATTAGA ATGGAGGAAA TCCGGTCAGC 
CGCTAAGGGC TTGTTTTTCT TCAGCATTTA CTTGAAGATT . 
CCTGGCTGTC CTACCATCAG CTCTGCCTTG CACTGTGGTC 
AAACAGGCAG GTACAGGTAG TGGGCTCACA ACGTTTGACC 
ATTTTGTACA TTTTTCAGCA GCAAAACCAA A CTGGG TCTT 
GCAAGGGAAG AGCCTTTATA CAATTGGACG CATTTTGGTT 
ATCCTCTTTT GTATTGTTTC TACAATAATT TGTAAACATA 
TTTTTTTTTT TAATTTTCAG GTCAAGTTTT TTATACTGCA 
TTCTCACAT 



PCTAJS02/12476 



Seq ID NO: 174 Protein (sequences 
Protein Accession #: AAP36984 



MPVQLTTALR 
LFAFLBHRRM 
WDGNRQEDA 
STFSCIMQKW 
VGGDVQILNK 
DWYHQKFLGS 
YFREIVLYNSL 
IIKATYACFL 
LIPVSIWVAV- 
KKPEQYSLAF 



11 

I 

WGTSLFALA 
RRAGQALKLP 
YMLDIFHEVL 
GGKREVMYTA 
YDSWISFLS3 
KCSFGDDRHL 
WFHKHHIiWMT 
RGNAEMIFMS 
LLEGLAYTAY 
ABV 



21 
I 

VLGGILAAYV 
SPRRGSVALC 
GGTEQAGFFV 
FKALGDSVDY 
VRYWMAFNVE 
TNRVLSLGYR 
YESWTGFFP 
LYSLLYMSSL 
CQDLFSETEL 



31 
I 

TGYQFIHTEK 
IAAYQEDPDY 
WRSNFHEAGE 
IQVCDSDTVL 
RACQSYFGCV 
TKYTARSKCL 
FFLIATVIQL 
LPAKIFAIAT 
AFLVSGAILY 



41 

I 

HYLSFGLYGA 
LRKCLRSAQR 
GETEASLQEG 
DPACTIEMLR 
QCISGPLGMY 
TETPTKYLRW 
FYRGRIWNIL 
INKSGWGTSG 
GCYWVALLML 



51 
I 

ILGLHLLIQS 
ISFPDLKWM 
MDRVRDWRA 
VLEEDPQVGG 
RNSLLQQFLE 
LNQQTRWSKS 
LFLLTVQLVG 
RKTIWNFIG 
YLAI IARRCG 



Seq ID NO: 175 DNA sequence 
Nucleic Acid Accession #: NM_000G91 
Coding sequence: 43.. 1404. 



CCAGGAGCCC 
GCCGTGAAGC 
TTCCAGCAGC 
GCGCTGGCCG 
GTCCTAGAGG 
GTGGAGAAGA 
GTGGTCCTCG 
GGGGCCATCG 
GCGAGCCTGC 
AATGGGGGTG 
ACGGGCAGCA 
GTCACGCTGG 
GTGGCCTGCC 
CCAGACTACA 
TCACTGAAAG 
AGTGCCCGGC 
GGCACCGGGG 



11 
I 

CAGTTACCGG 
GCGCCOGCGC 
TGGAGGCGCT 
CAGACCTGCA 
AGATCGAGTA i 
CGCCCCAGAC 
TCATTGGCAC 
CTGCAGGGAA- 
TGGCTACCAT 
TCCCTGAGAC 
CGGGGGTGGG 
AGCTGGGAGG. 
GACGCATCGC 
TCCTCTGTGA 
AGTTCTACGG 
ACTTCCAGAG 
ATGCCGCCAC 



21 
1 

GAGAGGCTGT 
CGCCTTCAGC 
GCAGCGCCTG 
CAAGAATGAA 
CATGATCCAG 
TCAGCAGGAC 
CTGGAACTAC 
OGCAGTGGTC 
CATCCCCCAG 
CACGGAGCTG 
GAAGATCATC 
GAAGAGTCCC 
CTGGGGGAAA 
CCCCTCGATC 
GGAAGATGCT 
GGTGATGGGC 
TCGCTACATA 



31 
I 

GTCAAAGGCG 
TCGGGCAGGA 
ATCCAGGAGC 
TGGAACGCCT 
AAGCTCCCTG 
GAGCTCTACA 
CCCTTCAACC 
CTCAAGCCCT 
TACCTGGACA- 
CTCAAGGAGA 
ATGACGGCTG 
TGCTAOGTGG 
TTCATGAACA 
CAGAACCAAA 
AAGAAATCCC 
CTGATTGAGG 
GCCCCCACCA 



41 
I 

CCATGAGCAA 
CCCGTCCGCT 
AGGAGCAGGA 
ACTATGAGGA 
AGTGGGCCGC 
TCCACTCGGA 
TCACCATCCA 
CGGAGCTGAG 
AGGATCTGTA 
GGTTCGACCA 
CTGCCAAGCA 
ACAAGAACTG 
GTGGCCAGAC 
TTGTGGAGAA 
GGGACTATGG 
GCCAGAAGGT 
TCCTCAOGGA 



51 
t 

GATCAGCGAG 
GCAGTTCCGA 
GCTGGTGGGC 
GGTGGTGTAC 
GGATGAGCCC 
GCCACTGGGC 
GCCCATGGTG 
TGAGAACATG 
CCCAGTAATC 
TATCCTGTAC 
CCTGACCCCT 
TGACCTGGAC 
CTGCGTGGCC 
GCTCAAGAAG 
AAGAATCATT 
GGCTTATGGG 
CGTGGACCCC 



1380 
1440 
1500 
*1560 
1620 
1680 
1740. 
1800 
1860 
1920 
1980 
2040. 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760. 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
378Q 
3840 
3900 
3960 
4020 



60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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CAGTCCCCGG TGATGCAAGA GGAGATCTTC GGGCCTGTGC TGCCCATCGT GTGCGTGOGC 1080- 

AGCCTGGAGG AGGCCATCCA GTTCATCAAC CAGCGTGAGA AGCCCCTGGC CCTCTACATG 1140 

TTCTCCAGCA" AOGACAAGGT GATTAAGAAG ATGATTGCAG AGACATCCAG TGGTGGGGTG 1200 

•GCGGCCAACG ATGTCATCGT CCACATCACC TTGCACTCTC TGCCCTTCGG GGGGGTGGGG 1260 

AACAGCGGCA TGGGATCCTA CCATGGCAAG AAGAGCTTCG AGACTTTCTC TCACCGCOGC 1320 

TCTTGCCTGG TGAGGCCTCT GATGAATGAT GAAGGCCTGA AGGTCAGATA CCCCCCGAGC 1380 

CCGGCCAAGA TGACCCAGCA CTGAGGAGGG GTTGCTCCGC CTGGCCTGGC CATACTGTGT 1440 

CCCATCGGAG TGCGGACCAC CCTCACTGGC TCTCCTGGCC CTGGAGAATC. GCTCCTGCAG 1500 

CCCCAGCCCA GCCCCACTCC TCTGCTGACC TGCTGACCTG - TGCACACCCC ACTCCCACAT 1560 

GGGCCCAGGC CTCACCATTC CAAGTCTCCA CCCCTTTCTA GACCAATAAA GAGACAAATA 1S20 - 
CAATTTTCTA ACTCGG 

Seq ID NO: 176 Protein sequence t 
Protein Accession 8: NP_000682 

X 11 21 31 41 51 

MSKISEAVKR ARAAFSSGRT RPLQFRFQQIi EALQRLIQEQ EQELVGALAA DLHKNEWNAY 60 

YEEWYVLEE IEYMIQKLPE WAADEPVEKT PQTQQDELYI HSEPLGWLV IGTWNYPFNL 120 

TIQPMVGAI A AGNAWLKPS- ELSENMASLL- ATI I PQYLDK DLYPVINGGV PETTELLKER • 180 

FDHILYTGST GVGKI IMTAA AKHLTPVTLE LGGKSPCYVD KNCDLDVACR RIAWGKFMNS 240 

GQTCVAPDYI LCDPSIQNQI VEKLKKSLKE' FYGEDAKXSR- DYGRIISARH FQRVKGLIEG 300 

QKVAYGGTGD AATRYIAPTI LTDVDPQSPV MQEEIFGPVL' PIVCVRSLEE AIQFINQREK 360 

PLALYMFSSN- DKVIKKMIAE TSSGGVAAND VIVHITLHSL PFGGVGNSGM GSYHGKKSFE 420 
TFSHRRSCLV RPbMNDEGLK VRYPPSPAKM TQH 

Seq ID NO: 177 DMA sequence * 
Nucleic Acid Accession #: NM_00lb67.1 
Coding sequencer 108-4703 

1 11 21 31 41 51 

CTAACCGACG CGCGTCTGTG GAGAAGCGGC TTGGTGGGGG GTGGTCTCGT GGGGTCCTGC 60 

CTGTTTAGTC GCTTTCAGGG TTCTTGAGCC CCTTCACGAC CGTCACCATG GAAGTGTCAC 120 

CATTGCAGCC TGTAAATGAA' AATATGCAAG TCAACAAAAT AAAGAAAAAT: GAAGATGCTA 180 

AGAAAAGACT GTCTGTTGAA AGAATCTATC AAAAGAAAAC ACAATTGGAA -• CATATTTTGC 240 

TCCGCCCAGA' CACCTACATT- GGTTCTGTGG : AATTAGTGAC CCAGCAAATG : TGGGTTTACG . 300 

ATGAAGATGT TGGCATTAAC TATAGGGAAG TCACTTTTGT TCCTGGTTTG- TACAAAATCT 1 360 

TTGATGAGAT ■ TCTAGTTAAT GCTGCGGACA ACAAACAAAG GGACCCAAAA^ ATGTCTTGTA 420- 

TTAGAGTCAC: AATTGATCCG GAAAACAATT TAATTAGTAT ATGGAATAAT GGAAAAGGTA 480 

TTCCTGTTGT TGAACACAAA GTTGAAAAGA TGTATGTCCC AGCTCTCATA TTTGGACAGC 540 

TCCTAACTTC TAGTAACTAT GATGATGATG AAAAGAAAGT GACAGGTGGT CGAAATGGCT 600 

ATGGAGCCAA ATTGTGTAAC" ATATTCAGTA CCAAATTTAC TGTGGAAACA' GCCAGTAGAG. 660 

AATACAAGAA- AATGTTCAAA- CAGACATGGA TGGATAATAT GGGAAGAGCT GGTGAGATGG 1 720 

AACTCAAGCC CTTCAATGGA GAAGATTATA' CATGTATCAC CTTTCAGCCT GATTTGTCTA 780- 

AGTTTAAAAT GCAAAGCCTG GACAAAGATA TTGTTGCACT AATGGTCAGA AGAGCATATG 840 

ATATTGCTGG ATCCACCAAA' GATGTCAAAG TCTTTCTTAA TGGAAATAAA CTGCCAGTAA 900 

AAGGATTTCG TAGTTATGTG GACATGTATT TGAAGGACAA GTTGGATGAA ACTGGTAACT 960 

CCTTGAAAGT AATACATGAA- CAAGTAAACC ACAGGTGGGA AGTGTGTTTA ACTATGAGTG 1020 

AAAAAGGCTT. TCAGCAAATT .AGCTTTGTCA ACAGCATTGC TACATCCAAG GGTGGCAGAC 1080 

ATGTTGATTA TGTAGCTGAT. CAGATTGTGA CTAAACTTGT TGATGTTGTG. AAGAAGAAGA 1140 

ACAAGGGTGG TGTTGCAGTA AAAGCACATC AGGTGAAAAA TCACATGTGG ATTTTTGTAA - 1200 

ATGCCTTAAT TGAAAACCCA ACCTTTGACT CTCAGACAAA AGAAAACATG. ACTTTACAAC 1260 

CCAAGAGCTT TGGATCAACA TGCCAATTGA GTGAAAAATT TATCAAAGCT GCCATTGGCT 1320 

GTGGTATTGT AGAAAGCATA CTAAACTGGG TGAAGTTTAA GGCCCAAGTC CAGTTAAACA- 1380 

AGAAGTGTTC AGCTGTAAAA CATAATAGAA- TCAAGGGAAT TCCCAAACTC GATGATGCCA 1440 

ATGATGCAGG GGGCCGAAAC TCCACTGAGT- 5 GTAGGCTTAT CCTGACTGAG GGAGATTCAG 1500 

CCAAAACTTT' GGCTGTTTCA GGCCTTGGTG TGGTTGGGAG AGACAAATAT GGGGTTTTCC 1560 

CTCTTAGAGG AAAAATACTC AATGTTCGAG AAGCTTCTCA TAAGCAGATC ATGGAAAATG- 1620 

CTGAGATTAA CAATATCATC AAGATTGTGG GTCTTCAGTA CAAGAAAAAC TATGAAGATG 1680 

AAGATTCATT GAAGACGCTT CGTTATGGGA AGATAATGAT TATGACAGAT CAGGACCAAG 1740 

ATGGTTCCCA CATCAAAGGC TTGCTGATTA ATTTTATCCA TCACAACTGG CCCTCTCTTC 1800 

TGCGACATCG TTTTCTGGAG GAATTTATCA CTCCCATTGT AAAGGTATCT AAAAACAAGC 1860 

AAGAAATGGC ATTTTACAGC CTTCCTGAAT TTGAAGAGTG GAAGAGTTCT ACTCCAAATC 1920 

ATAAAAAATG GAAAGTCAAA TATTACAAAG GTTTGGGCAC CAGCACATCA AAGGAAGCTA 1980 

AAGAATACTT TGCAGATATG AAAAGACATC GTATCCAGTT CAAATATTCT GGTCCTGAAG 2040 

ATGATGCTGC TATCAGCCTG GCCTTTAGCA AAAAACAGAT AGATGATCGA AAGGAATGGT 2100 

TAACTAATTT CATGGAGGAT AGAAGACAAC GAAAGTTACT TGGGCTTCCT GAGGATTACT 2160 

TGTATGGACA AACTACGACA TATCTGACAT- ATAATGACTT CATCAACAAG GAACTTATCT 2220 

TGTTCTCAAA TTCTGATAAC GAGAGATCTA TCCCTTCTAT GGTGGATGGT TTGAAACCAG 2280 

GTCAGAGAAA GGTTTTGTTT ACTTGCTTCA AACGGAATGA CAAGCGAGAA GTAAAGGTTG- 2340 

CCCAATTAGC TGGATCAGTG GCTGAAATGT CTTCTTATCA TCATGGTGAG ATGTCACTAA 2400 

TGATGACCAT TATCAATTTG GCTCAGAATT TTGTGGGTAG CAATAATCTA AACCTCTTGC 2460 

AGCCCATTGG TCAGTTTGGT ACCAGGCTAC ATGGTGGCAA GGATTCTGCT AGTCCACGAT 2520 

ACATCTTTAC AATGCTCAGC TCTTTGGCTC GATTGTTATT TCCACCAAAA GATGATCACA 2580 

CGTTGAAGTT TTT ATATGAT GACAACCAGC GTGTTGAGCC TGAATGGTAC ATTCCTATTA 2640 

TTCCCATGGT GCTGATAAAT GGTGCTGAAG GAATCGGTAC TGGGTGGTCC TGCAAAATCC 2700 

CCAACTTTGA TGTGCGTGAA ATTGTAAATA ACATCAGGCG" TTTGATGGAT GGAGAAGAAC 2760 

CTTTGCCAAT GCTTCCAAGT TACAAGAACT TCAAGGGTAC TATTGAAGAA CTGGCTCCAA 2820 

ATCAATATGT- GATTAGTGGT GAAGTAGCTA TTCTTAATTC TACAACCATT GAAATCTCAG 2880 

AGCTTCCCGT CAGAACATGG ACCCAGACAT ACAAAGAACA AGTTCTAGAA CCCATGTTGA 2940 

ATGGCACCGA GAAGACACCT CCTCTCATAA CAGACTATAG GGAATACCAT ACAGATACCA- 3000 

CTGTGAAATT TGTTGTGAAG ATGACTGAAG AAAAACTGGC AGAGGCAGAG AGAGTTG GAC 3060 

TACACAAAGT CTTCAAACTC CAAACTAGTC TCACATGCAA CTCTATGGTG CTTTTTGACC 3120 

ACGTAGGCTG TTTAAAGAAA TATGACACGG TGTTGGATAT TCTAAGAGAC TTTTTTGAAC 3180 

TCAGACTTAA ATATTATGGA TTAAGAAAAG AATGGCTCCT AGGAATGCTT GGTGCTGAAT 3240 

CTGCTAAACT GAATAATCAG GCTCGCTTTA TCTTAGAGAA AATAGATGGC AAAATAATCA 3300 
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TTGAAAATAA GCCTAAGAAA GAATTAATTA AAGTTCTGAT TCAGAGGGGA" TATGATTCGG 3360 
ATCCTGTGAA GGCCTGGAAA GAAGCCCAGC AAAAGGTTCC AGATGAAGAA GAAAATGAAG ' 3420 

AGAGTGACAA CGAAAAGGAA ACTGAAAAGA GTGACTCCGT AACAGATTCT GGACCAACCT 3480 

TCAACTATCT TCTTGATATG CCCCTTTGGT ATTTAACCAA GGAAAAGAAA GATGAACTCT- 3540 

GCAGGCTAAO AAATGAAAAA GAACAAGAGC TGGACACATT AAAAAGAAAG AGTCCATCAG 3600 

ATTTGTGGAA AGAAGACTTG GCTACATTTA TTGAAGAATT GGAGGCTGTT GAAGCCAAGG 3660 

AAAAACAAGA TGAACAAGTC GGACTTCCTG GGAAAGGGGG GAAGGCCAAG GGGAAAAAAA 3720 

CACAAATGGC TGAAGTTTTG CCTTCTCCGC GTGGTCAAAG. AGTCATTCCA CGAATAACCA 3780 

TAGAAATGAA AGCAGAGGCA- GAAAAGAAAA ATAAAAAGAA AATTAAGAAT GAAAATACTG 3840- 

AAGGAAGCCC TCAAGAAGAT GGTGTGGAAC .TAGAAGGCCT AAAACAAAGA TTAGAAAAGA 3900 

' AACAGAAAAG AGAACCAGGT ACAAAGACAA AGAAACAAAC TACATTGGCA TTTAAGCCAA 3960 

TCAAAAAAGG AAAGAAGAGA AATCCCTGGC CTGATTCAGA ATCAGATAGG AGCAGTGACG 4020" 

AAAGTAATTT. TGATGTCCCT CCACGAGAAA CAGAGCCACG GAGAGCAGCA ACAAAAACAA 4080 

AATTCACAAT GGATTTGGAT TCAGATGAAG ATTTCTCAGA' TTTTGATGAA AAAACTGATG 4140 

ATGAAGATTT TGTCCCATCA GATGCTAGTC CACCTAAGAC CAAAACTTCC CCAAAACTTA 4200 

GTAACAAAGA ACTGAAACCA CAGAAAAGTG TCGTGTCAGA CCTTGAAGCT GATGATGTTA 4260 
AGGGCAGTGT- ACCACTGTCT. TCAAGCCCTC CTG CTACACA- TTTCCCAGAT GAAACTGAAAf- 4320 

TTACAAACCC AGTTCCTAAA- AAGAATGTGA CAGTGAAGAA" GACAGCAGCA AAAAGTCAGT 4380 
CTTCCACCTC CACTACCGGT GCCAAAAAAA GGGCTGCCCC AAAAGGAACT AAAAGGGATC 4440 

CAGCTTTGAA' TTCTGGTGTC .TCTCAAAAGC CTGATCCTGC CAAAACCAAG AATCGCCGCA- 4500 

AAAGGAAGCC ATCCACTTCT GATGATTCTG. ACTCTAATTT- TGAGAAAATT GTTTCGAAAG ' 4560 

CAGTCACAAG: CAAGAAATCC AAGGGGGAGA' GTGATGACTT CCATATGGAC TTTGACTCAG - 4620 

CTGTGGCTCC TCGGGCAAAA TCTGTACGGGn CAAAGAAACC TATAAAGTAC CTGGAAGAGT 4680 

CAGATGAAGA- TGATCTGTTT TAAAATGTGA- GGCGATTATT TTAAGTAATT ATCTTACCAA 4740 

GCCCAAGACT GGTTTTAAAG TTACCTGAAG CTCTTAACTT CCTCCCCTCT GAATTTAGTT 4800 
TGGGGAAGGT GTTTTTAGTA CAAGACATCA-.AAGTGAAGTA AAGCCCAAGT GTTCTTTAGC ... 4860' 

TTTTTATAAT ACTGTCTAAA TAGTGACCAT CTCATGGGCA TTGTTTTCTT CTCTGCTTTG 4920 

TCTGTGTTTT GAGTCTGCTT. TCTTTTGTCT TTAAAACCTG ATTTTTAAGT TCTTCTGAAC 4980 

TGTAGAAATA GCTATCTGAT CACTTCAGCG TAAAGCAGTG TGTTTATTAA CCATCCACTA 5040 

AGCTAAAACT AGAGCAGTTT GATTTAAAAG TGTCACTCTT CCTCCTTTTC TACTTTCAGT 5100 

AGATATGAGA TAGAGCATAA- TTATCTGTTT TATCTTAGTT TTATACATAA TTTACCATCA 5160 

GATAGAACTT TATGGTTCTA ■ GTACAGATAC TCTACTACAC TCAGCCTCTT ATGTGCCAAG S220 

TTTTTCTTTA AGCAATGAGA- AATTGCTCAT GTTCTTCATC TTCTCAAATC ATCAGAGGCC 5280 

AAAGAAAAAC ACTTTGGCTG; TGTCTATAAC' TTGACACAGT CAATAGAATG AAGAAAATTA 5340 

GAGTAGTTAT GTGATTATTT CAGCTCTTGA CCTGTCCCCT- CSGGCTGCCT CTGAGTCTGA 5400 

ATCTCCCAAA/ GAGAGAAACC AATTTCTAAG AGGACTGGAT TGCAGAAGAC TCGGGGACAA 5460' 

CATTTGATCC AAGATCTTAA ATGTTATATT; GATAACCATG CTCAGCAATG AGCTATTAGA 5 52 0 

TTCATTTTGG GAAATCTCCA TAATTTCAAT TTGTAAACTT- TGTTAAGACC TGTCTACATT 5580 

GTTATATGTG .TGTGACTTGA. GTAATGTTAT CAACGTTTTT GTAAATATTT ACTATGTTTT 5640 
TCTATTAGCT AAATTCCAAC AATTTTGTAC TTTAATAAAA TGTTCTAAAC ATTGC 

Seq ID NO: 178 Protein sequence: 
Protein Accession #: NP_001058.1 



MEVSPLQPVN 

MWVYDEDVGr 

NGKGIPWEH-- 

TASREYKKMF 

RRAYDIAGST 

LTMSEKGPQQ.- 

WIFVNALIEN; 

VQLNKKCSAV 

YGVFPLRGKI> 

DQDQDGSHIX 

STPNHKKWKV- 

RKEWLTNFME. 

GLKPGQRKVL- 

LNLLQPIGQF 

YIPIIPMVLI=. 

ELAPNQYVIS 

HTDTTVKPW 

DFFELRLKYY- 

GYDSDPVKAW 

KDELCRliRNE- 

KGKKTQMAEV. 

RLEKKQKREP . 

ATKTKFTMDL 

ADDVKGSVPL 

TKRDPALNSG 

DFDSAVAPRA 



11 21 

I I 
ENMQVNKIKK NEDAKKRLSV 

nyrevtfvpg; LYXIFDEILV: 
KVEKMYVPAL/ IFGQLLTSSN 
KQTWMDNMGR- AGEMELKPFN- 
KDVKVFLNGN KLPVKGFRSY- 
ISFVNSIATS KGGRHVDYVA 
PTFDSQTKEN WTLQPKSFGS. 
KHNRIKGIPK LDDANDAGGR 
LNVREASHKQ; IMENAEINNI 
GLLINFIHHN WPSLLRilRFI* 
KYYKGLGTST SKEAKEYFAD 
DRRQRKLLGL PEDYLYGQTT 
FTCFKRNEKR EVKVAQLAGSi 
GTRLHGGKDS ASPRYIFTML 
NGAEGIGTGW- SCKIPNFDVR 
GEVAILNSTT. IEISELPVRT 
KMTBEKLAEA ERVGLHKVFK 
GLRKEWLLGM : LGAESAKLNN 
KEAQQKVPDE EENEESDNEK 
KEQELDTLKR KSPSDLWKED 
LPSPRGQRVI: PRXTIEMKAE 
GTKTKKQTTL AFKPIKKGKK 
DSDEDFSDFD EKTDDEDFVP 
SSSPPATHFP DETEITNPVP 
VSQKPDPAKT KNRRKRKPST 
KSVRAKKPIK YLEESDEDDL 



31 
I 

ERIYQKKTQL 
NAADNKQRDP 
YDDDEKKVTG 
GEDYTCITFQ, 
VDMYLKDKLD 
DQIVTKLVDV- 
TCQLSEKFIK 
NSTECTLILT 
IKIVGLQYKK 
EEFITPIVKV, 
MKRHRIQFKY 
TYLTYNDFIN 
VAEMSSYHHG 
SSLARLLFPP 
EIVNNIRRLM 
WTQTYKEQVL 
LQTSLTCNSM 
QARFILEKID. 
ETEKSDSVTD 
LATFIEELEA 
AEKKNKKKIK 
RNPWPDSESD 
SDASPPKTKT 
KKNVTVKKTA 



41 

I 

EHILLRPDTY 
KMSCIRVTID 
GRNGYGAKLC 
PDLSKFKMQS 
ETGNSLKVIH- 
VKKKNKGGVA 
AAIGCGIVES 
EGDSAKTLAV- 
NYEDEDSLKT 
SKNKQEMAFY 



KELILFSNSD 
EMSLMMTIIN 
KDDHTLKFLY 
DGEEPLPMLP 
EPMLNGTEKT 
VLFDHVGCLK 
GKIIIENKPK 
SGPTFNYLLD 
VEAKEKQDBQ 



RSSDESNFDV 
SPKLSNKEDK 
AKSQSSTSTT 
IVSKAVTSKK 



51 
I 

IGSVELVTQQ 

PENNLISIWN 

NIFSTKFTVE. 

LDKDIVALMV' 

EQVNHRWEVC 

VKAEQVKNHM 

ILNWVKFKAQ. 

SGLGWGRDK 

LRYGKIMIMT. 

SLPEFEEWKS : 

LAFSKKQIDD - 

NERSIPSMVD 

LAQNFVGSNN; 

DDNQRVEPEW- 

SYKNFKGTIE 

PPLITDYREY 

KYDTVLDILR 

KELIKVLIQR - 

MPLWYLTKEK 

VGLPGKGGKA 

DGVELEGLKQ 

PPRETEPRRA 

PQKSWSDLE 

GAKKRAAPKG 

SKGESDDFHM 



60 
120 
180 * 
240 
300 ' 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 

1020 

1080 

1140 

1200 

1260 ' 

1320 

1380 

1440 

1500 



Seq ID NO: 179 DNA sequence 

Nucleic Acid Accession #: Bos sequence 

Coding sequence: 148-7095 



CACACATACG 
CAAAAAAAAC 
CGGCGAGGGG 
CAGCTCCTCT 
CTTGTTGAAG 
AAATATCCAA 
CAAGTAAATG 



11 

i: 

CACGCACGAT 
ATTTCCTTCG 
CCGCAGACCG: 
GTGTTTGCCG 
AGATTGGCTG 
CATGTAATAG 
TGAATCTTAA 



21 
I 

CTCACTTCGA 
CTCCCCCTCC 
TCTGGAAATG 
CCTGGATTGG 
GTCCTATACA 
CCCAAAACAA 
GAAACTTAAA 



31 
I 

TCTATACACT 
CTCTCCACTC 
CGAATCCTAA 
GCTAATGGAT 
GGAGCACTGA 
TCTCCTATCA 
TTTCAGGGTT 



41 
I 

GGAGGATTAA 
TGAGAAGCAG 
AGCGTTTCCT 
ACTACAGACA 
ATCAAAAAAA 
ATATTGATGA 
GGGATAAAAC 



51 
I 

AACAAACAAA 60 

AGGAGCCGCA 120 

CGCTTGCATT 180 

ACAGAGAAAA 240 

TTGGGGAAAG 300 

AGATCTTACA 360 

ATCATTGGAA 420 
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AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT. AAAGCAAGCA AGATAACTTT TCACTGGGGA- 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT .600 

GAGATGCAAA TCTACTGGTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA- AGCAGT CAAA 660 

5 GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA* AGAAAATTTG 720 . 

GATTTCAAAG CGATTATTGA- TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA' GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTAC ATTTAC 840 

AATCGCTCAT- TGACATCTCC; TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA- TCTCTGAAAG. CCAGTTGGCT GTTTTTTGTG" AAGTTCTTAC AATGCAACAA 960 

10 TCTGGTTATG TCATGCTGAT GGACTACTTA. CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 ' 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG *■ AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCE- TCTT GTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

15 GGTGCTATTC . TCAATAATTT GCTACCCAAT* ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA i TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTGj AAGAAATAAT. CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA' CACACTACAA TCGCATAGGG ■; 1560 

20 ACGAAATACA : ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC- ATCTTTAAAT ■ TCCACTTCCC. AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG? ATATTTCCTT, GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT: CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG =■ 1800 

AACTTGTCGG GGACTGCAGA" ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG- I860 

2 5 AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCT6" CTATCCCATT' CATCTCTGAG. AACATATCCC AAGGGTATAT ATTTTCCTCC." 1980 

GAAAACCCAG' AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA- CTTCATCAGG" TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG, 2160 

30 AGCTTTCTCC AGACTAATTA- CACTGAGATA" CGTGTTGATG. AATCTGAGAA GACAACCAAG- 2220 

TCCTTTTCTG. : CAGGCCCAGT GATGTCACAG GGTCCCTCAG f TTACAGATCT GGAAATGCCA - 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT - ACTCGCAGAC AACCCAACCG ' 2400' 

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

35 ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC" CTGCTGCTTC AAGTAGTGAT 2520 

TCGGCCTTGC ATGCTACGCC TGTATTTCGC AGTGTCGATG: TGTCATTTGA ATCCATCCTG 2580 

TCTTCCTATG: ATGGTGCACC TTTGCTTCCA TTTTCCTCTG; CTTCCTTCAG TAGTGAATTG < 2640- 
TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG • TT ACTTCAGC TACCGAGAGT . . 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 

40 AGCCTTGCTC AGTATTCTGA TGTGCTGTCC. ACTACTCATG; CTGCTTCAGA GACGCTGGAA 2820 

TTTGGTAGTG. AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATTCTGTGG; GTGTAACTTA- TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 

45 CCTAAGTCTT, CGTTAATAAC CCCAACTGCAv TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT. TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 . 

TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG - 3360 

50 CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT. TACAAGAAAC CTCTGTTTCC 3420 

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA • 3600 

TCCTCTGACC CTGCTTCTAG TGAAATGTTA- TCTGCTTCAA CTCAGCTCTT ATTTTATGAG- 3660 

55 ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GACACCTTGC- TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG - TTCTACAATG . TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA V CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 

APGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3 960 

60 GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATTCCGATG AAATTTTAAC . CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC' ATTCTGTTCC TATAGGAAAT 4260 

65 GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTGGTGAAGA- TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 

AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 

70 CCAATCTCAT ACTCACTATC TGAGAATTCT - GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 

TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 

CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGGATCA 4 800 

GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4860 

75 GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 

GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 

TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGC ACAC TTT 5160 

80 TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280 
CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG . 5340 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 5400 

CACAAGAArC GATACATAAA • TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 

85 CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5520 

AACAGACCAA AAGCTTATAT. TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 



PCTAJS02/12476. 



256 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG 
TTTCTGGTCA CTCAGAAGAG" TGTGCAAGTG CTTGCCTATT 
CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA 
ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC 
CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG 
CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG 
CAGATTCAAC. ACGAAGGAAC TGTCAACATA TTTGGCTTCT 
AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA 
GCCATACTTA GTAAAGAAAC' TGAGGTGCTG GACAGTCATA 
CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA 
CAGTCAAATA- TACAGCAGAG TGACTATTCT GCAGCCCTAA 
AATCGAACTT. CTTCTATCAT, CCCTGTGGAA AGATCAAGGG 
GGAGAAGGCA- CAGACTACAT CAATGCCTCC; TAT ATCATGG - 
TTCATCATTA CCCAGCACCC TCTCCTTCATr ACCATCAAGG- 
GACCATAATG CCCAACTGGT" GGTTATGATT- CCTGATGGCC 
TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG 
ATGGCTGAAG AACACAAATG, TCTATCTAAT GAGGAAAAACi 
TTAGAAGCTA' CACAGGATGAV TTATGTACTT GAAGTGAGGC 
CCAAATCCAG ATAGCCCCAT- TAGTAAAACT- TTTGAACTTA 
GCTGCCAATA GGGATGGGCC TATGATTGTT^ CATGATGAGC 
ACTTTCTGTG CTCTGACAAC. CCTTATGCAC CAACTAGAAA 
TACCAGGTAG CCAAGATGAT . CAATCTGATG : AGGCCAGGAG 
TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA 
TCCACCTCTC TGGACAGTAA- TGGTGCAGCA TTGCCTGATG 
GAGTCTTTAG. TTTAACACAG AAAGGGGTGG GGGGACTCAC 
TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT 
GACAGTAACT TTCATGACAT. AGGATTCTGC CGCCAAATTT 
CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT 
TATTTCTAAG :■ AATGGAATTG- TGGTATTTTT TTCTGTATTG. 
TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG 
GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA 
AAATAAAACA; CTCTTCCATA TGATATTCAA CATTTTACAA- 
AGAAATAATC. TGTTACTTAT TGTAAATACT GCCCTAGTGT 
TTTATAATTG TAGATTTTTA TATTTTACTA' CTGAGTCAAG. 
GTTTAGTTTA ATGACGTAGT TCATTAGCTG- GTCTTACTCT 
TTGTGTTACC TAAGTCATTA. ACTTTGTTTC . AGCATGTAAT 
GAAATACCTT. CATTTTGAAA GAAGTTTTTA TGAGAATAAC 
" AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT 
AAAAAAAAAA AAAAAAAAAA AAAA 

seq ID NO: ISO Protein sequence: 
Protein Accession #: Eos sequence 



GGAGTGAGGA 
ATACTGTGAG 
GACCCAGTGG 
CAGAGTACTC 
CAGTGGGGCC 
TGCTAGACAG 
TAAAACACAT: 
TTCATGATAC 
TTCATGCCTA 
AACAATTCCA- 
AGCAATGCAA 
TTGGCATTTC 
GCTATTACCA 
ATTTCTGGAG 
AAAACATGGC 
AGAGCTTTAA 
TTATAATTCA' 
ACTTTCAGTGr 
TAAGTGTTAT. 
ATGGAGGAGT 
AAGAAAATTC 
TCTTTGCTGA 
CAAGGCAGGA 
GAAATATAGC 
ATCTGAGCAT 
CTGTTGATTT 
ATATCATTAA 
AAAATGATTG 
ATTTTAACAG 
TTTTTAGTGT- 
CTTTTAATAC 
CTGCAGTATT- 
CTCCATGGAC 
TTTTCTAGTT 
ACCAGTTTTC 
TTTAACTTTT 
ACCTTACCAA 
ATTGCCATTA 



GTACGGGAAC 
GAATTTTACT 
ACGTGTGGTC 
CCTGCCAGTG 
TGTTGTCGTC 
TATGTTGCAG 
CCGTTCACAA 
ACTGGTTGAG 
TGTTAATGCA 
GCTCCTGAGC 
CAGGGAAAAG 
ATCCCTGAGT 
GAGCAATGAA 
GATGATATGG 
AGAAGATGAA 
GGTCACTCTT 
GGACTTTATC ; 
TCCTAAATGG 
AAAAGAAGAA 
GACGGCAGGA 
CGTGGATGTT 
CATTGAGCAG 
AGAGAATCCA 
TGAGAGCTTA 
TGTTTTCCTC 
CCCATCACCT 
CAATGTGTGC 
AATTTTACAG 
AAAATTTCAA 
CAAATTTTTA 
AGTAGCCTGT 
CACCTAAAGT, 
CAAATTTATA 
CTGTGTAATT 
TGACATTGTA 
GTGGAAAATA 
ACATTGTTCA 
AAAAAAAAAA 
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6900 
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I 

MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW: 
ILFEVGTEEN 
TDTVDWIVPK 
. TGKEEIHEAV. 
HKFLTDGYQD 
LIGTEEIIKB- 
RSPTRGSBFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
LNTTPAASSS 
ILPQVTSATE 
KTLMPSQVEP 
SLPSGPSH2P 
PVSVAEPTYT 
LNASLQETSV 
KPVLSANSEP 
AVPSDPILVE 
TISYASEKYE 
EPtNTLINKL 
PHRDGSVTST 
KCMSCSSYRE 
SPGKSPSANG. 
ENETSTDFSF 
HESRIGLAEG 
VISTPPTPIP 
TADSSNHPDN 
QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVPIHDTLV 
SAALKQCNRE 
HTIKDFWRMI- 
NEEKLIIQDF 



11 

\ 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNIiLP 



SLVSTRQEEN 



GXGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
DSALHATPVF 
SDKVPLHASL. 
PSSDAMMHAR 
IPKSSLITPT 
TSVFGDDNKA 
SISSTKGMFP 
ASSDPASSEM 
TPKVDKISST 
PVLLKSESSH 
IHSDEILTST 
KLLFP SKATS 
SQEKVMNDSD 
LSQKHNDGKE 
ADTNEKDADG 
LESEKKAVIP- 
PISDDVGAIP 
KHKNRYIKIV 
FWRMIWEHNV. 
TLRNTKIKKG 
VHCSAGVGRT 
EAILSKETEV 
KNRTSSHPV 
WDHNAQLWM 
ILEATQDDYV 
GTFCALTTLM 
PSTSLDSNGA 



21 
i 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ, 
DPENYTSLLV- 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
PSVDVSFESI 
PVAGGDLLLE 
SSGPEPSYAL 
ASItLQPTHAL 
LSKSEIIYGN 
GSI*AHTTTKV 
LSPSTQLLFY 
MLHLIVSNSA 
QWPSLYSND 
KSSVTGKVFA 



31 



THENSLMDQN 
EKDIQTGSAL 
ILAAGDSEIT 
LVIVSALTFI 
IKHFPKHVAD 
AYDHSRVKLA 
EVXVMITNLV 
SQKGRPSGRV- 
GTYIVLDSML 
LDSHIHAYVN 
ERSRVGISSL 
IPDGQNMAED 
IjEVRHFQCPK 
HQIiEKENSVD 
ALPDGNIAES 



KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 
PVYNGETPLQ 
LSSYDGAPLL 
PSLAQYSDVIi 
SDNEGSQHIF 
SGDGEWSGAS , 
ETELQIPSFN 
FDHEISQVPE 
ETSASFSTEV 
SSENMLHSTS 
ELFQTANLBI- 
GIPTVASDTF 
GLVGGGEDGD. 
NPISYSLSEN 
LPLSPESKAW 
PGFPQSPTSS 
CLWLVGIL2 



QLAEKDGKLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
SGEGTDYINA 
EFVYWPNKDE 
WPNPDSPISK 
VYQVAKMINL 
LESLV 



41 
I 

TGALNQKNWG 
TVBINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LO.NNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELiPPHT. 
TGAE0SSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
PSYSSEVFPL 
PFSSASFSSE 
STTHAASETL 
TVSYSSAIPV^ 
SDSEPLliPDT 
EMVYPSESTV 
NNFSVQPTHT 
LLQPSFQASD 
VPVFDVSPTS 
NQAHPPKGRH 
VSTDHSVPIG 
TDDDGDDDDD 
SEEDNRVTSV 
AVLTSDEESG 
VTSENSEVFH 
YWRKCFQTAH 
FETLKEFYQE 
DYINANYVDG 
WPADGSEEYG: 
DMGVPEYSLP 
IFGFLKHIRS 
TKLEKQFQLL 
SYIMGYYQSN 
PINCESFKVT 
TFELISVIKE 
MRPGVFADIE 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPB 
GTKYNBAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
VTPIiLLDNQI 
LFRHLHTVSQ 
EFGSESGVLY 
HDSVGVTYQG 
DGLTALNISS 
MPNMYDNVNK 
VSQASGDTSI* 
VDTLLKTVLP 
HMHSASLQGI* 
VFATPVLSID 
NGHVAITAVS 
DRGSDGLSIH 
SSDSQTGMDR 
SGQGTSDSLN 
VSEAEASNSS 
FYLEDSTSPR 
VQSCTVDLGI 
YNRPKAYIAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEE 
SQSNIQQSDY 
EFIITQHPLL 
LKAEEHKCLS 
EAANRDGPM1 
QYQFLYKVII* 



60 
120 
180 
240 
300 
360 
420 
4 80 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740. 
1800 
I860. 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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Coding sequence: 148-4518' 

• .'■ i: .. V*2i* , : ' 31 41 si 

.5 (Lc^C^TACG .CACGCAOGATicTCACrT<XA TCTATACACT GGAGGATTAA AACAAACAAA " ' 60 

CAAAAAAAAC ATTTCCTTCG '. CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 * 
CGGCGAGGGG.' CCGCAGACCG. TCTGGAAATG CGAATCCTAA- AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT . GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA .240 \ 
"■ • CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 " , 

10 AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA- AGATCTTACA 360 ' ] , 

caagtaaatg: tgaatcttaa gaaacttaaa TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 
■ AACACATTCA TTCATAACAC TGGGAAAACA , GTGGAAATTA- ATCTCACTAA- TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA . TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 
15 GAGATGCAAA TCTACTGCTT- TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 . 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC. GTTTTGGGAA GCAGGCTGCT. 780 ' 

TTAGATCCAT TCATACTGTT " GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

' AATGGCTCAT. TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT. TTTTAAAGAT ' 900 
20 ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA- 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA' CAAAACAATT TTCGAGAGCA' ACAGTACAAG 1020. 

TTCTCTAGAC AGGTGTTTTC "CTCATACACT GGAAAGGAAG:- AGATTCATGA- AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT. ATACCAGCCT TCTTGTTACA - 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG ; 1200 

25 CAGTTGGATG GAGAGGACCA. AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT- AGTAGCCATA 1320 

TGCACTAATG- ' GCTTATATGG . AAAATACAGC GACCAACTGA- TTGTCGACAT GCCTACTGAT. 1380 

AATCCTGAAC: TTGATCTTTT CCCTGAATTA- ATTGGAACTG, AAGAAATAAT CAAGG AGGAG 1440 

GAAGAGGGAA -AAGACATTGA* AGAAGGCGCT ATTGTGAATC CTGGTAGAGA: CAGTGCTACA " 1500 

30 AACCAAATCA- GGAAAAAGGA ACCCCAGATT. TCTACCACAA CACACTACAA TCGCATAGGG- 1560 

ACGAAATACA. ATGAAGCCAA GACTAACCGA ' TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG- TTCCCAATAC" ATCTTTAAAT TCCACTTCCC' AACCAGTCAC TAAATTAGCC 1680. 

ACAGAAAAAG ATATTTCCTT.. GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC. TCTAAAACTG: TTCTTAGATC TCCACATATG ' 1800 

35 AACTTGTCGG GGACTGCAGA" ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG^ 1860 - 

AGTTTATTGA- CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG: CTCCAGTCCC 1920 . 

GCAACTTCTG, CTATCCCATT CATCTCTGAG AACATATCCC. AAGGGTATAT' ATTTTCCTCC 1980 

GAAAACCCAG- AGACAATAAC ATATGATGTC1 CTTATACCAG AATCTGCTAG- AAATGCTTCC 2040 , 

• GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

40 GTGTGGTTTC; CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG . 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG. AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG • GGTCCCTCAG. TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT, GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AAOGTGGT AT ACTCGCAGAC AACCCAACCG 2400 

45 GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT. AG CTGAG GGG 2460 

TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520 

rGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 

TTTTACTTAG AGGACAGTAC ATCCCCTAGA: GTTATATCCA CACCTCCAAC ACCTATCTTT 2640 

CCAATTTCAG: ATGATGTCGG AGCAATTCCA ATAAAGCACT ■ TTCCAAAGCA TGTTGCAGAT. 2700 

• 50 TTACATGCAA . GTAGTGGGTT. TACTGAAGAA TTTGAGACAC" TGAAAGAGTT TTACCAGGAA 2760 

GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820 

AAGCACAAGA ATCGATACAT AAATATCGTT; GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2880 

CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA- ATGCCAATTA TGTTGATGGC 2940 

TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000 

55 TTCTGGAGAA- TGATATGGGA ACATAATGTG GAAGTTATTG- TCATGATAAC AAACCTCGTG 3060 

GAGAAAGGAA' GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 

AACTTTCTGQ TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 

ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG" GAAGACCCAG TGGACGTGTG. 3240 

GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 

60 GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 3360 

GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA- TTGTGCTAGA" CAGTATGTTG 3420 

CAGCAGATTC AACACGAAGG- AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA : 3480 

CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 

GAGGCCATAC. TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600 

65 GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660 

AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 

AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA' GGGTTGGCAT TTCATCCCTG 3780 ■ v 

AGTGGAGAAG GCACAGACTA CATCAATGCC 7CCTATATCA TGGGCTATTA CCAGAGCAAT '3840 

GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900 

70 TGGGACCATA ATGCCCAACT- GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3960 

GAATTTGTTT .ACTGGCCAAA" TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGTCACT 4020 

CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA" AACTTATAAT TCAGGACTTT 4080 

ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 

TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 -* 

75 GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 4260 

GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACTAG AAAA AGAA AA TTCCGTGGAT 4320 

GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 4380 

CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 - • 

CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 4500 ] 

80 TTAGAGTCTT TAGTTTAACA .CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 '. 

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACA ATGT G 4680 

TGCCTTTTTG CAAGACTTGT . AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

85 CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT- TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 49B0 
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AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG GACCAAATTT 5040 

ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100 

ATTGTTTAGT TTAATGACGT AGTTCATTAG' CTGGTCTTAC TCTACCAGTT TTCTGACATT .5160 

GTATTGTGTT ACCTAAGTCA "TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA' 5220 

5 ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT/ 5280 

TCAAATGGTT TTTATCCAAG . GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAAAAAA 

10 Seq ID NO: 182 Protein sequence: 
Protein Accession #: Bos sequence 

1 • 11 ' 21 31 41 51 

I I I I I I 

15 MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQXNWG KKYPTCNSPK 60 

QSPDUDEDI. TQVNVNLKKL KFQGWDKTSL ENTFIENTGK TVEINLTNDY RVSGGVSEMV 120 

PKASKITFHW- GKCNMSSDGS, EHSLEGQKFP ' LEMQ I YCFDA DRFSSFEEAV KGKGKLRAbS 180 

ILFEVGTEEN; LDFKAI IDGV* ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 
TDTVDWIVFK DTV9ISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY' KFSRQVFSSY . 300 

20 TGKEEIHEAVr CSSEPENVQA' DPENYTSIiLV- TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360 

HBFIjTDGYQD" LGAILNNLLP NMSYVLQIVA ICTNGLYGKY- SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ. ISTTTHYNRI GTKYNEAKTN" 480 

RSPTRGSEFS- GKGDVPNTSIi NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASI4ND 540 

GSKTVLRSPH- MNLSGTAESL NTVSITEYEE' ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

25 ENISQGYIFS SEKPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVl^FPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE- IRVDESEKTT . KSFSAGPVMS - QGPSVTDLEM PHYSTFAYFP* 72 0 

TEVTPHAFTP SSRQQDLVST 1 VNWYSQTTQ FVYNAEASNS SHESRIGLAE GLESEKKAVI 780 

PLVXVSALTF ICLWlivGIL IYWRKCFQTA HFYIiEDSTSP RVISTPPTPI FPISDDVGAI 840 
PIKHPPKHVA- DLHASSGFTE EFETLKEFYQ" EVQSCTVDLG ITADSSNHPD: NKHKNRYINI 900 

30 VAYDHSRVKL, AQIiAEKDGKL' TDYINANYVD. GYNRPKAYIA: AQGPLKSTAE DFWRMIWEHN-^ 960' 

VEVIVMITNLr VEKGRRKCDQ. YWPADGSEEY* GNFLVTQKSV QVLAYYTVRN. • FTLKNTKI KK 1020 

GSQKGRPSGR WTQYHYTQW PDMGVPEYSt, PVLTFVRKAA? YAKRHAVGPV WHCSAGVGR 1080 

TGTYIVLDSM* IiQQIQHEGTV- NIFGFLICHIR SQRNYLVQTE: EQYVFIHDTli VEAILSKETE- 1140 

VLDSHIHAYV: NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200 

35 VERSRVGISS: LSGEGTDYIN ASYIMGYYQS: NEFIITQHPL. LHTIKDFWRM IWDHNAQLW' 1260 

MIPDGQNMAEi DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLIIQD. FILEATQDDY 1320 

VLEVRHFQCP KWPNPDSPIS KTFELISVIK EEAANRDGPtt"' IVHDEHGGVT AGTFCALTTL 1380 

MHQLEKENSV DVYOVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG 1440 
AALPDGNIAE SLESLV 

40 

Seg ID NO: 183 dna sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4494 

45 1 11 21 31 41 51 

I I I I . . I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC. CTCTCCACTC . TGAGAAGCAG AGGAGCOGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG. CGAATCCTAA AGCGTTTCCT: CGCTTGCATT 180 

50 CAGCTCCTCT- GTGTTTGCCG- CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA ' 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA' CATGTAATAG- CCCAAAACAA TCTCCTATCA ATATTGATGA. AGATCTTACA- 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA , 420 

AACACATTCft TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

55 GTCAGCGGAG GAGTTTCAGA- AATGGTGTTT AAAGCAAGCA'. AGATAACTTTi TCACTGGGGA. 540 

AAATGCAATA- TGTCATCTGA' TGGATCAGAG CATAGTTT AG AAGGACAAAA - ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCAGAC CGATTTTCAA GTTTTGAGGA' AGCAGTCAAA" 660 

GGAAAAGGGA 1 AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA- AGAAAATTTGv 720 

GATTTCAAAG: CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

60 TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA 5 TTACATTTAC 840 

- AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT; TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT- GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

' TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA - AGCAGTTTGT * 1080 

65 AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC" CTCGAGTCGT- TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA; 1320 

TGCACTAATG: GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

70 AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA' GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620. 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

75 ACAGAAAAAG ATATTTCCTT \ GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG. 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT- ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA" CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT- ATTTTCCTCC 19 B0 

80 GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC . 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT. 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG- 2160 

AGCTTTCTCC AGACTAATTA' CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

85 CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 
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GAATCCGAGA 
CTAGTGGTTC 
TACTTAGAGG 
ATTTCAGATG 
CATGCAAGTA* 
GGTATTACAG 
ATCGTTGCCT 
CTGACTGATT 
GCTGCCCAAG 
AATGTGGAAG 
CAGTACTGGC. 
GTGCAAGTGC 
AAGGGCTCCC 
TGGCCTGACA 
GCCTATGCCA 
AGAACAGGCA 
GTCAACATAT,. 
GAGGAGCAAT 
GAGGTGCTGG. 

ggcaaaacaa: 

GACTATTCTG ■ 

cctgtggaaa 

AATGCCTCCT 
CTCCTTCATA 
GTTATGATTC 
GATGAGCCTA 
CTATCTAATG 
TATGTACTTG 
AGTAAAACTT 
ATGATTGTTC 
CTTATGCACC 
AATCTGATGA' 
ATCCTCAGCC 
GGTGCAGCAT 
AAGGGGTGGG 
ATCAGTCTAG 
GGATTCTGCC 1 
TACTTATTAT 
GGTATTTTTT; 
AAACTACAGA 
AGGTTTGCTA' 
GATATTCAAC 
GTAAATACTG ', 
ATTTTACTAC 
CATTAGCTGG: 
CTTTGTTTCA 
AAGTTTTTAT- 
TGCAAAAATA" 
AAA. 



AGAAGGCAGT 
TTGTGGGTAT 
ACAGTACATC 
ATGTCGGAGC 
GTGGGTTTAC 
CAGACAGCTC 
ATGATCATAG 
ATATCAATGC 
GCCCACTGAA- 
TTATTGTCAT 
CTGCCGATGG 
TTGCCTATTA 
AGAAAGGAAG 
TGGGAGTACC 
AGCGCCATGC 
CATATATTGT 
TTGGCTTCTT. 
ATGTCTTCAT. 

acagtcatat 
agctagagaa 
cagccctaaa 
gatcaagggt 
atatcatggg 
ccatcaagga 
ctgatggcca 
taaattgtga 
aggaaaaact: 
aagtgaggca' 

TTGAACTTAT 
ATGATGAGCA 
AACTAGAAAA' 
GGCCAGGAGT 
TTGTGAGCAC 
TGCCTGATGG 
*GGGACTCACA 
TTCTGTTATC 
GCCAAATTTA 
GTTTGAACTA 
TCTGTATTGA 
AAATGTTTGT 
GAAATATAAC, 
ATTTTACAAC. 
CCCTAGTGTC 
TGAGTCAAGT 
TCTTACTCTA 
GCATGTAATT 
GAGAATAACA 
AATATAAATA 



TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 
TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 
CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 
AATTCCAATA - AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 
TGAAGAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 
CAACCACCCA GACAACAAGC ACAAGAATCG ATACATAAAT 2820 
CAGGGTTAAG. CTAGCACAGC TTGCTGAAAA GGATGGCAAA 2880 
CAATTATGTT GATGGCTACA ACAGACCAAA AGCTTATATT 2940 
ATCCACAGCT GAAGATTTCT GGAGAATGAT ATGGGAACAT 3000 
GATAACAAAC CTCGTGGAGA AAGGAAGGAG AAAATGTGAT ■. 3060 
GAGTGAGGAG TACGGGAACT TTCTGGTCAC TCAGAAGAGT * 3120 
TACTGTGAGG. AATTTTACTC TAAGAAACAC AAAAATAAAA 3180 
ACCCAGTGGA- CGTGTGGTCA CACAGTATCA CTACACGCAG" 3240 
AGAGTACTCC CTGCCAGTGC TGACCTTTGT GAGAAAGGCA 3300 
AGTGGGGCCT; GTTGTCGTCC ACTGCAGTGC TGGAGTTGGA - 3360 
GCTAGACAGT ATGTTGCAGC AGATTCAACA CGAAGGAACT 3420 
AAAACACATC . CGTTCACAAA GAAATTATTT GGTACAAACT - 3480 
TCATGATACA- CTGGTTGAGG'.CCATACTTAG TAAAGAAACT v 3540 
TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA- 3600 
ACAATTCCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT ■ 3660 
GCAATGCAAC AGGGAAAAGA ATCGAACTTC TTCTATCATC 3720 
TGGCATTTCft- TCCCTGAGTG GAGAAGGCAC AGACT ACATC * 3780 
CTATTACCAG- AGCAATGAAT TCATCATTAC CCAGCACCCT 3840 
.TTTCTGGAGG ATGATATGGG- ACCATAATGC. CCAACTGGTG^ 3 900. 
AAACATGGCA- GAAGATGAAT TTGTTTACTG: GCCAAATAAA; 3 960 
GAGCTTTAAG GTCACTCTTA TGGCTGAAGA ACACAAATGT 4020 
TATAATTCAG GACTTTATCT TAGAAGCTAC. ACAGGATGAT- 4080 
CTTTCAGTGT CCTAAATGGC CAAATCCAGA TAGCCCCATT. 4140 
AAGTGTTATA- AAAGAAGAAG CTGCCAATAG GGATGGGCCT: 4200 
TGGAGGAGTG ACGGCAGGAA CTTTCTGTGC TCTGACAACC 4260 
AGAAAATTCC GTGGATGTTT ACCAGGTAGC CAAGATGATC 4320 
CTTTGCTGAC ATTGAGCAGT^ ATCAGTTTCT CTACAAAGTG • 4380 
AAGGCAGGAA GAGAATCCAT CCACCTCTCT GGACAGTAAT 4440 
AAATATAGCT GAGAGCTTAG, AGTCTTTAGT TTAACACAGA 4500" 
TCTGAGCATT GTTTTCCTCT TCCTAAAATT AGGCAGGAAA • 4560 
TGTTGATTTC CCATCACCTG; ACAGTAACTT TCATGACATA • 4620 
TATCATTAAC AATGTGTGCC TTTTTGCAAG - ACTTGTAATT 4 680 
AAATGATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT- 4740 
TTTTAACAGA' AAATTTCAAT TTATAGAGGT TAGGAATTCC 4800 
TTTTAGTGTC AAATTTTTAG CTGTATTTGT AGCAATTATC- 4860 
TTTTAATACA GTAGCCTGTA AATAAAACAC. TCTTCCATAT • 4920 
TGCAGTATTC ACCTAAAGTA GAAATAATCT GTTACTTATT 4980 
TCCATGGACC: AAATTTATAT TTATAATTGT, AGATTTTTAT- 5040 
TTTCTAGTTC TGTGTAATTG TTTAGTTTAA TGACGTAGTT 5100 
CCAGTTTTCTJ GACATTGTAT TGTGTTACCT AAGTCATTAA .5160 
TTAACTTTTG TGGAAAATAG AAATACCTTC ATTTTGAAAG 5220 
CCTTACCAAA CATTGTTCAA ATGGTTTTTA TCCAAGGAAT 5280 
TTGCCATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA. 5340 



Seq ID NO: 184 Protein sequence: 
Protein Accession #: EOS sequence 



1 

I 

MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILPEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV- 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEFS 
GSKTVLRSPH- 
ENISQGYIPS 
TAOPDVGSGR 
TEVTPHAFTP 
LVIVSALTFI 
IKHFPKHVAD 
KLAQLAEKDG 
NLVEKGRRKC 
GRWTQYHYT. 
SMLQQIQHEG 
YVNALLIPGP 
SSLSGEGTDY- 
AEDEFVYWPN: 
CPKWPNPDSP 
SVDVYQVAKM 
AESLESLV 



11 

I 

IQLLCVCRLD 
TQVNVNLKKL. 
GKCNMSSDGS 
LDPKAIIDGV 
DTVSISESQIi 
CSSEPENVQA 
LGAILNNLLP 
EEEGKDIEEG 
GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
CLVVLVGILI 
LHASSGFTEE 
KLTDYINANY 
DQYWPADGSE 
QWPDMGVPEY' 
TVNIFGFIxKH 
AGKTKLEKQF 
INASYIMGYY 
KDEPINCESP 
ISKTFELISV 
INLMRPGVFA 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
■ ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA' 

nstsqpvtkl' 
ntvsiteyee 
vlipesarma 
irvdesektt- 
vnwysqttq 
ywrkcfqtak 
feevqsctvd 
vdgynrpkay: 

EYGNFLVTQK- 
SLPVLTFVRK 
1RSQRNYLVQ 
QLLSQSNIQQ 
QSNEPIITQH 
KVTLMAEEHK 
IKEEAANRDG 
DIEQYQFLYK 



31 
I 

KLVBEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKD I SLTS 
ESLLTSPKLD 



KSPSAGPVMS 
PVYNEASNSS 
FYLEDSTSPR 
LGITADSSNH 
IAAQGPLKST 
SVQVLAYYTV 
AAYAKRHAVG 
TEEQYVFIHD 
SDYSAALKQC 
PLLHTIKDFW 
CLSNEEKLI I 
PMIVHDEHGG 
VILSLVSTRQ 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
FO?SRQVFSSY- 
QQLDGEDQTK- 
DNPELDLFPE- 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWPPSSTDI- 
PHYSTFAYFP 
LESBKKAVIP- 
PISDDVGAIP 
KIVAYOHSRV 
HNVEVIVMIT 



41 

I ' 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI; 
* LQNNFREQQY 
TMIEKPAVLY 
SDQLIVDMPT 
ISTTTHYNRr 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
HESRIGLAEG 
VISTPPTPIF 
PDNKHKNRYI 
AEDFWRMIWS 
RNFTLRNTKI'- 
PVWHCSAGV 
TLVEAILSKE 
NREKNRTSSI 
RMIWDHNAQL 
QDFILEATQD 
VTAGTFCALT 
EENPSTSLDS NGAALPDGKI 



GRTGTYIVLD 
TEVLDSHIHA 
IPVERSRVGI 
WMIPDGQNM- 
DYVLEVHHFQ 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 
.720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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. 21 
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41 



51 
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CACACATACG CACGCACGAT. CTCACTTCGA TCEATACACT GGAGGATTAA AACAAACAAA 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCOGCA 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 
5 CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 
AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 
AAGTAAATGT GAATCTTAAG AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTGGAAA 
ACACATTCAT TCATAACACT GGGAAAACAG TGGAAATTAA TCTCA CTAAT GACTACCGTG 

10 TCAGCGGAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 
AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGA CAAAAA TTTCCACTTG 
AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG- TTTTGAGGAA GCAGTCAAAG 
GAAAAGGGAA GTTAAGAGCT ■ TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATTTGG 
' ATTTCAAAGC * GATTATTGAT" GGAGTCGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCTT 

15 TAGATCCATT" CATACTGTTG' AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 
ATGGCTCATT GACATCTCCT CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGAT A . 
CAGTTAGCAT' CTCTGAAAGC- CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 
CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAG TACA AGT 
TCTCTAGACA GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA- GCAGTTTGTA 

20 GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 
GGGAAAGACC TCGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC. 
AGTTGGATGG AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG . 
GTGCTATTCT CAATAATTTG CTACCCAATA. TGAGTTATGT TCTTCAGATA GTAGCCATAT' 
GCACTAATGG CTTATATGGA* AAATACAGCG ACCAACTGAT TGTCGACATG" CCTACTGATA 

25 ATCCTGAACT TGATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG- 
AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA, 
ACCAAATCAG GAAAAAGGAA CCCCAGATTT CTACCACAAC ACACTACAAT. CGCATAGGGA - 
CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG * AGGAAGTGAA' TTCTCTGGAA 
AGGGTGATGT TCCCAATACA- TCTTTAAATT CCACTTCCCA ACCAGTCACT AAATTAGCCA 

30 CAGAAAAAGA TATTTCCTTG ' ACTTCTCAGA CTGTGACTGA ACTGCCACCT, CACACTGTGG " 
AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATG A 
ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT^ AACAGAATAT GAGGAGGAGA 
GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA' TTCTTCAGGC TCCAGTCCCG 
CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCG 

35 AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG - 
AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG .GAGGGAAATG 
TGTGGTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GGCAGAGAGA- 
GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 
CCTTTTCTGC- AGGCCCAGTG ATGTCACAGG . GTCCCTCAGT.- TACAGATCTG, GAAATGCCAC 

40 ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 
CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA ACCCAACCGG 
TATACAATGA GGCCAGTAAT AGTAGCCATG- AGTCTOGTAT TGGTCTAGCT GAGGGGTTGG 
AATCCGAGAA GAAGGCAGTT ATACCCCTTG- TGATCGTGTC. AGCCCTGACT TTTATCTGTC 
' TAGTGGTTCT TGTGGGTATT CTCATCTACT. GGAGGAAATG CTTCCAGACT GCACACTTTT 

45 ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 
TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 
ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA- AGAGTTTTAC CAGGAAGTGC ■ 
AGAGCTGTAC TGTTGACTTA GGTATTACAG. CAGACAGCTC CAACCACCCA GACAACAAGC 
ACAAGAATCG ATACATAAAT ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 

50 TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 
ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 
GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA 
AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT " 
TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 

55 TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 
CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 
TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 
ACTGCAGTGC TGGAGTTGGA- AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 
AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 

60 GAAATTATTT GGTACAAACT- GAGGAGCAAT ATGTCTTCAT: TCATGATACA CTGGTTGAGG 
CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT. TCATGCCTAT GTTAATGCAC 
TCCTCATT CC TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 
AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 
ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG ' 

65 GAGAAGGCAC AGACTACATC AATGCCTCCT- ATATCATGGG CTATTACCAG AGCAATGAAT 
TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 
ACCATAATGC CCAACTGGTG. GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 
TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 
TGGCTGAAGA ACACAAATGT CTATCTAATG . AGGAAAAACT TATAATTCAG GACTTTATCT 

70 TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA CTTTCAGTGT- CCTAAATGGC 
CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 
CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA- TGGAGGAGTG ACGGCAGGAA 
CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA ' AGAAAATTCC GTGGATGTTT 
ACCAGGTAGC CAAGATGATC AATCTGATGA • GGCCAGGAGT CTTTGCTGAC. ATTGAGCAGT 

75 ATCAGTTTCT CTACAAAGTG ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 
CCACCTCTCT GGACAGTAAT- GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG 
AGTCTTTAGT TTAACACAGA AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT 
TCCTAAAATT AGGCAGGAAA ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG , 
ACAGTAACTT TCATGACATA GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC ' 

80 TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT 
ATTTCTAAGA ATGGAATTGT GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 
TTATAGAGGT TAGGAATTCC AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 
CTGTATTTGT AGCAATTATC AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 
AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 
85 GAAATAATCT GTTACTTATT GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 
TTATAATTGT AGATTTTTAT ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 
TTTAGTTTAA TGAGGTAGTT. CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT 
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TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 5220 

AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 5280 

•ATGGTTTTTA* TCCAAGGAAT ' TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAA - ' ' 

Seq ID NO: 186 Protein sequence-. . 
Protein Accession EOS sequence 

1 11 . 21 31 . 41 51. 

I I. I I ' I I 

MVFKASKITP HWGKCNMSSD GSEHSLEGQK FPLEMQIYCP. DADRFSSFEE AVKGKGKIiRA 60 

LSILFEVGTE ENLDFKAIID GVESVSRFGK QAALDPFILL NLLFNSTDKY YIYNGSLTSP 120 

PCTDTVDWIV FKDTVSISES .QIiAVFCEVLT. MQQSGYVMLM" DYLQNNFREQ. QYKFSRQVFS 180 

SYTGKEEIHE AVCSSEPENV QADPENYTSL LVTWERPRW. YDTMIBKFAV- LYQQLDGEDQ 240 

TKHEFLTDGY QDLGAILNNIi LPNMSYVLQI. VA I CTNGL YG~ KYSDQLIVDM PTDNPELDIiF 300 

PELIGTEEII KEEEEGKDIE 1 EGAIVNPGRD SATNQIRKKE PQISTTTHYN- RIGTKYNEAK 360 

TNRSPTRGSE- FSGKGDVPNT SLNSTSQPVT».KIiATEKDISL. TSQTVTEIiPP HTVEGTSASL 420 

NDGSKTVLRS* PHMNLSGTAE SLNTVSITEY EEESLLTSFK- LDTGAEDSSG SSPATSAIPF' 480 

ISENISQGYI- FSSENPETIT. YDVIiIPESAR- NASEDSTSSG. SEESLKDPSM EGNVWFPSST. 540 

DITAQPDVGS- GRESFLQTNY TEIRVDESEK- TTKSFSAGPV : MSQGPSVTDL.- EMPHYSTFAY- 600 

FPTEVTPHAF- TPSSRQQDLV STVNWYSQT TQPVYNEASN SSHESRIGLA EGLESEKKAV 660 

IPLVIVSALT FICLWLVGII LIYWRKCFQT-" AHFYLEDSTS. PRVISTPPTP IFPISDDVGA 720 

IPIKHFPKHV, ADLHASSGFT EEFETLKEFY QEVQSCTVDIj GITADSSNHP DNKHKNRYIN 780 

IVAYDHSRVK: LAQLAEKDGK- LTDYINANYV. DGYNRPKAYI AAQGPI.KSTA EDFWRMIWEH 840 

NVEVIVMITN' LVEKGRRKCD QYWPADGSEE; YGWFLVTQKS. VQVLAYYTVR NFTLRNTKIK 900 

KGSQKGRPSG. RWTQYHYTQ.: WPDMGVPEYS LPVLTFVRKA • AYAKRHAVGP- VWHCSAGVG; 960 

RTGTYIVLDSf MLQQIQHEGT VNIFGFLKHI; .RSQRNYLYQT- EEQYVFIHDT-' LVEAILSKET 1020 

EVLDSHIHAY- VNALLIPGPA GKTKLEKQFQ' LLSQSNIQQS DYSAALKQCN - REKNRTSS 1 1 1080 

PVERSRVGIS" SliSGEGTDYI-NASYIMGYYQ. SNEFIITQHP LLHTIKDFWR' MIWDHNAQLV 1140 

VMIPDGQNMA- EDEFVYWPNK DEPINCESFK VTLMAEEHKC LSNEEKLIIQ DFILEATQDD - 1200 

YVLEVRHFQC PKWPNPDSPI-" SKTFELISVI KEEAANRDGP MIVHDEHGGV TAGTFCALTT 1260- 

LMHQLEKENS. VDVYQVAKMI NLMRPGVFAD IEQYQFLYKV ILSLVSTRQE ENPSTSLDSN 1320 
GAALPDGNIA ESLESLV " 

Seq ID NO: 187 DMA sequence 

Nucleic" Acid Accession #: EOS sequence 

Coding sequence: 148-4632 

1 11 21 31 41 51 

II" I I I I 

CACACATACG CACGCACGAT, CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG. CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG . TCTGGAAATG. CGAATCCTAA AACGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG ■ GCTAATGGAT ACTACAGACA- ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG. GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA- AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA- TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTT AAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA. CTGACAAGTA- TTACATTTAC 840 

AATGGCTCAT. TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG : CCAGTTGGCT GTTTTTTGTG , AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA A GCAGTT TGT 10 B0 

AGTTCAGAAC CAGAAAATGT. TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAG7TGGATG. GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT. GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT. CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

AOGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

.GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC. AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 204 0 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100. 

GTGTGGTTTC ' CTAGCTGTAC . AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT. GATGTCACAG- GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 

CTAGTGGTTC TTGTGGGTAr TCTCATCTAC .TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 2760 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 2820 
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* CACAAGAATC , GAT ACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 2880 

CTTGCTGAAA" AGGATGGCAA ACTGACTCAT TATATCAATG CCAATTATGT TGATGGCTAC 2940 " - . - 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 3000 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 3060 . ._ 

5 AAAGGAAGGA GAAAATGTGA' TCAGTACTGG CCTGCCGATG GGAGTGAGGA GT ACGGG AAC 3120 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 3180 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 3240 ■ 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 3360 

10 . CAC7GCAGTJG; CTGGAGTTGG AAGAACAGGC, ACATATATTG TGCTAGACAG TATGTTGCAG 3420 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 3480 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 3540 

GCCATACTTA GTAAAGAAAC . TGAGGTGCTG? GACAGTCATA TTCATGCCTA TGTTAATGCA 3600 

CTCCTCATTC CTGGACCAGC AGGCAAAACA.AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660 

15 ' CTGTCACCCA* GGCTGGAGTG CAGAGGCACA- ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 3720 

GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 3780 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 3840 

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA ' 3900 

GAAGGCACAG . ACTACATCAA TGCCTCCTAT. ATCATGGGCT; ATTACCAGAG ' CAATGAATTC 3960 

20 ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA> AGATGAATTT 4080 

. GTTTACTGGC. CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 4140 

GCTGAAGAAC ACAAATGTCT. ATCTAATGAG" GAAAAACTTA- TAATTCAGGA CTTTATCTTA 4200 

. GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT . TTCAGTGTCC TAAATGGCCA 4260 

25 AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 4320 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT /• 4380 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440 

CAGGTAGCCA- AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 4500 

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT. GTGGGCACAA GGCAGGAAGA GAATCCATCC 4560 

30 ACCTCTCTGG ACAGTAATGG TGCAGCATTG. CCTGATGGAA ATATAGCTGA GAGCTTAGAG 4620 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 4680 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 4740 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA* TGTGTGCCTT 4800 

TTTGCAAGAC TTGT AATTTA - CTTATT ATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 4860 

35 TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT. TTAACAGAAA ATTTCAATTT 4920 

ATAGAGGTTA- GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA' ATTTTTAGCT 4980 - 

GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040 

TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 5100 

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT . 5160 

40 ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 5220 

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC . AGTTTTCTGA CATTGTATTG 5280 

TGTTACCTAA GTCATTAACT ■ TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 5340 

ATACCTTCAT. TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 5400 

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 5460 
45 AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: 188 Protein sequence: 
Protein Accession #: EOS sequence 

50 1 . 11 21 31 41 51 

||, I I I I 

MRILKRFIAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDI* TQVNVNLKKL KFQGWDKTSL BNTPIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW: GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

55 ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFIIiLNIi LPNSTDKYYI YNGSI/TSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVPSSY 300 

TGKEEIHEAV CSSEPENVQA. DPENYTSLI.V TWERPRWYD TMIEKFAVLY . QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE- 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ, ISTTTHYNRI GTKYNEAKTN 480 

60 RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS; QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKIiD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSB ESLKDPSMEG' NVWFPSSTDI 660 , 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYPP 720 - 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 7 80 

65 LVIVSALTFI CLWLVGILI ' YWRKCPQTAH PYLEDSTSPR VISTPPTPIF PISDDVGAIP 840 

IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKKRYINIV 900 

AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA QGPLKSTAED FWRMIWEHNV 960 

EVIVMITNLV .EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ VLAYYTVRNF TLRNTKIKKG 1020 

SQKGRPSGRV- VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRHAVGPW VHCSAGVGRT 1080 

70 GTYIVLDSML . QQIQHEGTVN IF GFLKHIRS QRNYLVQTEE QYVFIHDTLV EAILSKETEV 1140 

LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA 1200 

SRVAGTILLS QSNIQQSDYS "AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS 1260 

YIMGYYQSNE FIITQHPLLH TIKDFWRMIW DHNAQLWMI PDGQKMAEDE FVYWPNKDEP 1320 

INCESFKVTL KAEEHKCLSN EEKLIIQDPI LEATQDDYVL EVRKFQCPKW PNPDSPISKT 1380 

75 FELISV1KEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QbEKENSVDV YQVAKMINU4 1440 

RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA LPDGNIAESb ESLV 



Seq ID NO:. 189 DNA sequence 
80 Nucleic Acid Accession §: NM_002820 
Coding sequence : '304 . .831 

1. 11 .21" ' 31 41 51 

85 CCGGTTCGCA AAGAAGCTGA CTTCAGAGGG GGAAACTTTC TTCTTTTAGG AGGCGGTTAG 60 
. CCCTGTTCCA " CGAACCCAGG AGAACTGCTG GCCAGATTAA TTAGACATTG CTATGGGAGA 120 
CGTGTAAACA CACTACTTAT CATTGATGCA TATATAAAAC CATTTTATTT TCGCTATTAT 180 
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TTCAGAGGAA GCGCCTCTGA TTTGTTTCTT TTTTCCCTTT TTGCTCTTTC TGGCTGTGTG 240 
GTTTGGAGAA AGCACAGTTG. GAGTAGCCGG TTGCTAAATA AGTCCCGAGC GCGAGCGGAG . 300 
ACGATGCAGC GGAGACTGGT TCAGCAGTGG AGCGTCGCGG TGTTCCTGCT GAGCTACGCG . 360 
GTGCCCTCCT GCGGGCGCTC GGTGGAGGGT CTCAGCOGCC GCCTCAAAAG AGCTGTGTCT 420 
GAACATCAGC TCCTCCATGA CAAGGGGAAG TCCATCCAAG ATTTACGGCG ACGATTCTTC * 480 
CTTCACCATC TGATCGCAGA AATCCACACA GCTGAAATCA GAGCTACCTC GGAGGTGTCC 540 
CCTAACTCCA AGCCCTCTCC CAACACAAAG AACCACCCCG TCCGATTTGG GTCTGATGAT 600 
GAGGGCAGAT ACCTAACTCA GGAAACTAAC AAGGTGGAGA CGTACAAAGA GCAGCCGCTC 660 
' AAGACACCTG GGAAGAAAAA - GAAAGGCAAG CCCGGGAAAC GCAAGGAGCA GGAAAAGAAA ■ 720 
AAACGGCGAA CTCGCTCTGC. CTGGTTAGAC TCTGGAGTGA CTGGGAGTGG , GCTAGAAGGG 780 
GACCACCTGT CTGACACCTC CACAACGTCG CTGGAGCTOG ATTCACGGTA ACAGGCTTCT. 840 
CTGGCCCGTA' GCCTCAGCGG GGTGCTCTCA GCTGGGTTTT- GGAGCCTCCC TTCTGCCTTG . 900 
GCTTGGACAA. ACCTAGAATT TTCTCCCTTT ATGTATCTCT ATCGATTGTG. TAGCAATTGA , 960 
CAGAGAATAA CTCAGAATAT TGTCTGCCTT- AAAGCAGTAC CCCCCTACCA CACACACCCC 1020 
TGTCCTCCAG. CACCATAGAG AGGCGCTAGA GCCCATTCCT CTTTCTCCAC CGTCACCCAA . 1080 
CATCAATCCT TTACCACTCT ACCAAATAAT. TTCATATTCA AGCTTCAGAA GCTAGTGACC - 1140 
ATCTTCATAA .TTTGCTGGAG. AAGTGTATTT CTTCCCCTTA- CTCTCACACC TGGGCAAACT 1200 
TTCTTCAGTG TTTTTCATTT CTTACGTTCT TTCACTTCAA GGGAGAATAT AGAAGCATTT 1260 
GATATTATCT ACAAACACTG CAGAACAGCA' TCATGTCATA AACGATTCTG AGCCATTCAC 1320 
ACTTTTTATT TAATTAAATG TATTTAATTA AATCTCAAAT TTATTTTAAT GTAAAGAACT 1380 
TAAATTATGT TTTAAACACA TGCCTTAAAT TTGTTTAATT- AAATTTAACT CTGGTTTCTA ■ 1440 
CCAGCTCATA-- CAAAATAAAT GGTTTCTGAA AATGTTTAAG TATTAACTTA CAAGGATATA 1500 
GGTTTTTCTC ATGTATCTTT TTGTTCATTG GCAAGATGAA ATAATTTTTC TAGGGTAATG 1560 
CCGTAGGAAA AATAAAACTT CACATTTAAA AAAAA 



Seq ID NO: 190 Protein sequence: 
Protein Accession #: NP_002811 



1 11 21 31 41 51 

I I I III 

MQRRLVQQWS VAVFLLSYAV PSCGRSVBGL SRRLKRAVSE HQLLHDKGKS IQDLRRRFFL 60 

KHLIAEIHTA EIRATSEVSP NSKPSPNTKN HPVRFGSDDE GRYLTQETNK VETYKEQPLK 120 
TPGKKKKGKP GKRKEQEKKK RRTRSAWLDS GVTGSGLEGD HLSDTSTTSL ELDSR 

Seq id NO j 19.1 DNA sequence 
Nucleic Acid Accession XM_059328 
Coding sequence t 52.. 1023 

1 11 21 31 41 51 

I I I I I I 

GGGCTGTCCG GCXTCACTCCC CTGGGAGCGC GAGCGGTGGA CCCAGGCGGC CATGTCCCGC 60 

CCTCGCATGC GCCTGGTGGT CACCGCGGAC GACTTTGGTT ACTGCCCGCG ACGCGATGAG 120 

GGTATCGTGG AGGCCTTTCT GGCCGGGGCT GTGACCAGCG TGTCCCTGCT GGTCAACGGT 180 

GCGGCCACGG AGAGCGCGGC GGAGCTGGCC CGCAGGCACA GCATCCCCAC GGGCCTCCAC 240 

GCCAACCTGT CCGAGGGCCG CCCCGTGGGT CCGGCCCGCC GTGGCGCCTC ATCGCTGCTC 300 

GGCCCGGAAG GCTT CTTCCT TGGCAAGATG GGATTCCGGG AGGCGGTGGC GGCCGGAGAC 360 

GTGGATTTGC CTCAGGTGCG GGAGGAGCTC GAGGCCCAAC TAAGCTGCTT CCGGGAGCTG 420 

CTGGGCAGGG CCCCCACGCA^ CGCGGACGGG CACCAGCACG TGCACGTGCT CCCAGGCGTG 480 

TGCCAGGTGT TCGCCGAGGC GCTGCAGGCC TATGGGGTGC GCTTTACGCG ACTGCCGCTG 540 

GAGCGCGGTG. TGGGTGGCTG CACTTGGCTG GAGGCCCCCG CGCGTGCCTT CGCCTGCGCC 600 

GTGGAGCGCG ACGCCCGGGC. CGCCGTGGGC CCCTTCTCCC GCCACGGCCT GCGGTGGACA 660 

GACGCCTTCG- TGGGCCTGAG'; CACTTGCGGC CGGCACATGT CCGCTCACCG CGTGTCCGGG 720 

GCCCTGGCGC GGGTCCTGGA^ AGGTACCCTA GCGGGCCACA CCCTGACAGC CGAGCTGATG 780 

GCGCACCCCG' GCTACCCCAG* TGTGCCTCCC ACCGGCGGCT GCGGTGAAGG CCCCGACGCT 840 

TTCTCTTGCT CTTGGGAGCG GCTGCATGAG CTGCGCGTCC TCACCGCGCC CACGCTGCGG 900 

GCCCAGCTTG CCCAGGATGG CGTGCAGCTT TGCGCCCTCG ACGACCTGGA CTCCAAGAGG 960 

CCAGGGGAGG AGGTCCCCTG TGAGCCCACT CTGGAACCCT TCCTGGAACC CTCCCTACTC 1020 

TGACCCCCTA CAGACAACCA AGCACTAATC CCCTTAGTAC CAAGAAAGGG GAGCCAGGAT 1080 

TTAGTCCTGG CCCAGCCCAG AGCTGGGACC TGGAGCACGA TCTGTTGACT- TCCCTGGGTA 1140 

GGACACTGCC ACCTCTGGGC TCAGGTCCTC ATGCCTCCAA ATGGCATCTA GAGTTTGAGC 1200 

AGCCTTCTTG GCTGCAGGCA GGCCTAGCCT GTGGCAGCGG GCTAGGGCCC GCAGAGCATT 1260 

TGGTGCCCCT CCATGTTGCA ATGCAAACAC CTTCACCACT GGGGCAGTGG GGAGAGATGG 1320 
CTATATTAAT AAAATAACGT GTGTCTTTC 

Seq ID NO: 192 Protein sequence: 
Protein Accession #: XP_059328 



1 11 21 31 41 51 

I I I I I I 

MSRPRMRLW TADDFGYCPR RDEGIVKAFL AGAVTSVSLL VNGAATESAA ELARRHSIPT 60 

GLHANLSEGR PVGPARRGAS SLLGPEGFFL GKMGFREAVA AGDVDLPQVR EELEAQLSCF -120 

RELLGRAPTH ADGHQHVHVL PGVCQVPAEA JjQAYGVRFTR LPLERGVGGC TWLEAPARAF 180 

ACAVERDARA AVGPFSRHGL RWTDAFVGLS TCGRHMSAHR VSGALARVLE GTLAGHTLTA 240 

ELMAHPGYPS VPPTGGCGEG PDAFSCSHER LHELRVLTAP TLRAQLAQDG VQLCALDDLD 300 

SKRPGEEVPC EPTLEPFLEP SLL 

Seq ID NO: 193 DNA sequence 

Nucleic Acid Accession #: NM_00568B.l 

Coding sequence: 126.. 4439 

1 11 21 31 41 51 

I I I I I I 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 
AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120 
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AGAAGATGAA GGAT ATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180 
GTGTGAGGGA GAGAACCAGC ACTTCTGGGA * CGCACAGAGA CCGTGAAGAT TCCAAGTTCA " 240" 
GGAGAACTCG ACOGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCOGA GCCGAGGGCC 300 
TCTCTCTTGA TGCCTOCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 
GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 
ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 
CCCGTGTGGC CCACAAGAAG . GGGGAGCTCT. CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 
ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 
AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 
TCATCCTGTC CATCGTGTGC CTGATGATCA- CGCAGCTGGC TGGCTTCAGT GGACCAGCCT. 720 
TCATGGTGAA ACACCTCTTG ' GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 . 
TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA* 840 
CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 
TTAAGAAGAT. CCTTAAGTTA AAGAACATTA- AAGAGAAATC CCTGGGTGAG : CTCATCAACA 960 
TTTGCTCCAA CGATGGGCAG AGAATGTTTG - AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG; 1020 
GAGGACCCGT- TGTTGCCATC TTAGGCATGA- TTTATAATGT AATTATTCTG GGACCAACAG 1080 
GCTTCCTGGG; ATCAGCTGTT, TTTATCCTCT. TTTACCCAGC AATGATGTTT GCATCACGGC 1140. 
TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 
ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC- CTGGGTCAAA' GCATTTTCTC 1260 
AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT. GGAAAAAGCC GGGTACTTCC ■ 1320 
AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT "• 1380 
CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG . 1440 
TCTTCAATTC. CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 
AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 
TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 
GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG: 1680 
ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 
AGGCGGTGCT - GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 
CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 
ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920 
GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 
TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC ". 2040 
TGAGAGACAA' CATCCTGTTT- GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA- 2100; 
ACAGCTG CTG/ CCTGAGGCCT. GACCTGGCCA TTCTTCCCAG CAGCGACCTG. ACGGAGATTG 2160 
GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220 
TGTATAGTGA" CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA' GATGCCCATO- 2280 
TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT. 2340 
TTGTTACCCA-CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 
GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA - 2460: 
CCATTTTTAA TAACCTGTTG. CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520 
AAACCAGTGG: TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 
AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 
GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 
TCCTGGTTAT TATGGCCCTT. TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760 
GGTTGAGTTA' CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA: 2820 
CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 
CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940 
GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 
CTATGAAGTT TTTTGACACG' ACCCCCACAG GGAGGATTCT" CAACAGGTTT TCCAAAGACA 3060 
TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG: AACGTTATCC 3120 
TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 
GGCCCCTTGT. CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 
TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 
AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGAT ACCAGG " 3360 
AGCTGCTGGA- TGACAACCAA', GCTCCTTTTT TTTTGTTTAC GTGTGCGATG- CGGTGGCTGG 3420 
CTGTGCGGCT GGACCTCATC AGCATCGCCC' TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 
TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT' GCTGTCCAGT 3540 
TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT ' 3600 
CGGTGGAGAG GATCAATCAC; TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660 
AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 
AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC* 3780 
CTAAAGAGAA GATTGGCATT' GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 
CCCTCTTCCG. TCTGGTGGAG; TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 
GTGATATTGG" CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 
TGTTCAGTGG, CACTGTCAGA TCAAATTTGQ ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 
TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 
TTGAATCTGA- AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 
GCATAGCTAG AGCCCTGCTC' CGCCACTGTA AGATTCTGAT TTTAGATGAA- GCCACAGCTG 4200 
CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 
GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 
TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 
GTTCCCGATT. CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 
TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 
CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT- TCGCACAGCA 4560 
GTTCCGGATT. GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT. 4620 
ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT- AAAAGGTTCA 4680 
GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 
TCTATATATA ATTCTGTACA' TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 
TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860 
TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 
CTCTAGCTGG TGGTTTCACG. GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 
ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 
GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 
CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160 
TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 
TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 
TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 
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GTTGGTTCCA AGCCCTGGAG CCAACTGCTG 
ATTCCCACAC CTOCACAGTT CAGTGGCAGG 
CTCACCGCAG TCGTCGCACA GTCTCTCTCT 
CAGCTCTTGC TAATCAGTGT CTCACACTGG 
ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG 
GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC 
ATGTCGTGAC CAACTAGACA TTCTGTCGCC 
CAAAAATCTG AAAATGTGAA • TAAAATTATT 
AAAAAAAAAA AAAAAAAA 



3 



CTTTTTGAGG 
GCTCAGGATT 
CTCTCTCCCC 
CGTAGAAGTT 



TGCTGTCATC 
TTAGCATGTT 
TTGGATTTTG 



TGGCACTTTT 
TCGTGGGTCT 
TCAAAGTCTG 
TTTGTACTGT 
GCAAACCCCC 
AGTTGAATGG 
TGCTGAACAC 
TAAAAAAAAA 



TCATTTGCCT ' 

GTTTTCCTTT 

CAACTTT AAG * 

AAAGAGACCT 

TTTGTGCTGT 

TCAGCGTTGC 

CTTGTGGAAG 

AAAAAAAAAA 



5400 
5460 
5520. 
5580 
5640 
5700 
5760 
5820 
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Seq ID NO: 194 Protein sequence-. 
Protein Accession §« NP_0O5679.1 



MKDIDIGKEY 
LDASMHSQLR 
VAHKKGELSM 
LSIVCLMITQ 
ALNYRTGVRL 
PWAIliGMIY 
VLTYIKFIKM 
HMTLiGFDLTA 
NKPASPHIKI 
VLAEQKGKIaL 
KTSLISAIIjG 
CCLRPDLAIL 
NHTFNSAIRK 
FNNIiLLGETP 
VPWSVYGVYl 
VSDSMKDNPH 
KFFDTTPTGR 
LVILFSVLHI 
LDDNQAPFFI* 
GLFQFTVRLA 
RYRENLPLVL 
IGLADLRSKL 
SEVMENGDNF 
MLTIAHRLHT 



11 

I ■ 

IIPSPGYRSV 
ILDEEHPKGK 
EDVWSLSKHE 
LAGFSGPAFM 
RGAILTMAFK 
NVI ILGPTGF 
YAWVKAFSQS 
AQAFTWTVF 
EMKNATLAWD- 
LDSDERPSPE 
QMTLLEGSIA- 
PSSDLTBIGE" 
HLKSKTVLFV 
PVEINSKKET 
QAAGGPLAFL 
MQYYASIYAL 
ILNRFSKDMD 
VSRVIiIRELK 
FTCAMRWLAV 
SETEARFTSV 
KKVSFTIKPK 
SIIPQEPVLF 
SVGERQIibCI 
VLGSDRIMVL 



21 
1 

RERTSTSGTH 
YHHGLSALKP' 
SSDVNCRRLE 
VKHLLEYTQA 
KILKLKNIKE 
LGSAVFILFY 
VQKIREEERR 
NSMTFALKVT. 



EEEGKHIHLG. 
ISGTFAYVAQ 
RGANLSGGQR 
THQLQYLVDC 
SGSQKKS0DK 
VIMALFMLNV 
SMAVMLILKA 
EVDVRLPFQA 
RU3NITQSPF 
RLDXiISIALI 
ERINKYIKTL 
EKIGIVGRTG 
SGTVRSNLDP 
ARALLRHCKI 
AQGQWEFDT 



31 

r 

RDREDSKFRR 
IRTTSKHQHP 
RLWQEELNEV 
TESNLQYSLIi 
KSLGELINIC 
PAMMFASRLT 
ILEKAGYFQG 
PFSVKSLSEA 
KLTPKMKKDK 
HLRLQRTLHS ' 
QAWILNATLR 
QRISLARALY 
DEVIFMKEGC 
GPKTGSVKKE 
GSTAFSTWWL 
IRGWFVKGT 
EMFIQNV1LV 
LSHITSSIQG- 
TTTGLMIVLM 
SLEAPARIKN 
SGKSSLGMAL 
FNQYTEDQIW-' 
LILDEATAAM 
PSVLLSNDSS 



41 
I 

TRPLECQDAL 
VDNAGLFSGM 
GPDAASLRRV 
LVLGIiLLTEI 
SNDGQRMFEA 
AYFRRKCVAA 
ITVGVAPIW 
SVAVDRFKSI. 
RASRGKKEKV- 
IDLEIQEGKL 
DNIIiFGKEYD 
SDR'SIYILDD 
ITERGTHEEL 
KAVKPEEGQI* 
SYWIKQGSGN 
LRASSRLHSE 
FFCVGMIAGV 
IATIKAYNK6 
HGQIPPAYAG 
KAPSPDWPQB 
FRIjVELSGGC 
DALERTHMKE 
DTETDLLIQE 
RFYAMFAAAE 



51 
I 




1 


60 


TFSWLSSLAR 


120 


VWIFCRTRLI 


180 


VRSWSLALTW 


240 


AAVGSLXjAGG ' 


300 ' 


TDERVQKMNE 


360 


VIASWTFSV' 


420 


FLMEEVHMIK 


480 


RQLQRTEHQA 


540 


VGICGSVGSG 


600 


EERYNSVLNS 


660 


PLSALDAHVG. 


720 


MNLNGDYATI 


780 


VQLEEKGQGS 


B40. 


TTVTRGNETS ' 


900 


LFRRILRSPM 


960 


FPWFLVAVGP 


1020 


QEFLHRYQEL 


1080 


LAlSYAVQIiT 


1140 


GEVTFENAEM - 


1200 


IKIDGVRISD 


1260 


CIAQLPLKLE 


1320 


TIREAFADCT 


1380 


NKVAVKG 





Seq ID NOs 195 DNA sequence 
Nucleic Acid Accession #: NM_006470 
Coding sequence : 2 2 8 . . 1 92 2 



1 

I 

GCTGTCCTGA 
CGCCAGCACA 
TTGCAGCAGC 
TGGGCCAAGG 
ATCTAATGGC 
CAGACTCTGG 
TGGGCTCCTC 
AGGGGGATCC 
GAAGAGTGAA- 
TGCAGCCGCA. 
ACCACAACTG 
ATCAGCAGTG 
TGGATGCAGC 
GGAAACTCAA 
TGGTGTCGGT 
CTGTGAGGAA' 
TGAGCCAGGC 
GCAAGCAGGA 
ACTGCAAGTT 
ATAAACTCTC 
TGGAGAACTA 
CTCAAGTGTC 
GGGAACAGTT 
ATCTCCGGCT 
ACCCGGACCT 
ACCTGCACAG" 
CCTGCAAAGG 
TCTCCTGGAG 
CCCCACTCAA 
TCCTTTCCTT 
AATTTTCAGA 
TTGTAGATCT 
* AGACTCCAGG 
GGTGATTTGT 
TCTGAATGAA 
GTTTGCAGTA 
AGAGCAGTGG 
CTGCCT CAGC 
TTTGTATTTT 



11 . 
I 

GCCTGAGTAC 
CAGTAATGAG 
.TGCAATCATC 
GACAGAAGAA 
TCCAGGGCCA 
GTCACCCAGC 
GGAGAAGCTT 
TGCTGGTGAG 
GGCAGTGAAG 
TCAGGTGAAC 
GCGATACTGC 
CATCTGCCAG 
CCGCAGGGAC 
GTTGAATGAA. 
GTCAGAGGTC. 
GGCCCAGGCC 
CAAOGGTATC. 
GCTGGAGAGG 
TAAGAACACT 
GGGCATCCGC 
TAAGAAAAAG 
TGCCGTTGTT 
CCTCCAATAT- 
GCAGGAGG AG 
CCCCAGCAGG 
GTACTATTTT 
CATCGACCGG 
CCTCCAATGG 
AGCTGGCCCT 
CTATGGCGTA 
ACCAGTCTAT 
GGGAGAGGAA 
AGCCATATCC 
GGGCAGAAAT 
AACATTCTCC 
ATTCTTTTTT 
CGCGATCTTG 
CTCCCGAGTA 
TAGTAGAGAT 



21 
I 

TCTAGCTGCC 
TGGCCGAGCT 
TAGGCGTGGT- 
AGACAGCCTA 
CTGCCCAGGG 
CCAGATTCTG 
GGCAGGGAGA 
GGGAAAGAGG' 
TCCTGTCTAA 
ATCAAACTGC 
CCTGCCCACC 
GACTGTTGCC 
AAGGAGGCTG 
AATGCCATCT 
AAAGCGGTGG 
AATGTGATGC 
AAGGCCCACC 
ATGGCGGCCA 
GAAGACATCA 
AAAGTTATCA' 
CTCCAGGAGT 
CAGCGCAAAT 
GCGTATGACA- 
AACCGCAAGG" 
TTCCTGCACT. 
GAGGTGGAGA'" 
AAAGGGGAGG" 
AACGGGAAGG 
TTCCGGAGGC 
GAGTATGATA 
GCTGCCTTCT 
CCCGAGAAGC 
CAGACCTTTG 
AACTGCTGAT 
AGCTGCTCTC 
TTTTTTTTGA 
GCTCACTGCA 
GCTGGGATTA 
GGGGTTTCAC 



31 
I 

TTGTCGCCAT 
TCCTCTGGGA 
TCTCTTGTCT 
GGAGCAGAGC 
CCACTGCTCA 
GGTCAGCCAG 
CGGAGGAACA- 
TCCTGTGTGA- 
CCTGCATGGT 
AAAGCCACCT 
ACAGCCCACT. 
AGGAGCACAG 
AACTCCAGTG 
CCAGGCTCCA 
CTGAAATGCA 
TCTTCTTAGA 
TGGAGTACAG 
TCAGCAACAC 
CCTTCCCTAG 
CGGAATCCAC 
TTTCCAAGGA 
ATTGGACTTC 
TCACGTTTGA. 
TCAGCAACAC 
GGCGGCAGGT. 
TCTTCGGGGC 
AGCGCAACAG 
AGTTCACGGC 
TCGGGGTCTA 
CCATGACTCT 
GGCTTTCCAA 
CAGCACCGTC 
CCAGCTACAG 
GGTAGCTGGC 
TTTTGCTCCA 
GACGGAGTCT 
AGCTCCGCCT 
CAGGTGCCTG 
CATGTTGGCC 



41 
I 

CGCATCTGGC 
GGGAGGAAAC 
GACTTGGGCT 
CTCCCAGATG 
GCCCCCAGCC 
CCCAGTGGAA 
GGACAGCGAC 
CTTCTGCCTT 
GAATTACTGT 
GCTGACCGAG 
GTCTGCTTTC 
TGGCCACACC 
CACCCAGTTA 
GGCTAACCAA 
GTTTGGGGAA 
GGAGAAGGAG 
GAGTGCCGAG 
TGTCCAGTTC 
TGTTTACGTA 
TGTACACTTA 
AGAGGAGTAT 
CAAACCTGAG 
CCCGGACACA 
CACGCCCTGG 
GCTGTCCCAG 
AGGCACCTAT 
TTGCATTTCC 
CTGGTACAGT 
TATCGACTTC 
GGTTCACAAG 
GAAGGAAAAC 
CTTGGGGGTG 
TGATGGGATT 
TTTTGAAATC 
TATGGTGCTG 
CGCACTGTTG 
CCCGAGTTCA 
CCACCACACC 
AGGCAGATCT 



TGCCATCCAG 


60 


AGTTAAAATC 


120 


GCACAGATCC 


180 


GCTGAGTTGG y 


240 


CCTCTCAGCC 


300 


GAAGAGGACG 


360 


TCTGCAGAGC. 


420 


GATGACACCA 


480 


GAAGAGCACT 


540 


CCAGTGAAGG 


600 


TGCTGCCCTG 


660 


ATAGTCTCCC 


720 


GACTTGGAGC 


780 


AAGTCTGTTC 


840 


CTCCTTGCTG 


900 


CAAGCTGCGC 


960 


ATGGAGAAGA • 


1020 


TTGGAGGAGT 


1080 


GGGCTGAAGG 


1140 


ATCCAGTTGC 


1200 


GACATCAGAA - 


1260 


CCCAGCACCA 


1320 


GCACACAAGT 


1380 


GAGCATCCCT 


1440 


CAGAGTCTGT 


1500 


GTTGGCCTGA 


1560 


GGAAACAACT 


1620 


GACATGGAGA 


16B0 


CCGGGAGGGA 


1740 


TTTGCCTGCA -■ 


1800 


GCCATCCGGA' 


1860 


ACTGCTCCCT- 


1920 


TGCATTTTAG 


1980 


CTATGGGGTC 


2040 


TTCTCTATGT 


2100 


CCCAGGCTGG 


2160 


AGCAATTCTC 


2220 


CAGCTAATGT 


2280 


CAAACTCCTG 


• 2340 



266 



WO 02/086443 V. : \ * - : / 

ACCTCGTGAT GCACCCACCT GGGCCTCCCA AAGTGCTGGG ATTACATGCG TGAGCCACTG 24O0 
CGCCCTGCCT " GTTTGTAGTA- ATTTTTAGGC ACCAAATCTC CCTCATCTTC TAGTGOCATT 2460 
CTCCTCTCTG"' TTCAGGTAAA- TGTCACACTG TGCCCAGAAT GGATGACCA6 GAACCTTAAA - 2520 
GAGTGGCTGA AAAGATTGCA' GAGTTATCAT AATAAATTGC TAACTTGCGT 



Seq ID. NO: 196- Protein sequence: 
Protein Accession 8: : NP — 006461 ' . 

1 11 21 31 41- 51 

I I I I 1 .1 

MAELDLMAPGT PLPRATAQPP APLSPDSGSP SPDSGSASPV EEEDVGSSEK . LGRETEBQDS • 60 

DSAEQGDPAG' EGXEVLCDFC LDDTRRVKAV KSCLTCMVNY CEEHLQPHQV NIKLQSHLLT 120 

EPVKDHNWRY CPAHHSPLSA FCCPDQQCIC QDCCQEHSGH TIVSLDAARR DKEAELQCTQ 180 

LDLERKLKLN-: ENAISRLQAN QKSVLVSVSE VKAVAEMQFG" ELLAAVRKAQ ANVMLFLEEK 240 

EQAALSQANG: IKAHLEYRSA EMEKSKQELE RMAAISNTVQ FLEEYCKFKN- TEDITFPSVY - 300 

VGLKDKLSGI RKVITESTVH*' LIQLLENYKK KLQEFSKEEE YDIRTQVSAV VQRKYWTSKP - 360 

EPSTREQFLQ; YAYDITFDPD- TAHKYLRLQE ENRKVTNTTP WEHPYPDLPS RFLHWRQVLS 420 

QQSLYLHRYY- FEVEIFGAGT- YVGLTCKGID RKGEERNSCI SGNNFSWSLQ WNGKEFTAHY 480 

SDMETPLKAG: PFRRLGVYID FPGGILSFYG VEYDTMTLVH KFACKFSEPV YAAFWLSKKE 540 
NAIRIVDLGE- EPEKPAPSLG. VTAP , 

Seq ID NO: 197 DNA sequence 
Nucleic Acid Accession NM_004316 
.Coding sequence: 433-1149 

1 11 21 31 41 51 

I I ' I I • I I ' 

CCCGAGACCC GGCGCAAGAG AGCGCAGCCT TAGTAGGAGA GGAACGCGAG ACGCGGCAGA 60 

GCGCGTTCAG CACTGACTTT. TGCTGCTGCT TCTGCTTTTT TTTTTCTTAG AAACAAGAAG 120 

GCGCCAGCGG: CAGCCTCACA CGCGAGCGCC ACGCGAGGCT CCCGAAGCCA ACCCGCGAAG 180 

GGAGGAGGGG AGGGAGGAGG AGGCGGCGTG CAGGGAGGAG AAAAAGCATT TTCACCTTTT 240 

TTGCTCCCAC TCTAAGAAGT CTCCCGGGGA TTTrGTATAT; ATTTTTTAAC TTOCGTCAGG 300 

GCTCCCGCTT CATATTTCCT TTTCTTTCCC TCTCTGTTCC TGCACCCAAG TTCTCTCTGT 360 

GTCCCCCTCG CGGGCCCCGC ACCTCGCGTC CCGGATCGCT CTGATTCCGC GACTCCTTGG 420 

CCGCCGCTGC GCATGGAAAG CTCTGCCAAG. ATGGAGAGCG GCGGCGCCGG. CCAGCAGCCC 490 

' CAGCCGCAGC CCCAGCAGCC CTTCCTGCCX3 CCCGCAGCCT GTTTCTTTGC CACGGCCGCA 540 

GCCGCGGCGG:: CCGCAGCCGC CGCAGCGGCA^ GCGCAGAGCG" CGCAGCAGCA GCAGCAGCAG 600 

CAGCAGCAGC AGCAGCAGCA GCAGGCGCCG CAGCTGAGAC CGGCGGCCGA' CGGCOVGCCC 660 

TCAGGGGGCG GTCACAAGTC AGCGCCCAAG CAAGTCAAGC GACAGCGCTC GTCTTCGCCC 720 

GAACTGATGC GCTGCAAACG' CCGGCTCAAC TTCAGCGGCT TTGGCTACAG CCTGCCGCAG 780 

CAGCAGCCGG CCGCCGTGGC GCGCCGCAAC GAGCGCGAGC GCAACCGCGT CAAGTTGGTC 840 

AACCTGGGCT TTGCCACCCT TCGGGAGCAC GTCCCCAACG GCGCGGCCAA CAAGAAGATG 900 

AGTAAGGTGG AGACACTGCG CTCGGCGGTC GAGTACATCC GCGCGCTGCA GCAGCTGCTG 960 

GACGAGCATG ACGCGGTGAG CGCCGCCTTC CAGGCAGGCG TCCTGTCGCC CACCATCTCC 1020 

CCCAACTACT CCAACGACTT GAACTCCATG GCCGGCTCGC CGGTCTCATC CTACTCGTCG 1080 

GACGAGGGCT CTTACGACCC GCTCAGCCCC GAGGAGCAGG AGCTTCTCGA CTTCACCAAC 1140 

TGGTTCTGAG GGGCTCGGCC TGGTCAGGCC CTGGTGCGAA TGGACTTTGG AAGCAGGGTG 1200 

ATCGCACAAC CTGCATCTTT- AGTGCTTTCT TGTCAGTGGC GTTGGGAGGG GGAGAAAAGG 1260 

. AAAAGAAAAA AAAAGAAGAA GAAGAAGAAA AGAGAAGAAG: AAAAAAACGA AAACAGTCAA 1320 

CCAACCCCAT. CGCCAACTAA GCGAGGCATG CCTGAGAGAC ATGGCTTTCA GAAAACGGGA 13 B0 

AGCGCTCAGA ACAGTATCTT TGCACTCCAA : TCATTCACGG AGATATGAAG AGCAACTGGG 1440 

ACCTGAGTCA ATGCGCAAAA TGCAGCTTGT- GTGCAAAAGC AGTGGGCTCC TGGCAGAAGG 1500 

GAGCAGCACA CGCGTTATAG TAACTCCCAT CACCTCTAAC ACGCACAGCT GAAAGTTCTT 1560 

GCTCGGGTCC CTTCACCTCC CCGCCCTTTC TTAGAGTGCA GTTCTTAGCC CTCTAGAAAC 1620 
GAGXTGGTGT CTTTC 



Seq ID NO: 198 Protein sequence: 
Protein Accession #: NP_0043O7 

1 11 . 21 31 41 51 

I. . I'll I I 

MESSAKMESG GAGQQPQPQP QQPFLPPAAC FFATAAAAAA AAAAAAAQSA QQQQQQQQQQ 60 
QQQQAPQLRP AADGQPSGGG HKSAPKQVKR QRSSSPELMR CKRRLNFSGF GYSLPQQQPA 120 
AVARRNERER NRVKLVNLGF ATLREHVPNG AANKKMSKVE TLRSAVEYIR ALQQLLDEHD 180 
AVSAAFQAGV LSPTISPNYS NDLNSMAGSP VSSYSSDEGS YDPLSPEEQE LLDFTNWF 

Seq ID NO: 199 DNA sequence 
Nucleic Acid Accession 8: NM_00701S 
Coding sequence: 1-1005 

1 11 ' 21 - 31 41 51 

I I- v- - I ill 

ATGACAGAGA . ACTCCGACAA AGTTCCCATT- GCCCTGGTGG GACCTGATGA CGTGGAATTC 60 
TGCAGCCCCC CGGCGTACGC TACGCTGACG GTGAAGCCCT CCAGCCCCGC GCGGCTGCTC . 120 

AAGGTGGGAG CCGTGGTCCT CATTTCGGGA GCTGTGCTGC TGCTCTTTGG GGCCATCGGG 180 

GCCTTCTACT TCTGGAAGGG • GAGCGACAGT CACATTTACA ATGTCCATTA CACCATGAGT 240 

.ATCAATGGGA AACTACAAGA TGGGTCAATG GAAATAGACG CTGGGAACAA CTTGGAGACC 300 

TTTAAAATGG GAAGTGGAGC TGAAGAAGCA ATTGCAGTTA ATGATTTCCA GAATGGCATC 360 

ACAGGAATTC GTTTTGCTGG- AGGAGAGAAG TGCTACATTA AAGCGCAAGT GAAGGCTCGT 420 

ATTCCTGAGG TGGGCGCCGT GACCAAACAG AGCATCTCCT CCAAACTGGA AGGCAAGATC 480 

ATGCCAGTCA AATATGAAGA AAATTCTCTT • ATCTGGGTGG CTGTAGATCA GCCTGTGAAG 540 

GACAACAGCT TCTTGAGTTC TAAGGTGTTA GAACTCTGCG GTGACCTTCC TATTTTCTGG 600 

CTTAAACCAA CCTATCCAAA ■ AGAAATCCAG AGGGAAAGAA GAGAAGTGGT AAGAAAAATT 660 

GTTCCAACTA CCACAAAAAG ACCACACAGT GGACCACGGA GCAACCCAGG CGCTGGAAGA 720 

CTGAATAATG AAACCAGACC CAGTGTTCAA GAGGACTCAC AAGCCTTCAA TCCTGATAAT 780 



267 



10 



WO 02/086443 . 

CCTT ATCATC AGCAGGAAGG GGAAAGCATG ACATTCGACC CTAGACTGGA TCACGAAGGA 840 

ATCTGTTGTA TAGAATGTAG GOGGAGCTAC ACCCACTGCC AGAAGATCTG TGAACCCCTG 900 

GGGGGCTATT ACCCATGGCC TTATAATTAT. CAAGGCTGCC GTTCGGCCTG CAGAGTCATC 960 

ATGCCATGTA GCTGGTGGGT GGCCCGTATC TTGGGCATGG TGTGAAATCA CTTCATATAT 1020 

CACGTGCTGT AAAATAAGAA CTAGCTGAAG AGACAACCAA AGAAGCATTA AGGCAGGTTG 1080 

ATGCTGATGG. GACCATAAAA TATTTTTACA CGCAGCCTGA GCGGTTATTC TTGACACTCT 1140 

TAACAGAATT TTTTTAATCG TTTTCCAGAA CTTTAGTATA TGCAAATGCA CTGAAAGGGT 1200 

AGTTCAAGTC TAAAATGCCA TAACCCCGTT ATTTGTTATT. TTTTATTTGC ATTGATTTGC 1260 

CATAAGTCTT CCCTTGCTTG CATCTTCCAA AGCTATTTOG AAATAAACAC GAAAATTTAC 1320 
AGTTTGCC 



PCT/US02/12476 
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Seq. ID NO: 200 Protein sequence: 
Protein Accession 8 r NP_008946 



MTENSDKVPI 
AFYFWKGSDS 
TGIRFAGGEK 
DNSFLSSKVL 
LNNETRP5VQ 
GGYYPWPYNY 



11 

. I 

ALVGPDDVEP. 
HIYNVHYTMS 
CYIKAQVKAR 
ELCGDLPIFW 
EDSQAFNPDN 
QGCRSACRVI 



21 3i 41 51 

I -. I I I 

CSPPAYATLT VKPSSPARXJ, KVGAWLISG AVLLLFGAIG 
INGKLQDGSM- EIDAGNNLET PKMGSGAEEA IAVNDFQNGI 
IPEVGAVTKQ SISSKLEGKI MPVKYEENSL IWVAVDQPVK 
LKPTYPKEIQ- RERREWRKI VPTTTKRPHS GPRSNPGAGR 
PYHQQEGESM TFDPRLDHEG ICCIECRRSY THCQKICEPL 
MPCSWWVARI LGMV 



Seq ID NO: 201 DNA sequence 

Nucleic Acid Accession It NM_OO0728.2 

Coding sequence : 112 . ; 495 



1 
I 

GTAATAAGAG' 
GTCGACCGGC 
CGGAAGTTCT 
CAGGCGGCGC 
GAGGACGCGC 
GAGCTGAAGC: 
AACACTGCCA 
GTGAAGAGCA . 
GACCTTCAAG 
CATATCCTTA 
AAGGAGGCAC 
TGGAAGAAGA 
GAGAATAATT 
GGAAACTAAT 
GGTTATTTGG 
ACTGTACCAC 
GTATGTAGCA 
AGCATCTATT 
AAATTTTTGT 
TGGATGCAAG 
TTGCTTTTTC 
TCCAATTCAT 
TTGCCTAACT 
TATAGTTTTA 
TGAGAGGTGT 
TTTGTTAAAA 
CTGATCATAT 
ACCATTCTTT 
ACCAGATAAT ; 
CTAAAATATT 
TTCTGATGAG 
CATATTAATA 
GTTTTCTCTG 
ATATCTTGTT 
ATTTTTGTTT 
AAAAAAAAAA 



11 
I 

CGGGGTCTCC 
CGCTCGCGCT 
CCCCCTTCCT 
CATTCAGGTC 
GCCTCCTGCT 
AGGAGCAGGA 
CCTGTGTGAC 
ACTTCGTGCC 
CCTGAGCAGA 
TAAGAGATTC 
AAGCCAAGGA 
GCAGCCCTGC 
TCTGTTGTTT 
ACAATACATT 
AAAGTGTGTA 
TTCGCCTTCT 
GTATCTCATT 
TTACCATATG 
TGGCTTGCTT 
ATTGTTTTCA" 
ATTTTCTTAG 
CTTTTTTTTT 
AAGGTCCCAA 
TATTTTATAT 
AGGTTGAAAT: 
AGACTGTTAT 
TTGTGTGGGT 
TGCCAATGTC 
GTGGGTCTAC 
TTCTACATCT 
ATTTTTAATG 
ATATTAAGTC 
TTTTTTTTTT 
AGATTTTTAA 
TTAATTGTTC 
AAAAAAAAAA 



21 

I ' 

GCGGGGAAGG 
GOCCTGAAAC 
GGCTCTCAGT 
TGCCCTGGAG 
GGCTGCACTG 
GACACAGGGC 
TCATCGGCTG 
CACCAATGTG 
TGAATGACTC 
ACTCAGAAGA 
AGTCTGTGTC: 
TGACACCTAG.' 
TAAGCCACAA 
TTCATTTATT 
TTTAACTCTG 
TGCCAGCCAC 
GCTGTTTTAA 
TTTATCACCT 
GCTTTATTAG 
GATATATAGT 
CAGTGTCTCT 
CTTTTATGTA 
GGTCACAATA 
GTAGATTAGT 
TCATACCTGT 
TTCACCATTT 
ATATTTCTGG 
ATACTGCCTT 
CAACATTGTT 
TTTATACATT 
GGATTGTGTT 
GTTCAATTCA 
TTTAACAGTG 
CTATTTTATT 
ATTGCTAGTA 
AAAAAAAAAA 



31 
I 

CGCCCACAGC 
TCTAGTCGCC. 
ATCTTGGTCC 

agcagcccag: 
gtgcaggact 
tccagctcgg; 

GCAGGCTTGC 

ggttccaaag 
caggaagaag: 

CACATGTGGA- 
TACCAGAAGC. 
AGTTTGGACT" 
AGTTTGTGGT 
TTGGGTAAAT 
TAAGAAACTG 
ATATGAGAGC 
TTTGTATTTC 
TTATTGAAGG 
TGTTGAGTTT- 
TTGGAAACTT 
CACAGAGAAA 
TTGTGCTTTT. 
ACCTTATTCT 
GATCTATTTT- 
GAATATAGAT 
AATTGCCCCT 
GTTCTCAATT 
GATTAGTGTA 
CATTCTTGTT 
TTAGAATCAG 
AAATCAGTGG" 
TGAACACAAT 
TTCTCAGTTT 
TTTTGGTGCT 
GATAGAAATA 



Seq ID NO: 202 Protein sequence: 
Protein Accession 8: NP_000719.1 



41 
I 

AGGTGTGGTG 
AGAGAGGCGG 
TGTACCAGGC 
ACCCGGCCAC 
ATGTGCAGAT 
CTGCCCAGAA 
TGAGCAGATC 
CCTTTGGCAG 
GTGTGTCCTA; 
GAAGGTGACA 
CAGAATCACA 
TCCAGCTTCC 
AATTTGTTAT. 
GCCTTGGAGT 
CCAAACTATT 
TCTAGTATTT- 
CCCAATGACT 
GTCTGTTTAA 
TTAGAGCTCT 
CCTTCCCCTG 
AAGTTGTAAT 
AGTTCATGTC 
ATACTTTCTT 
GAGTTAATTT 
ACCCAATTGT 
GCACCTTTGT 
CTGTCTCA7T 
GTGTTAAAGT 
CAAAAAGATT 
TGTGTTACTA 
GTTAATTTTG 
ACATGTTTTC 
TCAACAGAAA 
AATGTAAATG 
CAATATTTAA 



41 



51 
I 

TTCATCCCGG 

CATGGGTTTC- 

GGGCAGCCTC 

ACTCAGTAAA. 

GAAGGCCAGT 

GAGAGCCTGC 

AGGGGGCATG 

GCGCCGCAGG 

AATCCAATGA ~ 

TGACAGAGGC 

GAACAGTCTC 

AGAACTGTGA 

GACAGCCCTA 

GGGATTGCTG 

TTCTGAAGTG 

CCACAAATAG 

AATGACGTTG 

ATCTTCTGCT . 

TTATATGTTG : 

AATCTGCGGA 

TTGAATAAGA 

TAAGAACTCT 

GTAAAAGTTT 

TTGTATAAGG 

TTCAGTGCCA 

CAAAAAGCAA 

GATTGATTTG 

GAATCTCAAA 

TTAGCTACAT 

TCTACAAAAT 

GGAGAATTAG 

ACTTATTTAG 

TATTCTACAC 

GTACTTAAAC 

AATATTAGGA 



51 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420. 
480 
540 
600 
. 660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 



1 11 21 31 

1 I I I 1 1 

MGFRKFSPFL ALSILVLYQA GSLQAAPFRS ALESSPDPAT LSKEDARLU* AALVQDYVQM 
KASELKQEQE TQGSSSAAQK RACNTATCVT HRIAGLLSRS GGMVKSNFVP TNVGSKAFGR 
. RRRDLQA . 

Seq ID NO: 203 DNA sequence 
Nucleic Acid Accession #:.NM_001741 
Coding sequence: 71.. 496 

1 11 21 ?! 41 51 

i I I - I I I 

CTCTGGCTGG ACGCCGCCGC CGCCGCTGCC ACCGCCTCTG ATCCAAGCCA CCTCCCGCCA 
GAGAGGTGTC ATGGGCTTCC AAAAGTTCTC CCCCTTCCTG GCTCTCAGCA TCTTGGTCCT 
GTTGCAGGCA GGCAGCCTCC ATGCAGCACC ATTCAGGTCT GCCCTGGAGA GCAGCCCAGC 
AGACCCGGCC ACGCTCAGTG AGGACGAAGC GCGCCTCCTG CTGGCTGCAC TGGTGCAGGA 
CTATGTGCAG ATGAAGGCCA GTGAGCTGGA GCAGGAGCAA GAGAGAGAGG GCTCCAGCCT 



60 
120 



60 
120 
180 
240 
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WO 02/086443 

GGACAGCCCC AGATCTAAGC GGTGCGGTAA TCTGAGTACT TGCATGCTGG GCACATACAC 
GCAGGACTTC AACAAGTTTC ACACGT7CCC CCAAACTGCA ATTGGGGTTG GAGCACCT6G 
AAAGAAAAGG ' GATATGTCCA GCGACTTGGA GAGAGACCAT CGCCCTCATG TTAGCATGCC 
CCAGAATGCC AACTAAACTC CTCCCTTTOC TTCCTAATTT CCCTTCtTGC ATCCTTCCTA 
TAACTTGATG CATGTGGTTT GGTTCCTCTC TGGTGGCTCT TTGGGCTGGT ATTGGTGGCT. 
TTCCTTGTGG CAGAGGATGT CTCAAACTTC AGATGGGAGG AAAGAGAGCA GGACTCACAG 
GTTGGAAGAG AATCACCTGG GAAAATACCA GAAAATGAGG GCCGCTTTGA GTCCCCCAGA 
GATGTCATCA GAGCTCCTCT GTCCTGCTTC TGAATGTGCT GATCATTTGA GGAATAAAAT 
TATTTTTCCC C 



Seq ID NO: . 204 Protein sequence: 
Protein Accession #: NP_001732 



PCT/US02/12476 



41 



51 



1 11 21 31 

I I I . I I I 

MGFQKFSPFL ALSILVLLQA GSLHAAPFRS ALE SS PAD PA TLSEDEARLL LAALVQDYVQ 
NKASELEQEQ EREGSSLDSP RSKRCGNLST CMLGTYTQDF NKFHTFPQTA IGVGAPGKKR 
DMSSDLERDH ' RPHVSMPQNA N 

Seq ID NO: 205 DNA sequence 
Nucleic Acid Accession #: NM_005361 
Coding sequence: 1-945 



ATGCCTCTTG 
GAGGCCCTGG 
TCCTCTTCTA 1 
CCTCCCCACA 
AGACAATCCG. 
CTGGAGTCCG 
CTCCTCAAGT. 
AGAAATTGCC 
GTCTTTGGCA 
TGCCTGGGCC 
CTCCTGATAA 
ATCTGGGAGG- 
CATCCCAGGAi 
GTGCCCGGCA 

accagctatg: 

TACCCACCCC 



11 
1 

AGCAGAGGAG 
GCCTGGTGGG 
CTCTAGTGGA 
GTCCTCAGGG 
ATGAGGGCTC 
AGTTCCAAGC 
ATCGAGCCAG 
AGGACTTCTT- 
TCGAGGTGGT 
TCTCCTACGA 
TCGTCCTGGC. 
AGCTGAGTAT 
AGCTGCTCAT. 
GTGATCCTGC 
TGAAAGTCCT 
TGCATGAACG 



21 
I 

TCAGCACTGC 
TGCGCAGGCT 
AGTTACCCTG 
AGCCTCCAGC 
CAGCAACCAA 
AGCAATCAGT 
GGAGCCGGTC 
TCCCGTGATC 
GGAAGTGGTC 
TGGCCTGCTG 
CATAATCGCA- 
GTTGGAGGTG* 
GCAAGATCTG 
ATGCTAGGAG, 
GCACCATACA 
GGCTTTGAGA 



31 
I 

AAGCCTGAAG 
CCTGCTACTG 
GGGGAGGTGC 
TTCTOGACTA 
GAAGAGGAGG 
AGGAAGATGG 
ACAAAGGCAG 
TTCAGCAAAG. 
CCCATCAGCC; 
GGCGACAATC 
ATAGAGGGCG 
TTTGAGGGGA 
GTGCAGGAAA- 
TTCCTGTGGG- 
CTAAAGATCG 
GAGGGAGAAG 



41 
I 

AAGGCCTTGA 
AGGAGCAGCA 
CTGCTGCCGA" 
CCATCAACTA 
GGCCAAGAAT' 
TTGAGTTGGT 

aaAtgctgga- 

CCTCCGAGTA 
ACTTGTACAT. 
AGGTCATGCC 
ACTGTGCCCC 
GGGAGGACAG 
ACTACCTGGA 
GTCCAAGGGC 
GTGGAGAACC 
AGTGA 



51 

1 

GGCCCGAGGA 
GACCGCTTCT 
CTCAGCGAGT- 
CACTCTTTGG: 
GTTTCCCGAC 
TCATTTTCTG 
GAGTGTCCTC 
CTTGCAGCTG' 
CCTTGTCACC 
CAAGACAGGC 
TGAGGAGAAA 
TGTCTTCGCA. 
GTACCGGCAG. 
CCTCATTGAA 
TCACATTTCC 



Seq ID NO: 206 Protein sequence: 
Protein Accession #: NP_005352 



PPHSPQGASS 
LLKYRAREPV 
CXjGLSYDGLL 
HPRKLLMQDL 
YPPLHERALR 



11 21 31 

I I 1 

KPEEGLEARG EALGLVGAQA PATEEQQTAS 
PSTTINYTLW RQSDEGSSNQ EEEGPRMPPD 
TKAEMLESVL RNCQDFFPVIv FSKASEYLQti 
GDNQVMPKTG LLIIVLAIIA IEGDCAPEEK 
VQENYLEYRQ VPGSDPACYE FLWGPRALIE 
EGEE 



41 51 

I I 

SSSTLVEVTL GEVPAADSPS 
LESEFQAAIS RKMVELVHPL 
VFGIEWEVV PISHLYILVT 
IWEELSMLEV FEGREDSVFA 
TSYVKVLHHT LKIGGEPHIS 



Seq ID NO: 207 DNA sequence 
Nucleic Acid Accession #: NM_021115 
Coding sequence: 743-2893 



AAAGGAAGGG 
GGCACCGCCC 
CCCAAACTAA 
CCCTTTGGGT 
GCACCCTGAA 
GGGCGAGCTG 
ACCGCTGCTT 
TTCGCTCAAG 
CACTGTCCAA 
CACGGAGAAG 
AGAAGTGCCC 
GCAAATCTCC 
ACCCGGGGAG 
GGCCCTGATG 

gaccactacc 
ctgcagtgtg: 

GCCCCTCAAC 
GGAGCTCCAG 
GGACGG CCCT-: 
CCGAAGCCCC 
GACCTTCCAG 
CTCTGGGGAT 
CCTGGGCTAT 
CTGGAGCAGC 
CATOGGCCGC 
CTGGACGATT 



11 

I • 

AGGGAGGGAG 
TTAGGAGGGC 
CTGGTGTCTT. 
CCTTACCTCC 
GAGAGAGTGG 
GTGCTGGATG 
CCAGAGGAGG 
CAGGTGAACT 
AGGGCAGGGT 
CCTGGCCCAC 
CTTTGGCTGG 
CCCTTCACTT 
CCTGGGCCTG 
GACAAAGGTG 
TCCACCATTA 
AGCTTCTCCA 
AACTTTCTGG 
GTGAAGAGTG 
ACCCTGACCG 
ACCAACACCA 
CTTCACTACC 
GTCACGGTGA 
GAGCTCCAGG 
CAGGAGCCCA 
GTCCTCTCCC 
GAAGCTCCAG 



21 
I 

AAAGGAGAAG 
CACCCTCAGA 

TGCCCTCAGG 
TAACAGCGCC 
GGACCGCACC 
CCCGCCCCAA 
CTGCCAGGAA 
CCCAGCCAGC 
CGGGGGACCC 
ACCGAAAGGA 
CGCAGCCCTA 
ACATGGCCCA 
AGAATGAGCT 
TCACCACCAC 
ATCCTGAGGG 
AGTGCACATA 
TGAACCTGTC 
TCCTGGCCAA 
TCTCCGTCTA 
AGGCCTTCAT 
TGGACCTGCA 
GCGCTAAGAT 
TCTGCTCAGC 
CAAGTTACCC 
AGGGCCAGAA 



31 

1 

TTGGTTTAGA 
GTCTGACAGC 
CCAAGATGCT 
AGCCCCGGAG 
CCCCAGTTCC 
CTCTGCACAT 
GCACGCCTTG 
GCAGCTGAGG 
GTCCCAGGGC 
GGACCCCATC 
GAGTGCGGTC 
TGTGGCCCAC. 
GGAGGCCCCC 
GACTGGGTCA' 
GGTCATCACC 
GTACATTGAC 
CAACGTGACA 
CGATGGGGAA' 
CCAGACACTC 
CTTCCGGACC 
GCTGAGCTGC 
CTCAGGTGGG 
GCTGACATGC 
TCCTTGTGGA 
TGAAAACACA 
GCTGCACCTG 



41 

I 

GGCCAGCCGG 
AGGTGAAGGT 
CTTCCCGAGG 
AGAGGCAGTC 
TCACAGTCGG 
CACGACATCC 
CCCCCCAAGA 
CCCAAGGCCA 
CTAGATCTCC 
GTGGCCTCCG 
CCTACAACAC 
ACACTCCCCC 
CAGGAGGACA 
GCCTCAGAGG 
ACCGAGCAGG 
TCCAGOGACT 
GTCTACACTG 
CTGCTCTCCA 
CTGGTGGAGG 
TTCCAGGACG 
AACTTTCCCC 
GTGGCCCACT 
ATCAATGCCT 
GGGGCAGTGC 
AATGGGAGCC 
CACTTTGAGA 



51 
I 

ACGAGCTTTG 
CCTAAATCTC 
GAGATGCTAG 
CTGGCAAAGA 
CGGAAGTGCT 
CAGCCCTGrC 
AGAAACTGCC 
CCTCCGCAGC 
TCTCCTCCTC 
AGGAGGCATC 
CCGCACCCCT 
AGAGGCCAGA 
CCAGCCCCAT 
AGAGCCAGGA 
CACCAGCTCT- 
ACCCACTGCT- 
GCTATGGGGT 
TCCGCGGGGT 
GGCAGGTAAT 
ACGGCCTTGG 
GCCGGCCTGA 
TTCACTGCCA" 
CCAAGCCGCA 
ACAATGCCAC 
AATTCTGCAT 
GGCTGTTGCT 



360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
84 0 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



269 



WO 02/086443 

GCATGACAAG GACAGGATGA CGGTTCACAG OGGGCAGACC AACAAGTCAG CTCTTCTCTA 1620 

CGACTCCCTT CAAACCGAGA'- GTGTCCCTTT TGAGGGCCTG CTGAGCGAAG GCAACACCAT 1680 

CCGCATCGAG TTCACGTCCG ACCAGGCCCG GGOGGCCTCC ACCTTCAACA TCCGATTTGA 1740 

AGOGTTTGAG AAAGGCCACT GCTATGAGCC CTACATCCAG AATGGGAACT TCACTACATC 1800 

CGACCCGACC TATAACATTG GGACTATAGT GGAGTTCACC TGCGACCCOG GCCACTCCCT 1860 
GGAGCAGGGC CCGGCCATCA TCGAATGCAT- CAATGTGOGG GACCCATACT ' GGAATGACAC * 1920 

AGAGCCCCTG TGCAGAGCCA TGTGTGGTGG GGAGCTCTCT GCTGTGGCTG "GGGTGGTATT 1980. 

GTCCCCAAAC TGGCCOGAGC CCTACGTGGA AGGTGAAGAT TGTATCTGGA AGATCCACGT 2040 

GGGAGAAGAG AAACGGATCT TCTTAGATAT CCAGTTCCTG AATCTGAGCA ACAGTGACAT - 2100 

CTTGACCATC TACGATGGCG . ACGAGGTCAT GCCCCACATC TTGGGGCAGT ACCTTGGGAA 2160 

CAGTGGCCCC CAGAAACTGT ACTCCTCCAC GCCAGACTTA ACCATCCAGT TCCATTCGGA 2220 

CCCTGCTGGC CTCATCTTTG GAAAGGGCCA GGGATTTATC ATGAACTACA TAGAGGTATC 2280 

AAGGAATGAC TCCTGCTCGG ATTTACCCGA GATCCAGAAT GGCTGGAAAA CCACTTCTCA 2340 

CACGGAGTTG GTGCGGGGAG. CCAGAATCAC CTACCAGTGT GACCCCGGCT. ATGAC ATCG T 2400 

GGGGAGTGAC ACCCTCACCT. GCCAGTGGGA CCTCAGCTGG AGCAGCGACC CCCCATTTTG 2460 

TGAGAAAATT ATGTACTGCA CCGACCCCGG AGAGGTGGAT CACTCGACCC GCTTAA TTTC 2520 

GGATCCTGTG; CTGCTGGTGG. GGACCACCAT . CCAATACACC TGCAACCCCG GTTTTGTGCT 2580 

TGAAGGGAGT TCTCTTCTGA- CCTGCTACAG CCGTGAAACA GGGACTCCCA TCTGGACGTC 2640 

TCGCCTGCCC CACTGCGTTT CAGAAGCGGC AGCAGAGACG TCGCTGGAAG GGGGGAACAT 2700. 

GGCCCTGGCT ATCTTCATCC CGGTCCTCAT CATCTCCTTA CTGCTGGGAG GAGCCTACAT 2760 

TTACATCACA AGATGTCGCT ACTATTCCAA CCTCCGCCTG CCTCTGATGT ACTCCCACCC. 2820 

CTACAGCCAG ATCACOGTGG AAACCGAGTT TGACAACCCC ATTTACGAGA CAGGGGGAAC 2880 

CCAAAAGGTT. TAGGGTTTCA TTTAAAAAGA GGTACCCTTT AAAAAGGGGC TTGTGAACTC 2940 

AACCCCAATT TCCCCGAGAC ATTTATCCAA AGGCCCTGGG GGCCTTGATT TAAACCCCCA 3000 

AAAGGCGGCT, GTTTTTTGGT TAAACTTTTT AACAAAGGGT TACGGGTTTT TTCCCCGGAT 3060 
TTTATAAATT TTAAAAGTG 



Seq ID NO: 208 Protein sequence: 
Protein Accession ft: NP_066938 

1 11 .21 31 41 51 

I I I : I I I 

MAQEAPQEDT SPMALMDKGE NELTGSASEE SQETTTSTII TTTVITTEQA PALCSVSFSN 60 

PEGYIDSSDY- PLLPLNNFLE CTYNVTVYTG: YGVELQVKSV, NLSDGELLSI RGVDGPTLTV 120 

LANQTIiLVEG- QVIRSPTNTI SVYFRTFQDD GLGTFQLHYQ AFMLSCNFPR RPDSGDVTVM 180 

DLHSGGVAHF KCHLGYELQG. AKMLTCINAS KPHWSSQEPI CSAPCGGAVH NATIGRVLSP 240 

SYPENTNGSQ. FCIWTIEAPE GQKLHLHFER LLLHDKDRMT VHSGQTNKSA LLYDSLQTES 300 

VPFEGLLSEG NTIRIEFTSD QARAASTFNI RFEAFEKGHC YEPYIQNGNF TTSDPTYNIG 360 

TrVEFTCDPG HSLEQGPAII ECINVRDPYW NDTEPLCRAM- OGGELSAVAG WLSPNWPEP 420 

YVEGEDCIWK IHVGEEKRIF LDIQFLNLSM SDIIiTIYDGD EVMPHILGQY LGNSGPQKLY 480 

SSTPDI»TIQF HSDPAGLIFG, KGQGFIMNY1 EVSRNDSCSD LPEIQNGWKT TSHTELVRGA 540 

RITYQCDPGY DXVGSDTLTC QWDbSWSSDP PFCEKIMYCT DPGEVDHSTR LISDPVLLVG 600 

TTIQYTCNPG. FVLEGSSLLT CYSRETGTPI HTSRLPHCVS EAAAETSLEG GNMALAIFIP 660 
VLIISliLLGG AYIYITRCRY YSNLRLPLMY SHPYSQITVE TEFDNPIYET GGTQKV 



Seq ID NO: 209 DNA sequence 

Nucleic Acid Accession #: NM_001327.1 

Coding sequence: 89-631 

I . 11 21 31 41 51 

I I I- I I I 

AGCAGGGGGC GCTGTGTGTA CCGAGAATAC GAGAATACCT CGTGGGCCCT GACCTTCTCT 60 

CTGAGAGCCG GGCAGAGGCT CCGGAGCCAT GCAGGCCGAA GGCCGGGGCA CAGGGGGTTC 120 

GACGGGCGAT GCTGATGGCC CAGGAGGCCC TGGCATTCCT GATGGCCCAG GGGGCAATGC 180 

TGGCGGCCCA GGAGAGGCGG GTGCCACGGG CGGCAGAGGT CCCCGGGGCG CAGGGGCAGC 240 

AAGGGCCTCG GGGCOGGGAG GAGGCGCCCC GCGGGGTCCG CATGGCGGCG CGGCTTCAGG 300 

GCTGAATGGA -TGCTGCAGAT- GCGGGGCCAG GGGGCCGGAG AGCCGCCTGC TTGAGTTCTA 360 

CCTCGCCATG CCTTTCGCGA CACCCATGGA AGCAGAGCTG GCCCGCAGGA GCCTGGCCCA 420 

GGATGCCCCA CCGCTTCCCG. TGCCAGGGGT GCTTCTGAAG GAGTTCACTG TGTCCGGCAA 4 BO 

CATACTGACT ATCCGACTGA' CTGCTGCAGA CCACCGCCAA CTGCAGCTCT C CATCAGCTC 540 

CTGTCTCCAG CAGCTTTCCC TGTTGATGTG GATCACGCAG TGCTTTCTGC CCGTGTTTTT 600 

GGCTCAGCCT CCCTCAGGGC AGAGGCGCTA AGCCCAGCCT GGCGCCCCTT CCTAGGTCAT 660 

GCCTCCTCCC CTAGGGAATG GTCCCAGCAC GAGTGGCCAG TTCATTGTGG GGGCCTGATT 720 
GTTTGTCGCT GGAGGAGGAC GGCTTACATG TTTGTTTCTG TAGAAAATAA AACTGAGCTA 

Seq ID NO: 210 Protein sequence: 
Protein Accession #: NPJ>01318.1 

1 11 - 21 31 41 51 

I - I I I I I 

MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 60 
PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAKPFATPM EAELARRSLA QDAPPLPVPG 120 
VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCFLPVF LAQPPSGQRR 



Seq ID NO: 211 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 52-459 

1 " ' 11 21 31 41 51 

I I I I I I 

CCTCGTGGGC CCTGACCTTC TCTCTGAGAG COGGGCAGAG GCTCCGGAGC CATGCAGGCC 60 

GAAGGCCAGG GCACAGGGGG TTCGACGGGC GATGCTGATG GCCCAGGAGG CCCTGGCATT 120 

CCTGATGGCC CAGGGGGCAA TGCTGGCGGC CCAGGAGAGG CGGGTGCCAC GGGCGGCAGA 180 

GGTCCCCGGG GCGCAGGGGC AGCAAGGGCC TCGGGGCCGA GAGGAGGCGC CCCGCGGGGT 240 

CCGCATGGCG GTGCCGCTTC TGCGCAGGAT GGAAGGTGCC CCTGCGGGGC CAGGAGGCCG 300 
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GACAGCCGCC * TGCTTCAGTT ' CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC 360 

ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 420 

GTGTTTTTGG. CTCAGGCTCC . CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 480 

TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG 540 

GCCTGATTGT TTGTCGCTGG AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG 600 
CTGAGCTA - • 

Seq ID NO: 212 Protein sequence: 
Protein Accession #: Eos sequence 

1' . - _ "11 . ~ 21 - ■ 31 41 51 

r r i • • i i i 

MQAEGQGTGG - STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 60 
PRGPHGGAAS -AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 120 
PLPVFLAQAP" SGQRR 



Seq ID NO : 213 DNA- sequence • 
Nucleic Acid Accession #: NM_000555 
Coding sequence: 416. .1498 

1 11- 21 31 41 51 

I I I I I. I 

CTTATTTTTT ATGAATGTCG GATAGCTGCA CCAGCTTGGT GGGGAAAGGG TTTGATGAAT 60 

AGCACAAAGA- CACTGGCTGT TCCCTGGAGG CTGTCCCTTT AAAGGAGAAT CTTAGTTTAT 120 

TCTGGGGGGA GGGGATGCAC ACATTAGAGT, AGGAAAGAGG GCTTGGAATA AAATGAAAAC 180 
ACTCCCCCTT CATAGTCATT GTACTGAAAT GCAAAGACTG. CTTCCTAAGC TGGAGATGCTs 240 

AACCTTGGGT AGCTCCTTCT GTTCTCTTCA AGGGGAATTT TGTCAGGCTA- TGGATTCATT 300 

TACAACTGTT AGTCATGTGG GCATGTGTGA GGAAACAGAT- GCCAGTTTTA ATGTATTTAG 360 

CCCGAAGTTC CAATTTGATA GGAGCCACTG TCAGTCTCTG AGGTTCCACC AAAATATGGA 420 

ACTTGATTTT GGACACTTTG ' ACGAAAGAGA TAAGACATCC AGGAACATGC GAGGCTCCCG 480 

GATGAATGGG TTGCCTAGCC CCACTCACAG CGCCCACTGT AGCTTCTACC GAACCAGAAC 540 

CTTGCAGGCA- CTGAGTAATG AGAAGAAAGC CAAGAAGGTA" CGTTTCTACC GCAATGGGGA 600 

CCGCTACTTC AAGGGGATTG TGTACGCTGT GTCCTCTGAC. CGTTTTCGCA GCTTTGACGC. 660 
CTTGCTGGCT- GACCTGACGC GATCTCTGTC TGACAACATC AACCTGCCTC AGGGAGTGCG - 720 

TTACATTTAC ACCATTGATG: GATCCAGGAA- GATCGGAAGC ATGGATGAAC TGGAGGAAGG 780 

GGAAAGCTAT GTCTGTTCCT CAGACAACTT CTTTAAAAAG- GTGGAGTACA" CCAAGAATGT'- 840 

CAATCCCAAC TGGTCTGTCA ACGTAAAAAC ATCTGCCAAT ATGAAAGCCC . CCCAGTCCTT 900 

GGCTAGCAGC AACAGTGCAG AGGCCAGGGA GAACAAGGAC TTTGTGCGCC CCAAGCTGGT 960 

TAC CAT CATC CGCAGTGGGG TGAAGCCTCG' GAAGGCTGTG CGTGTGCTTC TGAACAAGAA . 1020 

GACAGCCCAC. TCTTTTGAGC AAGTCCTCAC TGATATCACA GAAGCCATCA AACTGGAGAC 1080 

CGGGGTTGTC AAAAAACTCT ACACTCTGGA' TGGAAAACAG GTAACTTGTC TCCATGATTT 1140 

CTTTGGTGAT GATGATGTGT TTATTGCCTG TGGTCCTGAA AAATTTCGCT ATGCTCAGGA 1200 

TGATTTTTCT CTGGATGAAA' ATGAATGCCG AGTCATGAAG GGAAACCCAT CAGCCACAGC 1260 

TGGCCCAAAG GCATCCCCAA CACCTCAGAA GACTTCAGCC AAGAGCCCTG GTCCTATGOG 1320 

CCGAAGCAAG TCTCCAGCTG ACTCAGCAAA CGGAACCTCC AGCAGCCAGC TCTCTACCCC 1380 

CAAGTCTAAG CAGTCTCCCA TCTCTACGCC CACCAGTCCT GGCAGCCTCC GGAAGCACAA 1440 

GGACCTGTAC CTGCCTCTGT CCTTGGATGA CTCGGACTCG CTTGGTGATT CCATGTAAAG 1500 

GAGGGGAGAG TGCTCAGAGT CCAGAGTACA AATCCAAGCC TATCATTGTA GTAGGGTACT 1560 

TCTGCTCAAG TGTCCAACAG GGCTATTGGT GCTTTCAAGT TTTTATTTTG TTGTTGTTGT 1620 

TATTTTGAAA AACACATTGT AATATGTTGG GTTTATTTTC CTGTGATTTC TCCTCTGGGC 1680 

CACTGATCCA CAGTTACCAA TTATGAGAGA TAGATTGATA ACCATCCTTT GGGGCAGCAT 1740 

TCCAGGGATG CAAAATGTGC TAGTCCATGA CCTTTCAATG GAAAGCTTAG GGGCCTGGGG 1800 

TAAATTTGCC CCGTTTAAAT TTGCCCAAAC AGTTTTCCTT TTGTAGAGGG GTGTTTAAAT- 1860 

ATACAGCAAT TAAAAAGTTT * GTGTGGGGAA AAAAAAAACT CATTGGCAGA TCCAAGAATG 1920 

ACAAACACAA GTGCCCCTTT TCTCTGGATC TCAAGAATGG TGGAGGACCC TGGAAGGACA 1980 

GCAAGGCAGC TCCCCAGCCT CACTCTTCAC TCCTGATTGA GGCCCGGGTT TGTTGTCCAG 2040 

CACCAATTCT GGCTGTCAAT GGGGAGAAAT AAACCAACAA CTTATAATTG TGACACCAGA 2100 

TGCTTAGGAT CCTGGTGCTG GGTTAGCTAA GAGAATAGAC AGAATTGGAA AATACTGCAG 2160 

ACATTTCCGA AGAGTTTATA AAGCACAGTG AATTCCTGGT CAATCTCTCC ACTGAGGCAA . 2220 

TTTGGAATCA ATAAGCAATT GATAATAGTT TGGAGTAAGG GACTTCATAT AOCTGATTOC 2280 

TCTAGAAGGC TGTCTAACAT. ACCACATGAT TACATGAACT GTATGGTATC CATCTATCTC 2340 

TGTTCTATTG AATGCCTTGT. TAACAGCCAA CACTGAAAAC ACTGTGAGAA TTTGTTTTCA 2400 

GGTCTGACAC CTTTCAGTCT CTTTTTATAG CAAGAAATCA ATATCCTTTT TATAAAAATT- 2460 

CATGTCTGTA TTTCAGGAGC AAACTCTTCA GGCTCCTTTT TTATAAACTG GTGATTTTTC 2520 

TTTTGTCTAA AAAACACATG AAGAAAATTT ACCAGAAAAA AAAAAAAAAG CCGAAGAATA 2580 

ATGTTATTTA GAAATTATGC TGTCACTGCC AAACAGTAAC CTCCAGGAGA AAACAAGATG 2640 

AATAGCAGAG GCCAATTCAA TAGAATCAGT TTTTTGATAG CTTTTTAACA GTTATGCTTG 2700 

CATTAATAAT TTCAATGTGG ACCAGACATT CTAATTATAT TTTAAATGAA ATGTTACAGC 2760 

ATATTTTAAG CAACTCTTTT TATCTATAAT CCTAATATTT CATACTGAAG ACACAGAAAT 2820 

CTTTCACTTG' TCTTTAACAT TAGAAAGGAT TTCTCTTTAC TAAGGACTGA TCATTTGAAA 2880 

TAGTTTTCAG TCTTTTGAGA TACAGGTTTA TAACACTGCT TTTTTTTTCC TGTAAACATA 2 940 

GCCCATAATG GCAAAAACAA CTAATTTTAA TTGAAGGTCT TGCTTGCCAN TCCTGTGTTG 3000 

GCTTTNACCA AATATAAAAA TTCCCTTATT CCTTGGTAAT GGTGCAAATN TTTGGAAAGG 3060 

CACAGCATCC AAACCAAGCT GCTGTTTGGC TACTGAATGG CTTGCAGTTG TTCCTCCACT' 3120 

CTAAATGGAA' TGAGCTTGCT GTGTGTGTGT GTGGTGGTGG TGGGAGGGGG TGGTGCATGT 3180 

GTGTGTGTGT GTGTGCATCT GCAGCTGCTT CAAAATTAAG AAATACTACA AGACACCCCT 3240 

GTAATGGATT GGTGGCAACT GGGTGGCACT GCTGATGTGC ACTGTGTAGG GGGGAACCCA 3300 

GTGGTGGTGG GGTATCTCAA ATGCCCCTAG ACAAGCTTCA GATGTCTGTA GCTACCAAAA 3360 

ACATTTTCGG, TTCAAGAAAA GTGAGATGAT GGTAGTACTG GTTTCTGGTG AAATTGAAAA- 3420 

ACCCCAAATG ATGAGGATCT CTTTTTGCCC CCTCTCCTTT TTTTGTAAAC CCATTCAAAA 3480 

CCATTAAtAA GCCCATTTTA CTAANCCCCT ATTTCTTTCT AGAAGCTCAG GGTTTNCTTA 3540 

GTGCCTCCCA NAACATTTTG TAGTTAATTG GGAAAAAGTG ATACTTGGAT TAGGGGGTGT 3600 

GGGGATAAAG AATGGTGGGA GGCCTGATTT. TAAAATTCAG GCCAGAACCC CCAATGACTC 3660 

CACCCATAGT NTCACTTTAG GTCTCATTTA GTCCATCACC TTTATTTTAA GTTGAGGAAG 3720 

TGGAGGCTGG TAAAGAGCAG GACCAGAGGA AGAATCCAGA TTTCCTTATG CTTGGGCCTC 3780 

ACACTAGCTC TNTGAGTATT TCCTTGATTG CGGTATATGT ACTACTAGAA AATACCAAAT 3840 

GGATATATTT TCTTTAGGAT AACCTTTGAA CCAACAATNT TCAATAACAA TAGTACATCT 3900 
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.TCCATCTTAC TTTTAATCGA. GTATAAGGAA ATGTTTCTTT ATGGCCATTT TGGAGGGAGC. '3960 

AGGGGATGAG GCTTGGCATA GTCCAAAATT TAAGNCTCCA ATAATTAATT GCATTTTAAA ' 4020 

TTGTTTTAAA" TTGGCCCACT. TTCAAGGCAA TTTTTTTTGT GTGTCTGTAA CTGAGCTCCT 4080- 

CCACCCCTGT CATTCACTTC CAATTTTACC CAATCCAATT TTAGCACTCA AGTTCCATTG. -4140 

TGTTAATTTT: TGCACGGTCT ACACACATCA AGTCAGCAAG CATTTGCCAC CACTCCCTAT 4200 

ACTTCTCCCT CTTTTTTACA CACACACACA CACACACACA CACAATCCAT CTCTTGCTTG 4260 

TTCCTACCTC CCTGATTTTT CTTCCCTACA GAAATAGAAA TAGGGACAAA GAAGGGGAAA 4320 

ATGTATATAT. TGGGGCTGGG CTGAACAACT AACTTCATAA GTAGTATTAA CTAGGGGTAA 4380 

ATTGAGAGAA AAGCTCCTTT- TCTCTTCACT GTTTTGGAAA GGATAGCCAT TAGCATGACT « 4440 
GCTTTGTGTC CTTATGGACT TTAGTATTAG CCTAGATTGA ATTATAGCGT TTTTCTAGCT .4500 

GAAGGAACCT TAAGATCACA TCATCTACTC CTCTACTCCA AATTTCTCAT TCTTCAGGCC 4560 

AGGAAACCGA GACACAGAGG TAAAGTAATT. TCCCCAAGGT. CACACAGCTG GCTGGGGCAG 4620 

GATTGGGTTT ACAACCCACA TCTCCTGGCT CTTATTCCAG GGCCTTTTCC. CACTAAGTAG 4680 

TATTGCCTTC " CATTAGGCTC. CTGAGAGTTA TTTCTCAGGG TCATGTTGCA TCTTGGAGCC 4740 

ACATGCTGCT. GCCCTGATCT CAGTGGGAAA TNCACCCAGC AACCTAATAC AGCCCCTTTT " 4800 

CCCTGCATTC ACCTGGTTCC CATCCACATG GGTTGCAGAT GTCCTTGAAG. AGAGTGAGGC 4860 

ATTGAGGGCC AATAGGAGCA" ATGGGGTCCC TGGCCTTGTC CATCTGATTC AGGAG ATCAC .'■ 4 920 

TGCTCCATCG TGAGGAGCCC TCTGAATAGC CCCCCACTGA ATGCTTGCCT- TGCCCAAATG 4980 

GAATGGAGGA AGATTGATTT TCTCCATCAG TTCACCTTGT GTCATCTCAT AATGGTTGGT * 5040 

CTTTCCAGGC TGAGGGAAAT GTTTCTTGTT. TCCANAGTAN AAAAAAGAAA GAGTGGAACA ' 5100 

ATANCTTTGT- TCATCCTAAC TTTCTGAGAT GGCTTTTCAA CATTTAAAAA AAACTAGTGT 5160 

GGTACCATTC ACTGGCANGA TTTNTTTTAG AATATGGGAG TAAGATGAGG . TAGAG AAAAT 5220 

AACCTGGTCT. CACTGTGGTT- GCCCTCATCC ACAATGTCCC CAAAGCCATC CTGCTNTGAT 5280 

GAGGACAATT TCCAGGTATA' AGCAAGGGGC TTTGTGACAA" AAATGTACCC TGGCTGATGT 5340 

TAAACATTGG CTCCTGTGTT. TGCACCAAAA- TAGCAAGCTG TGTGCTCTAT ACACTCTTCC 5400 

CATCGTCTTG. TGTACACTGC TCCTGTGGCC TTCCACAGCA- GAAACCAGGG CAAAAGGGTC 5460 

caaacacatg: gttttccttg. ctgcaaggct nttcctggga ACTAAGGGGG TATTTATTAG 5520 

TTCAGTTNTA AGAGACCTCC TTCTGGGCTT' ACCCCACTCC TCAGGTACTT- CTCTCTCCTT 5580 

CCTCCTTCTC CTCCACAGTC" ACAAGTAACC AAGGAACCTG- AAAGTGGATG TGTAGCTATT 5640 

TGAAGAAGGC AAGGAACCCT. GAGATTCTTC TTTGAATCCT TTAGTCCAAG- TCTTAGACCA ' 5700 

GTGATTGGTG: CTTACCTTGA- ACAAAATTTT GTCTGTGTTC CTAATCCCTT CAATACTNTG; 5760 

GGTACAATGC. TCCCAATCAC" CCTGCACATT TGATTCTAAA TGGCTTTTAT TTTTTAAAAA' 5820 

TCCATATCCC TAGGACAAGA NAACAGGATG CCTATATCCC CAAAATGAGC TCCAGGACAC . 5B80 

TGATGGGAAT GATCCCAANG' ATCACGCCAC CTCAGAAAAC. GTCTGTGCCA ANAGACTTCC 5940 

CCAGATAGAA NCACTGGGAC. AGTGGTTTGA ACGACTTCTT TTATGGTTGT; CCAGTTTGCT 6000 

ATGGAAATAA . AAGGCATTGA . TTTTTTAAAA' AAGATGATTG. GAACCTGTCT TTGGCCACAT 6060 

AGGGCCACTT GGATCCATTT. CCAGGCCTTA CTCATATATT GCCTTCACTG" AAGGGCTTTG . 6120. 

GCTTTAAGTC CCAGACTGGT CTCCCAAGTG • AACCATAAGT GTTTTGGAGC TCATCTGGGG-: 6180 
TGAGGCATGA GAATGTTGCC CCATCTATCC CTTCAGGAAA AGGTGCCTTC CCTCCCTTTC .6240 

TCCTAAAGCC. TGGTCCCCAA; . AAATTGTTTT TGTCTCCAAA AGTCTAGTAT GGTCTTTATA 6300 

CACCCANACT CTTAGTGTTG CGTCCTGCCT TGTTTCCTTG TTAAGGATCT ATGCANACCT ■ 6360 

CCCGCTTTGG' CTTAGCTAGC GTGACATTGG.' CTATCATTTG ACAAGACTAA CTTTTTTTTT 6420 

TTTTTTTTTG ACTGAGTCTC CCTCTGTCAC CTAGGCTGGA GTGCAGTGGC ACAATCTTGG 54 80 

CTCGCTGCAA CCTTCACCCT TCACCTCCCA GGTCGAAGCG ATTCTCCTGC CTCAGTCTCC 6540 

CGAGTAGCTG GGATTACAGG CGTGCGCCAC CAAATCTGGC TATTTTTTTA TTATTATTAT- • 6600 

TTTTAGTAGA GATGGGGTTT. CACCATGTTG GCCAGACTGG; TCTTGAACTC TTGGCCTCAA 6660 

ATTATCTGCC CACCTCGGCC TCCCAAAGTG; CTGGGATTAC AGGCATGAGC ACCATGCCCA 6720 

GCTGACAAGA CTAATTTTTT ATCCCTTGGT TTATTGGCTT CAACATCTTC TGGAATCAGA 6780 

GGTGATTTTT TCTTACCTTG GATGCCTGAG" ACTAGGGGAG TATAGAATTC CAAITGGTAA 6840 

TTAAGGCATC TTTCTGCTCC TGATCAGAAG GGCAGGTTAG TTGGGAGAGG TCAGATGGCA 690O 

CAACAGAAGT CACCTTGTAA GTAAGGCAAA GACTTTGAAG GCATTAGCGT TTCTCATTAC 6960- 

TTAGGTCAAT AACCTTGAGG GAATCAATGG CTTTTTTGCC GCTCTACCTC TTTGTGTATC 7020 

TCTTTGACTT- TTCTTTCTCT = GTCTAGTTTC CTCTGTTCTC. AGTTTATATT CTATGTTATC 7080 

AGTCTCTCTT TCCACAGTAC AAACATCCAT CCTTTCTCCT GTGCAATTCT GTCTCTCCCT 7140 

CTTATTATCT TTATTTGTAC TTTTTCCTTC CTCCCTGTCT AGGCATTGGG CATGTGCCTC 7200 

TTCTTAGCCT- GTGATTTTGC CTTGGGACTG ATGATAAATT ATTTCCAGAT TCAATCAGCC 7260 

CTGGTCCTAC CCCAGTCCAA TCAGAAGTAT GTTGGTGGGG AATCAACCTG ATCCTGGCCC 7320 

TTTCTTCTTC TCCATTTTCA- TTCGTAATCC CCCTCAGCAG ATCTTTACAA GCAGTTTCCT 7380 

TATAGCTCAT GTATCTTTAG GTCTTTGCCT TCCAAGCACT. GTACAGAATA CTTTGTGGTT 7440 

CCTTTTTAGT CTGACATTTT GTGGAGCAGT GAAGCGTGCT CAGAGACATA ATCAGCTGAA 7500 

GAGAAAAAAT CCACCCATGG ATTTATATCA GCTAAATACT. AATAATTGAT TTTGTTTGAT 7560 

GTGCCCATAA TTTTTAAAGC TGCAAT AT AA • TATAATGAGG GACCACAGGT AATTTCTCCT 7620 

"GTCATTTGTT;. TTGGCTGGAT GGGGGTGGGG GAGTAATTGC TTAAAGTTTT ACCATTACAC 7680 

ATTAAACTCT CTATAATAAT CTTGTTTGGG GCTTGCTAAC TGTTGAGCTG TTTTAACTAA 7740 

ACTGGTAGGC AATCGGAGTT GATTTAAATG AAAAGATAAT TTAACAAATC TAT ACTATAA 7800 

AAAGAGACAT TTGCTTAATT GACATGTATT TTTTCCTTCT GAGTCACCTA AACATTTACT 7860 

CTTGACACCA ACTGTTCATG ATACTGAATA GACAGTCCAT ATAAGAGAAa' TTAGTGGACC 7920 

TAAAGAAGCC AGATTGTAGG TGTTAATTTA TTAAACAGAA- TTGCAAAGCC CTTGGAAATG 7980 

TCACTGCTTG GCAATACCAT ATGGCATGCC AAAAT TTACA ATGACTTTTC TTTATAAGTT 8040 

ATCCAAAAGG GATTTGAACA AGTAAGAGGT TATGCCAAAA TGTCTCCAAT GTATGGTCCT 8100 

GTAATATATT GCAGCTTGAA GCCAATGATC CCTTATGACT TGTATACAAC TAATGCATGT 8160 

TTTATTGAAT TTTGCATTTC CCACGTGTGG TAAGTCTTTA AAATGTTTTT GATCACCTTT 8220 

NTGTGCCATT AAACTTGTAC AGAAAATGTT TTTATGGCCA TTTTCAAAGG GAGAAAGTTT 8280 

AAAATGGAAA CAGCCCACCC . TTTCTGCCCT ATAGCTGTAG' TTAGAATTGA GTACCTGTAG 8340 

CAAAACAGCT GTAATTGGTG . GTTGTAGTGT TAGAGGTGTT AGCTTGCTAG ' TGACTAGCTT 8400 

TGGAGAGTAA ATGCATGGTA TTGTACATCA CATTTCTTAA CTCGTTTTAA CCTCTGAAAA B460 

GAATATATTC TTCTTTGTAG TCCTTCTTCC CACCCCCTTG CCCTCTCCCT CTCCCTGCTC 8520 

CCAGTTGTCT TACAGTTGTA AATATCTGAT TTGAGGCCCA ATAACTCTTG CCAAGTAAAG 8580 

TCAGCAAACA ACAAACAAAC CAAAATGTGG GGAAAAGGCA TTTCTCAACC ATCTCTCAGC 8640 

AGTTATTGAT CATTTCTTAA GGAACAGCAT TGTGATCAAA GACTCAACTT TACGTAAAAA 8700 

TCAGTGGTAA ATTGGGGTTG" TATTGGCCAT TGATTACATT CAGGATTGAA- TAGTTTTCAG 8760 

AATCACATGT AATCCAAAGA ' CAGTAGGTAG TGATGTCCCT TATCCCTGCA GCTGTTTTAA 8820 

GATAGAGACC TCAGAAGACT CTGCTTGACC GATGACCAAT AATTATTTGA AAAAAAAAGA 8880 

AAAAATGAGA GAAATAAAAC AGATATTTAA GAACTTTAGC CACCTATTTA GAATAGTTAT 8940 

AGCCAGAAAA AAAAACAAGG GCATGAGTTC AAATGCATTA CTATCAGTGT CCTAGGCAAT 9000 

ACCTAACCTA CTCTGAAATT GTGATTCAAA AGCAGTATTT CAAGAGGCAT TCTCCTTTTT 9060 

TGGTTTGCTG ACCCCACTTG GACTGGTAGG TTTGGTGAGG CCCCCATAAA CCAGCTGGAG . 9120 
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CAGACCCTTT TCATCTCCTG TGCCTGTAAC ACCCCTCTTC CCCCACCCCC TCCGCAATTC 9180 
' AATGAGGGCT TTCTTGGGTC ■ AGAGGACTTC AAGGTTGTCT AGAGAAGTTT GCCATGTGTG' 9240 
TAAGGTGCIT3. TGAACTGTGA GTGCTGAAGA TTCGCAGCAT TCAATACCAG GCAGCCAAAG 9300 
AGCTGCTCTT . GCAATTATTT TGGCTCTCAA GCTCTGTTCT TCATCGCATT CTCATTTCTG 9360 
TGTACATTTG CAAGATGTGT GTAATGTCAT TTTCCAAAAA TAAAATTTGA TTTCAAT 



Seq ID NO: 214. Protein - sequence : 
Protein Accession fl:.NP_000546 

1 : 11 . 21 31 41 51 

I .1 I I I I * 

MELDFGHFDE RDKTSRNMRG SRMNGLPSPT HSAHCSFYRT RTLQALSNEK KAKKVRFYRN . 60 

GDRYFKGIVY AVSSDRFRSF DALLADLTRS LSDNINLPQG VRYIYTIDGS RKIGSMDBLE 120 

BGESYVCSSD NFFKKVEYTK NVNPNWSVNV KTSANMKAPQ SLASSNSAQA RENKDFVRPK 180 

LVT 1 1 RSGVK ' PRKAVRVLLN KKTAHS FEQV LTDITEAIKL ETGWKKLYT LDGKQVTCLH 240 

DFFGDDDVFI ACGPEKFRYA QDDFSLDENE CRVMKGNPSA TAGPKASPTP QKT SAKS PGP 300 
MRRSKSPADS ANGTSSSQLS TPKSKQSPIS TPTSPGSLRK . HKDLYLPLSL DDSDSLGDSH 



Seq ID NO: 215 DNA sequence 
Nucleic Acid Accession ftr NM_130467 
Coding sequence : 312 .. 644. 

1 11 21 31 41 51 

I I " I I I I 

GGCACGAGGC AGAGCTCTGC AAGGAGAGGT TGTGTCTTOG TTCTTTCCGC CATCTTCGTT 60 

CTTTCCAACA TCTTCGTTCT TTCTCACTGA- CCGAGACTCA GCCGGTAGGT CTGCAGAGTG 120 

GTCTTCCTGG TAATTTAGTT GTGAGTGAAT GTGTGGAGGA GCCAGCGGGC TTAGGACAGG .180 

TCCTGTGGCA CAGTCCGTGG CTTTGAGGGA AAAGGGCCTC GCGGTGGTCC TCCGCCTTCC .240 

CCCAGGTCGT GATGCAGGCG CCATGGGCCG GTAATCGTGG CTGGGCTGGA ACGAGGGAGG 300 

AAGTGAGAGA TATGAGTGAG CATGTAACAA GATCCCAATC CTCAGAAAGA GGAAATGACC 360 

AAGAGTCTTC CCAGCCAGTT GGACCTGTGA TTGTCCAGCA; GCCCACTGAG GAAAAACGTC 420 

AAGAAGAGGA ACCACCAACT GATAATCAGG GTATTGCACC TAGTGGGGAG ATCAAAAATG 480 

AAGGAGCACC TGCTGTTCAA GGGACTGATG TGGAAGCTTT TCAACAGGAA CTGGCTCTGC 540 

TT AAGAT A G A GGATGCACCT GGAGATGGTC CTGATGTCAG GGAGGGGACT CTGCCCACTT 600 

TTGATCCCAC TAAAGTGCTG GAAGCAGGTG AAGGGCAACT ATAGGTTTAA ACCAAGACAA 660 

ATGAAGACTG:AAACCAAGAA .TATTGTTCTT. ATGCTGGAAA TTTGACTGCT AACATTCTCT 720 
TAATAAAGTT TTACAGTTTT CTGCAAAAAA AAAAAAAAAA AAA 



Seq ID NO i 216 Protein sequence t 
Protein Accession #: NP_569734 

1 11 21 31 41 51 

I I- • I I I .1. 

MSEHVTRSQS SERGNDQESS QPVGPVIVQQ PTEEKRQEEE PPTDNQGIAP SGEIKNEGAP 60 
AVQGTDVEAF QQELALIiKIE DAPGDGPDVR EGTLPTFDPT KVLEAGEGQL. 

Seq ID NO: 217 DNA sequence 
Nucleic Acid Accession #: NM_001476.1 
Coding sequence: 82.. 435 

1 11 21 

1 I I 

GCCAGGGAGC TGTGAGGCAG TGCTGTGTGG 
TGAGATTCAT CTGTGTGAAA; TATGAGTTGG 
CCAAGGCGCT ATGTACAGCC TCCTGAAGTG 
GATGAAGTGG AACCAGCAAC .; ACCTGAAGAA 
GCAGCTGCTC AGGAGGGAGA GGATGAGGGA 
GCTGATAGCC AGGAACAGGG TCACCCACAG 
GGGCAGGAGG TGGACCCGCC AAATCCAGAG 
CAATCACAGT GTTAAAAGAA GACACGTTGA 
TTGTTCATTA AAATTCTCCC AATAAAGCTT 

Seq ID. NO: 218. Protein sequence: 
Protein Accession #: NP_001467.1 

1 U . .21 

I -I- • I 

MSWRGRSTYY WPRPRRYVQP PEVIGPMRPE 
DEGAS AGQGP KPEADSQEQG . HPQTGCECED 

Seq ID NO:. 219 DNA sequence 
Nucleic Acid Accession S: NM_001476 

Coding sequence: 90-3671 . . 

1 Ml' 21 31 41 51 

I . I - l I i I 

ACAGCGGAGC GCAGAGTGAG AACCAGCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 60 

AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 120 

GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA . 180 

ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240 

TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300 

GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360 



31 41 51 

I 1 I 

TTCCTGCCGT CCGGACTCTT TTTCCTCTAC 60 

CGAGGAAGAT CGACCTATTA TTGGCCTAGA 12 0 

ATTGGGCCTA TGCGGCCCGA GCAGTTCAGT 180 

GGGGAACCAG CAACTCAACG TCAGGATCCT 240 
GCATCTGCAG GTCAAGGGCC GAAGCCTGAA ! 300 

ACTGGGTGTG AGTGTGAAGA TGGTCCTGAT . 360 

GAGGTGAAAA CGCCTGAAGA AGGTGAAAAG 420 

AATGATGCAG GCTGCTCCTA TGTTGGAAAT 480 
TACAGCCTTC TGCAAAA 



QFSDEVEPAr PEEGEPATQR QDPAAAQEGE 60 
GPDGQEVDPP NPEEVKTPEE GEKQSQC 
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CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420 

CCAGATGCGA CCGATGTCTG .CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480 

ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATOSCA GGGCCCTGTG 540 

ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT 600 

CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660 

GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720 

TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT. GCAAAGCTCC 780 

AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840 

TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA' AGCCTGTCCT 900 

TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG ' 960 

GTGCTGGTCT ACGGATCACA- GCTCCCTTGA TGCCACTTGG CAAGACACTG. CCTTGTGGGC 1020 

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 1080 

TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA " 1140 
CAT ATGGAGA - ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG - 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 1260 

AATTCTGCCA GGATTGTGCT, TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG. 1320 

GCACCTGTAT' TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG: TGATCCAGAC ACAGGAGATT 1380 

GTTATTCAGG ■ GGATGAG AAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA- 1440 

ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 

CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 

CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620 

TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 

GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 

ACCAGTGCAA AGCAGGCTAC TTCGGGGACC. CATTGGCTCC. CAACCCAGCA- GACAAGTGTC 1800 

GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGT AGG ATGTCGAAGT GATGGCACCT 1860 

GTGTTTGCAA GCCAGGATTT GGTGGCCCCA 7 ACTGTGAGCA- TGGAGCATTC AGCTGTCCAG 1920 

CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 

AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT- ACCTGATACA GAGCTGGAAG 2040 

GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 

AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 

ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT- CTGGGAAGTC 2220 

AGTACCAGAA CCGAGTTCGG. GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280 

CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 

CAAATGGCTT TAAAAGTCTG GCTCAGGAGG* CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 

CAGCCAGTAA' CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT- 2460 

CACTGGTGCG CAAGGCCCTG CATGAAGGAG- TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 2520 

CTGTGGTGCA AGGGCTTGTG GAAAAATTGG. AGAAAA CCAA • GTCCCTGGCC CAGCAGTTGA 2580 

CAAGGGAGGC CACTCAAGCG, GAAATTGAAG. CAGATAGGTC TTA TCAG CAC AGTCTCCGCC 2640 

TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG. 2700 

CAAAGAGGAT. CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGG ATG - 2760 

AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCT CTTAC 2820 

AGAATGGAAA AAGTGGGAGA" GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA- 2880 

AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940- 

TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 

AAGCCATGAA- GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 

AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 3120 

CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 

AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 

GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 

TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3360 

CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420 

AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 3480 

AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 

AGCAGAGGGG CCACCTCCAT. TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600 

AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG : CTACAATACC CAGGCTCTTG 3660 

AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT: TCTTGGGATA CAGATCTCAG 3720 

GGCTCGGGAG CCATGTCATG - .TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 

TGCACCATAC TCCTTGCTTC- CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900 

ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA" AAGACAAACT GCACAGGCAG 3960 

ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020 

ATAGTCAACT. TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080 

ATGAAATTCT TCCTAATGTC AGAACAGAGT. GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 

ACTATTGCCT - CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA' 4200 

ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260 

AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 

GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380 

ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 

GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 

ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560 

ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 

CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 

TTACCTCCAT CCATCCTTCC . AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 

GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA- 4800 

AGCATTTTTA- AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860 

GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4920 

CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 

CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA .5040 

TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT- GGGAAGACTA 5100 

TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 
CAATTGTTAG ATGCC 

Seq ID NO: 220 Protein sequence: 
Protein Accession 8:NPJ>05553 

1.. 11 21 31 41 51 

I I * I I I I 
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MPALWLGCCfc CFSLLLPAAR ATSRREVCDC NGKSRQCIPD 
DGIHCEKCKM GFYRKRERDR CLPCNCNSKG SI»SARCDNSG 
HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGXCVCKPA 
PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA 
SSAQRLDPVY PVAPAKFLGN QQVSYGQSLS FDYRVDRGGR 
MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYFEYRRbL 
DNVTLISARP VSGAPAPWVE CCICPVGYKG QFCQDCASGY 
GGACDPDTGD CYSGDENPDI ECADCPIGFY NDPHDPRSCK 
CNNCPPGVTG ARCELCADGY FGDPPGEHGP VRPCQPCQCN 
- CIHNTAGIYC DQCKAGYFGD PIAPNPADKC RACKCNPKGS 
KCERGAFSCP ACTOQVKIQM DQFMQQljQRM EAL1SKAQGG 
QDILRDAQIS EGASRSLGLQ. LAKVRSQENS YQSRLDDLKM 
RLITQMQLSli AESEASLGNT NI PASDHYVG PNGFKSIAQE 
ETEDYSKQAL SLVRKALHEG VGSGSGSPDG; AWOGLVEKL 
ADRSYQHSLR: LLDSVSRLQG VSDQSFQVEE' AKRIKQKADS 
NWKEEAQQLL- QNGKSGREKSr DQLLSRANLA' KSRAQEALSM 
QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQASRALGSA- 
IEQEIGSLNL- EANVTADGAL AMEKGLASLK SEMREVEGSI* 
AQKVDTRAKN AGVTIQDTLN: TU3GLLHLMD QPLSVDEEGL 
MMSELEERAR QQRGHIiHLLE TSIDGILADV KNLENIRDNL 

Seq ID NO: 221 DNA sequence 
Nucleic Acid Accession &: NM_016529 
Coding sequence: 13-1854 



RELHRQTGNG 
RCSCKPGVTG 
VTGERCDRCR 
VQRNGSPAKL 
HPSAHDVILS 
RNLTALRIRA 
KRDSARLGPF 
PCPCHNGFSC 
NNVDPSASGN 
EPVGCRSDGT 
DGWPDTELE 
TVERVRAliGS 
ATRLAESHVE 
EKTKSLAQQL 
LSTLVTRHMD 
GNATFYEVES. 
AADAQRAKNG' 
BRKELEFDTNv 
VLLEQKLSRA 
PPGCYNTQAL 



FRCLNCNDNT 
ARCDRCLPGF 
SGYYNLDGGN' 
QWSQRHQDVF 
GAGLRITAPL 
TYGEYSTGYI 
GTCIPCNCQG 
SVMPETEEW 
CDRLTGRCLK 
CVCKPGFGGP 
GRMQQAEQAL 
QYQNRVRDTH 
SASNMEQLTR 
TREATQAEIE, 
EFKRTQKNLG: 
ILKNLREFDL 
AGEALEI SSEi 
MDAVQMVITE; 
KTQINSQLRP 
EQQ 



' 60 
120 
180 
240 
300 
" 360 
420 
480 
' 540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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GTCAAGAAAA 
AAAGGGGCTG 
ACATTATGCC 
GCTGATCTCT 
ATATTGAAGG. 
CTGCTACTTG: 
GCAACACTGT 
GCGATTAATA- 
AAGGAGGACT 
AATTTGCTGG 
GCGCTCTCCT.: 

atatgctgca; 

GTGAAGGCCA 
GCCCACGTGG; 
TACGCCATCG- 
TACAACCGGG. 
ATTGAGCTTT 
TGCATCGGCC 
GAGAGGTCTT 
AATGGCGAAG" 
TCCCTCATCC 
GGTCATGCTA 
GTTTGTCTGA 
TGGGGAAGCA' 
ATTCCCATTG^ 
TGGTTGGGAT 
GCCAAGCACA 
CGAGTCCTGG-' 
CGCCTGATCA 
CAGCAGGGCG 
GAAGAAGTCA 
AATTTTCCTG 
TTTGTCAGAG- 
AGTTAAGCAG 
AGCTATCTTT 
ATGAAGCATT 
AGAAAAAAAA 



11 
1 

GAATGTCTGT. 
ATAATGTGAT 
ATCTGGAATA 
CTGAGAATGA 
ACAGAGCTCA 
GAGCCACAGC 
TGAAGGCAGA 
TAGGGTATTC 
CTTTGGATGC, 
GCAAGGAAAA- 
TOGAAGTCCG* 
GAGTGTCTCC: 
TCACCCTCGC 
GTGTGGGAAT. 
CACAGTTTTC . 
TGACCAAGTG 
GGTTCGCCTT 
TGTACAATGT 
GCACTCAGGA 
GCTTCAACAC; 
TCTTCTGGTT: 
CCGACTATTT 
AAGCTGGTTT, 
TGCTGACCTG 
CTCCAGATAT 
TATTTCTGGT 
CCTGCAAAAA 
GAAAAGCGGT- 
AGAGGCTGGG 
TCCCGCATGG 
TCCGTGCTTA 
ACTGATCTTA 
AAGACTGGCG: 
TTTGTTAGTT 
GCCCTCCCAA 
CAACTGTGCT 
AAAAAAA 



21 
} 

AATTGTTCGA 
TTTTGAGAGA 
CTTTGCCACG 
GTATGAGGAG 
ACGGTTGGAA 
CATAGAAGAT 
AATTAAAATA 
CTGCCGATTG 
CACAAGGGCA 
TGACGTGGCC 
GAGGAGTTTC 
TCTGCAGAAG 
CATCGGAGAC. 
CAGTGGGAAT 
CTACTTAGAG, 
CATCTTGTAC 
TGTTAATGGA 
GATTTTCACC 
GAGCATGCTC 
AAAGGTTTTC 
TCCCATGAAA 
ATTTGTTGGA 
GGAGACCACA 
GCTGGTGTTT 
GAGAGGACAG 
TCCTACTGCC 
GACATTGCTG 
GCTGCGGGAT 
CCGGAAGAGG 
GTATGCTTTT 
TGACACCACC 
GGAAAGAGAT- 
TCCAAGGCCA 
ACATATTCCC 
CTCGTCTGCA 
CTGTGAGGTC 



31 
1 

ACTCCTTCAG 
CTTTCAAAAG 
GAAGGCTTGC 
TGGCTGAAAG 
GAGTGTTACG:: 
CGCCTTCAAG . 
TGGGTGTTGA' 
GTATCGCAGA 
GCCATTACTC 
CTCATCATCG 
CTGGATTTGG: 
TCTGAGATAG; 
GGCGCCAACG 
GAAGGCATGC 
AAGCTTCTGT 
TGCTTCTATA 
TTTTCTGGGC 
GCTTTGCCGC 
AGGTTTCCCC 
TGGGGTCACT 
GCTCTGGAGC 
AATATTGTTT 
GCTTGGACTA 
TTTGGCATCT 
GCAACTATGG : 
TGTTTGATTG 
GAGGAGGTGC 
AGCAATGGAA 
CCCCCGACGC 
TCTCAAGAAG. 
AAAAAGAAAT 
TCAGTTTGTT 
AAACACCAGG 
TCGCAAACCT. 
GTGCTTAGCC 
TCAAATTAAA 



41 

1 . 

GACGACTTCG 
ACTCAAAATA 
GGACTCTCTG 
TCTATCAGGA 
AGATCATTGA 
CAGGAGTTCC 
CAGGAGACAA 
ATATGGCCCT 
AGCACTGCAC 
ATGGCCACAC 
CACTCTCGTG 
TGGATGTGGT 
ATGTCGGGAT 
AGGCCACCAA 
TGGTTCATGG 
AGAACGTGGT 
AGATTTTATT 
CCTTCACTCT 
AGCTCTACAA 
GCATCAACGC 
ATGATACTGT 
ACACATATGT 
AATTCAGTCA 
ACTCGACCAT 
TCCTGAGCTC 
AAGATGTGGC 
AGGAGCTGGA 
AGAGGCTGAA 
TGTTCCGGGG 
AACACGGAGC 
CCAGGAAGAA 
GCACCCAGTG 
AAACACATTT 
GGAGTGCAGA 
TAACTTTTGT 
AACATTATGT 



51 

.1 

GCTTTACTGT 

tatggaggaa* 

tgtggcttat 

agccagcacc 

gaagaatttg: 

agaaaccatc 

acaagaaagt:- 

tatcctattg' 

tgaccttggg 

cctgaagtac 

caaagcggtc 

gaagaagcgg ■ 

gatccagaca' 

caactcggat. 

agcctggagc 

cctgtatatt 

tgaacgttgg 

GGGAATCTTT 
AATCACCCAG : 
CTTGGTCCAC 
GTTTGACAGT 
TGTTGTTACT 
TCTGGCTGTC 
CTGGCCCACC 
CGCACACTTC- 
ATGGAGAGCA 
AACCAAGTCT 
CGAGCGCGAC 

cagctccctg 
tgttagtcag. 
ataagacatg: 
ttaacacatc- 
ctgtggcctt 

CCACAGGGGA 
TTATGTCGTT 
TTCACCAATA 



Seq ID NO: 222 Protein sequence: 
Protein Accession #: NP_05.7613 



MSVIVRTPSG. 
ENEYEEWLKV 
-KAEIKIWVLT 
KENDVALI ID 
TIiAIGDGAND 
TKCILYCFYK 
TQESMLRFPQ. 
DYLFVGNIVY 
PDMRGQATMV 
KAVLRDSNGK 
RAYDTTKKKS 



11 - 
I 

RLRLYCKGAD 
YQEASTILKD 
GDKQETAINI 
GHTLKYALSF 
VGMIQTAHVG 
NVVLYHEIiW 
LYKITQNGEG 
TYVWTVCLK 
LSSAHPWLGL 
RLNERDRLiIK 
RKK 



21 
I 

nviferlskd 
raqrleecye' 
gyscrlvsqn 
evrrsfldla 
vgisgnegmq. 
pafvngfsgq 

FNTKVFWGHC 
AGLETTAWTK 
FLVPTACLIE 
RLGRKTPPTL 



31 
I 

SKYMEETLCH 
IIEKNLLLLG 
MALIXiLKEDS- 
LSCKAVICCR 
ATNNSDYAIA 
ILFERWCIGL 
INALVHSLIL 
FSHLAVWGSM 
DVAWRAAKHT 
FRGSSLQQGV 



41 

I 

LEYFATEGLR 
ATAIEDRLQA 
LDATRAAITQ 
VSPLQKSEIV 
QFSYIiEKLLL 
YNVIFTALPP 
FWFPMKALEH 
LTWLVFFGIY 
CKKTLLEEVQ 
PHGYAFSQEE 



51 

I 

TLCVAYADLS 
GVPETIATLI* 
HCTDLGNLLG 
DWKKRVKAI 
VHGAWSYNRV 
FTLGIFERSC 
DTVFDSGHAT 
STIWPTIPIA 
ELETKSRVLG 
HGAVSQEEVI 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



Seq ID NO: 223 DNA sequence 
Nucleic Acid Accession #: BC017001 
Coding sequence: 1-394 
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31 



41 
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I .■ 

AACGCTGGGC 
CGOGCAGGGC 
CGTGCTCATC 
CACGATCCGG 
CCAGGGAGGT 
ACTGCAGCGA" 
GTACCTGGGA 
TTCAAGAGCC 
AGGGGTGCAT 
CTACAAGGOC 
AGGATCATGG 
GCCACGTGCT: 
TAACCTCTTA' 
TAAACAAACA 
TATTGGTTCA 
CAGTGCTCGT 
TTCAAGGAAG 
ATCAGAAAGT. 
CTTAGATGCC 
ACAGCAGGGA' 
GGCACACTGC 
TGTACCAGTT 
CCCTGTATTT. 
CTGCTGCTCA 
TGGATGGTCT- 
TGCTTGGTTC . 
GCTTCAGGAT.. 
GGAACTGGGA 
GGCTTTACCC 
.AAGAGCTTGT 
AAAGGTATTT 
GAAGGCACTA 
AGTGAGTCTC 
GGTATGGGGT 
GGGAAAGGGA 
CTCGGCACTT- 
GCATTCTCTT 

gtgtcttcag 
gggaaaattg. 
tggttttgac 
gggtatacag 
aaaataaaag 



I:.. 

AGGGCCGGCG. 
GGCAGCATCC 
CGGCCGCTGC 
GAGGACCCAG 
GACTACTTCT 
GAGACCATCC 
GCATCCACAC 
CAGAAGACAC 
TCTCTTTGCA 
TTGGACTCAC 
CCTTACCCTT 
GCCGGCTCCA* 
TATGGACTAC 
GAAGATTGTT. 
ACAACTCCAC 
TACTTCATGG 
GAGGGGTGGA 
AAAACCTCGT 
CTTGGCCCGA 
ACAGAATTGT' 
CTCCTGCAAT. 
AGCTTTTGCT 
TCCCACAATC 
CACCTGGGAT 
AGGATAGCCT- 
TCCTCCATGT 
TCCAAAGAGA 
CTAGGACAGT' 
AGATTCAAGG 
GGCCATTTTT 
CCCTCTTCCC 
AGACATTGTC 
TCCATGGGCT- 
TAGGCCAGGT 
ACAGAGTAAG 
TTAAGCCTCA 
TCCCCGGGAA 
TGCAATGTGG 
GAAATCTGAA 
AGATGTACCA 
GAACTCTTTG 
TTTATTTAAT 



CGGGTCGGGG; 
ACTCGGGCCG 
CGTCOGTGTT 
ACAGCGTGCC 
ACTCCTTTGG 
CCGCCAAGCT 
CAGACTTGCA 
ACCTGGCCTC 
CCTGGCGAGA 
TGTACAGTGT. 
GAAGCATTAC 
AATTCCCAAG 
ATTCAGCTGC 
TTTCCACATA 
GGAGGTAGGG 
TCCCAAGATG 
GGAAGAGGAA 
CAGAAGTCTG 
GCCAGCTACC 
CATGGCTGAA 
AAAACTGGGA 
GTGTAACAAA 
CTATGGGTTG 
CCCTCATGGA 
TACTCACTTG: 
GGCCTCTCCA 
GTGAGAGTAG 
GTCACTTCTG 
GATGAGAAAC 
CACCTATCAC 
CCTTTCTCTC 
CTGGCCCTCA 
GGACCCACTG 
AGGACATTCC 
GCAGGCCTTG 
GTTTCTCCAG 
AGTTGACTGA 
CATCCTGGAT 
AAAAGTCTGA: 
TGGTGATGTA 
TACTATCTCT 
TTAAAAAAAA 



GGCGCCCGAG 
CATCGCCGCG 
GGACCCCGCC 
CCCCATCGAT 
GGGCTGCCAC 
TGTCCAGTCC 
GTAGCAGCCT- 
CAGCAGGCTG 
GGGTCTGACT 
GGGAGCCCCA 
CGAGAAGGAG 
GACAAGGATC 
AAGGAAAGGA 
GCATGGATTC 
GTCACGTCTT, 
GCTGCTGTAT, 
GGGCCAAACT 
TTTCCTGCTC 
ATTGCACCTC 
TAGACCAATC 
TCCCATTACC 
CCATCCCCAA 
GCAATTTGGG 
GCTAAGGTCA 
CCTGGCAGGT. 
GCAGGCTAGC 
AAGCTGAAAG 
CTAAGTTCTT 
AGACTACATG 
AAATAATTTT 
TGTCTCATGG 
GGGTCTAGGG 
TAGTAGGAGT 
AGAGGGGCTT 
TTCTCACTGC 
TTCAATAATA 
GGTGACCAGT 
TGGGTCTTGG 
ATTTTAGTTA 
:AGATGTTGAC 
GCAACTTCTC 
AAAAAAAAAA 



GGGCCCGGGC 
GTGCACAAOG 
AAGGTGCAGA- 
GTCCTCTGGA 
CGCTACGCGG 
ACTCTCTCAG 
CCTTGGCACC 
GGCCATGCAG 
CTGGGCACCC 
GTTCCCACCT 
AACAGAGATG- 
CCTCTGCATT. 
AAACCTTGAT. 
TGGAGATGGG- 
GGATCCTTTT 
CCCCAAGAAT 
AGCTGGACCC 
TCTCCCTCTG 
TAGCTGCAAA 
GTGTTCCATC 
AAGAGAGAAA 
ACTTGGCAGC 
CTGGGCTCAA" 
GCTGTTACCT 
GACAGGCTGT 
TCAGGCTTAT 
ACTTCTTGAG 
TTGGTCAGAG 
TCTTGATGAG 
GGATGGGTAT. 
GGCCTCACTC 
GAAGAGGTGT 
GCCTCCTTGT 

ctgaaaacca: 
cctctaaggg. 
aggacaagag 
aatagaattg 
aacaaaaaca 
atataccaat 
cttggggtag: 
tgtaaatcta 

AA 



CGAGCGGCGG 

TGCCGCTGAG 

GCCTCGTGGA 

TCAAAGGGGC 

CCTACCAGCA 

ACCTAAGGGT ' 

TGCTGCGACC 

AAGGGATAGC 

CTCTCACCGG 

CTGTGACAAT 

GGCTTGAAGA 

TTTGTCTATG 

TGCAGTGGTT 

TGGCTAATGG 

GCCTTAATCT 

CATGTCTGCG- 

GTCACCTTCT 

CATATCTTCA 

CAAAGCTAAG 

TACTGAGACT 

TGCAGAATTG ' 

TAGAAACAAA 

CAGGGCAGTT 

CAGCTGGGCC 

TGGCTGGAAT 

TCACATGATG ' 

TTCTTGGCCT 

CAAATCACAA *• 

GGGAACCACA 

TTATTTGGAT 

TGCCAAGTTG 

TGGGGCAGGA-' 

CTGCACTGCT 

AGAGTCCCTG 

AACTTGGTCA 

CTTTTCCCAT 

AAAAGGGAGA : 

GGACATTAGT 

TTCAGTCTCT. 

GCTGGGTGAA 

GTATCATTCC 



60 
"120- 
. 180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
. 840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680- 
1740 
1800 
I860 
1920 
1980 
2040- 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



Seq ID NO: 224- Protein sequences 
Protein Accession ft: AAH17001.1 

1 11 • 21 . 31 fl 51 

I 1 I I I I 

TLGRAGAGRG APEGPGPSGG AQGGSIHSGR IAAVHNVPLS VLIRPLPSVL DPAKVQSLVD 
TIREDPDSVP PIDVLWIKGA QGGDYFYSFG GCHRYAAYQQ LQRETIPAKIi VQSTLSDLRV 
YLGASTPDLQ . ' * 



Seq ID NO j 225 DNA sequence 
Nucleic Acid Accession #: NM_021048 
Coding sequence : 1 . . 1110 



ATGCCTCGAG 
GAGACACAGG 
TCCACTTCCA 
TCCTCCTGCT 
CCAAATCCTC 
CCATTAGATC 
CAGGTCCTGC 
TTGGTGCAGT 
CTGGAGAGTG 
GAGTGCATGC 
TTTGTCCTTG 
ATGCCCAAGA 
ACCCCTGAGG. 
CACCTCATTT 
CTGGAGTACC 
AGGGCTCATG 
AGTGATCCAA 
GCCCAGGACA 
GCTACAGGTA 



11 
I 

CTCCAAAGCG 
GCCTCGAGGG 
CCAGCTCCTC 
ATCCTCTAAT 
CCCAGAGTGC 
AATCTGATGA 
CAGACAGTGA' 
TTCTGCTCTT 
TCATAAAAAA 
TGCTGGTCTT- 
TCACCTCCCT 
CTGGCATTCT 
AGGTCATCTG 
ATGGGGAGCC 
GGCAGGTGCC 
CTGAAATTAG 
GATCCTTCCC 
GAATTGCCAC 
GCTTCTCCTA 



21 
I 

TCAGCGCTGC 
TGCACAGGCT- 
TTTTCCATCC 
ACCAAGCACC 
TCAGATAGCC 
GGGCTCCAGC 
GTCTTTACCC 
CAAGTATCAA 
TTATGAAGAC 
TGGCATTGAT 
GGGCCTCACC 
CATACTTATC 
GGAAGCACTG 
CAGGAAGCTG 
TGGCAGTGAT 
GAAGATGAGT 
ACTGTGGTAT 
CACAGATGAT 
CCCTGAATAA 



31 
I 

ATGCCTGAAG 
CCCCTGGCTG 
TCTTTTCCCT 
CCAGAGGAGG 
TGCTCCTCCC 
AGCCAAAAGG 
AGAAGTGAGA 
ATGAAGGAGC 
CACTTCCCTT 
GTAAAGGAAG 
TATGATGGGA 
CTAAGCATAA 
AATATGATGG 
CTCACCCAAG 
CCTGCACGGT 
CTCCTGAAAT 
GAGGAGGCTT 
ACTACTGCCA 



41 
I 

AAGATCTTCA 
TGGAGGAGGA 
CCTCCTCCTC 
TTTCTGCTGA 
CCTCGGTCGT 
AGGAGAGTCC 
TAGATGAAAA 
CGATCACAAA 
TGTTGTTTAG 
TGGATCCCAC 
TGCTGAGTGA 
TCTTCATAGA 
GGCTGTATGA 
ATTGGGTGCA 
ATGAGTTTCT 
TTTTGGCCAA 
TGAAAGATGA 
TGGCCAGTGC 



60 
120 



51 
I 

ATCCCAAAGT 
TGCTTCATCA 
TTCCTCCTCC 
TGATGAGACA 
TGCTTCCCTT 
AAGCACCCTA 
GGTGACTGAT 
GGCAGAAATA 
TGAAGCCTCC 
TGGCCACTCC 
TGTCCAGAGC 
GGGCTACTGC 
TGGGATGGAG 
GGAAAACTAC 
GTGGGGTCCA 
GGTAAATGGG 
GGAAGAGAGA 
AAGTTCTAGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



Seq ID NO: 226 Protein sequence: 
Protein Accession #: NPJ)66386 

t . r -. r i 1 " " 

MPRAPKRQRC MPBEDLQSQS ETQGLEGAQA PIAVBEDASS STSTSSSFPS 



60 



276 



WO 02/086443 ' 

SSCYPLIPST PEEVSADDET PNPPQSAQIA CSSPSWASL PLDQSDEGSS. .SQKEESPSTL 120 
QVLPDSESLP RSEIDEKVTD LVQFLLFKYQ MKEPITKAEI LESVIKNYED HFPLLFSEAS- 180 - 
. ECMLLVFGID VKEVDPTGHS ' FVLVTSLGLT YDGMLSDVQS MPKTGILILI LSIIFIEGYC . 240 
• TPEEVIWEAL NMMGLYDGME . HLI YGEPRKL LTQDWVQENY IiEYRQVPGSD PARYEFLWGP 300- 

• 'RAHAEIRKMS. LLKFLAKVNG SDPRSFPLWY EEALKDEEER AQERIATTDD TTAMASASSS 360 
" ATGSFSYPE • . . 

; Seq ID NO: .227 DNA sequence 
.Nucleic Acid Accession #: NMJ>05025.1 
Coding sequence: 82-1314 

1 - 11 21 31 41 51 

I- - I - I I I I • 

GCGGAGCACA GTCCGCCGAG CACAAGCTCC AGCATCCCGT CAGGGGTTGC AGGTGTGTGG 60 
GAGGCTTGAA ACTGTTACAA TATGGCTTTC CTTGGACTCT TCTCTTTGCT GGTTCTGCAA. .120 
AGTATGGCTA CAGGGGCCAC TTTCCCTGAG: GAAGCCATTG CTGACTTGTC AGTGAATATG: .180 
TATAATCGTC " TTAGAGCCAC TGGTGAAGAT- GAAAATATTC. TCTTCTCTCC ATTGAGTATT .240 
GCTCTTGCAA TGGGAATGAT GGAACTTGGG GCCCAAGGAT" CTACCCAGAA AGAAATCCGC . 300 
CACTCAATGG GATATGACAG CCTAAAAAAT GGTGAAGAAT- TTTCTTTCTT. GAAGGAGTTT 360 
TCAAACATGG. TAACTGCTAA 4 AGAGAGCCAA TATGTGATGA AAATTGCCAA TTC CTTG TTT ■ 420 
GTGCAAAATG GATTTCATGT. CAATGAGGAG TTTTTGCAAA- TGATGAAAAA ATATTTTAAT v. 480 
GCAGCAGTAA- ATCATGTGGA CTTCAGTCAA AATGTAGCOG TGGCCAACTA CATCAATAAG . 540 
TGGGTGGAGA ATAACACAAA CAATCTGGTG AAAGATTTGG TATCCCCAAG GG ATTTTGAT * 600 
GCTGCCACTT ATCTGGCCCT CATTAATGCT- GTCTATTTCA AGGGGAACTG GAAGTCGCAG .660 
TTTAGGCCTG" AAAATACTAG AACCTTTTCT ; TTCACTAAAG. ATGATGAAAG TGAAGTCCAA 720 
ATTCCAATGA TGTATCAGCA AGGAGAATTT. TATTATGGGG AATTTAGTGA TGGCTCCAAT . 780 
GAAGCTGGTG GTATCTACCA AGTCCTAGAA- ATACCATATG AAGGAGATGA AATAAGCATG .840 
ATGCTGGTGC TGTCCAGACA GGAAGTTCCT CTTGCTACTC TGGAGCCATT AGTCAAAGCA 900 
CAGCTGGTTG AAGAATGGGC AAACTCTGTG. AAGAAGCAAA AAGTAGAAGT ATACCTGCCC 960 
AGGTTCACAG TGGAACAGGA AATTGATTTA' AAAGATGTTT, TGAAGGCTCT TGGAATAACT." 1020 
GAAATTTTCA- TCAAAGATGC AAATTTGACA' GGCCTCTCTG ATAATAAGGA GATTTTTCTT 1080 
TCCAAAGCAA TTCACAAGTC CTTCCTAGAG" GTTAATGAAG AAGGCTCAGA AGCTGCTGCT 1140 
GTCTCAGGAA" TGATTGCAAT TAGTAGGATG GCTGTGCTGT ATCCTCAAGT TATTGTCGAC 1200 
CATCCATTTT TCTTTCTTAT- CAGAAACAGG AGAACTGGTA CAATTCTATT CATGGGACGA 1260 
GTCATGCATC. CTGAAACAAT GAACACAAGT GGACATGATT TCGAAGAACT TTAAGTTACT - 1320 
TTATTTGAAT AACAAGGAAA ACAGTAACTA AGCACATTAT- GTTTGCAACT GGTATATATT 1380 
TAGGATTTGT GTTTTACAGX ATATCTTAAQ. ATAATATTTA AAATAGTTCC AGATAAAAAC 1440 
AATATATGTA AATTATAAGT AACTTGTCAA GGAATGTTAT CAGTATTAAG CTAATGGTCC 1500 

• TGTTATGTCA TTGTGTTTGT GTGCTGTTGT. TTAAAATAAA AGTACCTATT GAACATGTG 

Seq ID NO: 228 Protein sequence: 
Protein Accession #« NP_00 5016.1 

1 il 21 31 41 51 " 

I I I.I I I 

MAFLGLF5LL VLQSMATGAT- PPEEAIADLS VNMYNRLRAT GEDENILFSP LSIALAMGMM 60 
ELGAQGSTQK EIRHSMGYDS LKNGEEFSFL KEFSNMVTAK ESQYVMKIAN SLFVQNGFHV 120 
NEEFLQMMKK YFNAAVNHVD FSQNVAVANY INKWVENNTN NLVKDLVSPR DFDAATYLAI* 180 
INAVYFKGNW KSQFRPENTR TFSFTKDDES EVQIPMMYQQ GEFYYGEFSD GSNEAGGIYQ 240 
VLEIPYEGDE. ISMMLVLSRQ. EVPLATLEPL VKAQLVEEWA NSVKKQKVEV YLPRFTVEQE 300 
IDLKDVLKAL. GITEIFIKDA NLTGLSDNKE IFLSKAIHKS FLEVNEEGSE AAAVSGMIAI 360 
SRMAVLYPQV: IVDHPPFFLI RNRRTGTILF MGRVMHPETM NTSGEDFEEL 



Seq ID NO: 229 DNA sequence 
Nucleic Acid Accession #: NM_003695 
Coding sequence: 12-398 

1 11 21 31 41 51 

I I I I i I 

CGACATCAGA GATGAGGACA GCATTGCTGC TCCTTGCAGC CCTGGCTGTG GCTACAGGGC 60 

CAGCCCTTAC CCTGCGCTGC CACGTGTGCA CCAGCTCCAG CAACTGCAAG CATTCTGTGG 120 

TCTGCCCGGC CAGCTCTCGC TTCTGCAAGA CCACGAACAC AGTGGAGCCT CTGAGGGGGA 180 

ATCTGGTGAA GAAGGACTGT GCGGAGTCGT GCACACCCAG; CTACACCCTG CAAGGCCAGG. 240 

TCAGCAGCGG CACCAGCTCC ACCCAGTGCT GCCAGGAGGA CCTGTGCAAT GAGAAGCTGC 300 

ACAACGCTGC ACCCACCCGC ACCGCCCTCG CCCACAGTGC CCTCAGCCTG GGGCTGGCCC 360 

TGAGCCTCCT GGCCGTCATC TTAGCCCCCA GCCTGTGACC TTCCCCCCAG GGAAGGCCCC 420 

TCATGCCTTT CCTTCCCTTT CTCTGGGGAT TCCACACCTC TCTTCCCCAG CCGGCAACGG 480 

GGGTGCCAGG AGCCCCAGGC TGAGGGCTTC CCCGAAAGTC TGGGACCAGG TCCAGGTGGG 540 

CATGGAATGC TGATGACTTG GAGCAGGCCC CACAGACCCC ACAGAGGATG AAGCCACCCC 600 

ACAGAGGATG CAGCCCCCAG CTGCATGGAA GGTGGAGGAC AGAAGCCCTG TGGATCCCCG 660 

GATTTCACAC TCCTTCTGTT TTGTTGCCGT TTATTTTGTA CTCAAATCTC TACATGGAGA 720 
TAAATGATTT AAACC 

Seq ID. NO:. 230 Protein sequence: 
Protein Accession ft: NPJ303686 

1 ■ .11. 21 31 41 .51 

I". • I • I I I I 

MRTALLLLAA LAVATGPALT LRCHVCTSSS NCKHSWCPA SSRFCKTTNT VEPLRGNLVK 60 
KDCAESCTPS YTLOGQVSSG TSSTQCCQED LCNEKLHNAA PTRTALAHSA LSI/3LALSLL 120 
AVILAPSL 



Seq ID NO: 231 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 126-752 



277 
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WO 02/086443 

1 . . "11 .21". 31 41 51 " ~ 

• I :. -.I.:- .' I .1 I I. 

CCGGGCAGGT GGCTCATGCT • CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 
AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCACTCAG ' 
AGAAGATGAA v GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 
GTGTGAGGGA GAGAACCAGC ACTTCT GG GA CGCACAGAGA' CCGTGAAGAT TCCAAGTTCA- 
GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 
TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 
GAAAGTACCA tcatggcttg: AGTGCTCTGA AGCCCATCCG: GACTACTTCC AAACACCAGC 
ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTOGTGGCTT TCTTCTCTGG 
CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 
ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 
AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 
TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAAATT 
TTCAGGATGG. CTGTATTCTG CGGTCAGAAT GAGAGAGTCA AGCTGGGCAG AATCTCTCGC 
CAAGAGTTCA GCCTTCCTTT GGAGACTGCT- CCATCAGTGC CGAGGTGTGT GGGAACAGGC 
TTCACTGCAC CGCCATCTTA CTGAGTTGCT; TCACGTGAGG. AAAAGGGGGC TTTGGCCCTG. 
TGACTCAGTT, CCACATTTTG. GATTGCATAC TGGAAAAGAA' GCCAATCTTC TTGCTAGTAA 
ACCAGCAACC CGGCTGTATA CAGTGGTGAC CCAAGCAATG GATATAAACC TAAAAATCTG 
AGGGAGGGGA GAGGTGGAAT ACAGTAGTTC: TTGGAATCTO: AAGTCTCCTA- TTTGATCAGG 
TTATTTCCTG GGACTTGGCA> AAAATCTGAT TGGTGGGGAT CTCCTAGGAC CTAGTGGACA 
TCTGGTATTA ATTTAATCTC AGGAAAAACA AGAAATTAAC: CCAGAGAGAG TCTGGGTTTT 
GGAATTCAGC GTAGCTACCT. CCAGACCGTG; GTGTCTGGCC TCCATTTTTG TCTGTCATTC 
AGCTCTGACT TACAGCTGCA- GTCACCTTTG"* CTATAAGGCA CCTGGGTAGA AGGGTGGATG 
GG CTTCACAT CAATTTTTTT. CTTCCTTTAG. GGTGGGGGAT TGGTTTGGCT TTCTTTTGTT 
GTGGTTTTTT-. GTTTTATTTTj TGTCAAGATT GATTTTTAGA- TGCAAGGACT TGAAAAGACC 
CAGAAGGATG CCACCAGTTT- TTCCTTGAGG CCT AGGATTT * TTTATTCTGT CCCGAGCAGA 
GGTAATTCCT CACAACTTAG TGCACCAGTA GCACCAGCCA* TTTTGAGCAG AGTACCTCTT 
TGGGGAGCTT TTCGTTTTGT TTTGTTTTTA' ATTCTCTTTC CTTAGCAGCA AGGTCTTTTT 
TCCTAGAGAA TCTACTCOGT TGCAGAATCA TTGCAACCTC AGGAGCCCTC ACTGATTGAG 
TGCTGTCAGC CTGATATACT ACTTTGGACT CTGGAAACAG' ATATGGGTTC TATTCTCTAT 
TTCTACTGTG TGTCGTTAAA CAACCGTCGG AGACCAGATG ACCTGTTAGA TGGCTAGTCC 
TGTATAACTC GACTCTGTAT GTTTCAATGT ATGTTACTGC AATGCTTCAC CTGCTGTACA 
GTGTTTGTGA. GATGCTCTTT GAAGATGGTA CTTTTATATT T 

Seq ID NO: 232 Protein sequence: 
Protein Accession # : Eos sequence 

21 



PCT7US02/I2476 



41 



SI 



1 11 21 31 

I I I I I I 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL BTAARAEGLS 
LDASMHSQLR ILDEEHPKGX YHHGLSALKP IRTTSKHQHP VDKAGLFSCM TFSWLSS1AR 
VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIPCRTRLI 
LSIVCLMITQ LAGFSGPNFQ DGCILRSE 

Seq ID NO: 233 DNA sequence 

Nucleic Acid Accession ft: CAT cluster 



1 
I 

TTTTAATGGT 
CACAACATGA 
TGTTTCTTAG 
TAGTAAACTT 
TTTGTTTCCA 
TGTTTGACTG 
TTGTTATGGT 
AAGCTATTTT 
TATATTATGG 
ATTAGCCTAC 
AATAGGTTAA 
AATTAAAGTA 
TATGAAAGCT 
TATATATTTT 
TATCCTGTAT 
TAGAAGACCC 
TTTTTTTTAG 
GATTCTGTTA 
TCCAGATAAA 
CTTAGGCTCT. 
CCATAGTATC 
ACTATAATAG 
TATCAATGTT 
TACATAAAAA 
AAGTCATTAA 
AGAATCATCT 
GGAAGATGGA 
GAGTTGAAGA 
CTTTTAAAAC 
TATTCTTCTA 
AGCCCTTGTG 
CTGTTCAGGT 
TGAGATGGAG 
ATCATATGGA 
TTTTAATTCT 
CAATCCCCAA 
TTTCTGGAAT 



11 

I- 

GCTCATATAT 
ATCACATAAT 
ATCGTAGTCA 
CTCTTTCATC 
AAGTCACAAT 
AAAGCAAAAC 
ATATCTTTTT 
AATCATCAAG 
ATTAACCAGA 
TATAGTATTA 
AAAGTAGTTA 
CATTTTAAAT 
CTGGCTATCA 
TTAGTTCCTT 
TTTTTTTAAG 
ACTCTTACTA 
GTAGTTTAAA 
AGCATCCAAA 
AGATGGAGAA 
CTCCATGTAT 
AAGTGGAGGG 
AAGTTTGAGT ' 
ATCCAATGAT 
GTGCTCATGT 
TAATTTAATA 
GAGGACTTTT 
GCCATGCTTG 
CCACTGCTCT 
TAGTAATGTA 
ATAACTAGCA 
TAAGATTATT 
GAGAAAACAT 
GAGGTGGGCA 
AGTTTGGAAA 
CACTGTCTCA 
GGGGCAGTGT 
TGCACCTACA 



21 
I 

ACTGTATTTT 
CATGATTTTT 
TTGAGAAGTC 
TTTGTGTTAG 
TGAATTATTC 
AACGTGACAG 
ATTAAATATT 
TATGGAAAAC 
ATTGTATCAT 
ATAAATTTTT 
CAAATTAAAC. 
GAGCTTTATA 
TCCTGGGATA- 
TGAGATAACT 
AATTGTTTTA 
GGTTCCCTAA 
GCAAGCACTG 
AACAATGCCT 
TACCTCATGT 
CTTTCTTAAG 
TAGTTCAGAA 
AATATTTTAA 
TTTTATT AAA 
ATTTGAATTT 
ATTGTTTTAA 
AATATGGAAT 
TTTTTCCAAA 
AAATTAGTGC 
CCCAGTTAAG 
TTTATTACAT 
ATTTCTTCTC 
AATGGATTTT 
CATTTAAGGT 
AGAGAGCTTA 
AAAGAGAATC 
TACCTTACTC 
TGTTTTCTTA 



31 
I 

TTGTTGTTTA 
TTTTTTTACT 
CCAATAACTC 
CTCTGTAGTC 
TTAGATACCT 
TTTATTTTCA 
TATTTTGACT- 
AAATTACTAT- 
TTTTGGCCTA 
CAGTTGGTTT 
TTACTAATTT 
ATACCTTAAA 
GTAATTTCTA 
AATTTCTAAT. 
TAAATAGGTC 
GGATCTGCCA 
ATACCAGTGG' 
AATTTCAGTT 
ACTGTGACTT 
GAAAAGTTTC 
AAGTTAATAG 
TAAATTTATA 
AAATTACCTT 
TAAATAATTT 
ATCAGTGGTT 
CCACCTCATA 
AGCTCTTTGA 
AGGAAAATGC 
TTTTGATGGT 
GAAATTTAAG 
TATAACTTCA 
TTTTTTTTTC 
CAGTTCACTA 
TCACAGGTTT 
AGCTCTCCAG 
CTTCACTGCT 
TTAACATTCA 



41 

I 

GTTTTACTTA ' 
TTTACTCCCC . 
TAAACTTTTG 
TTAACCTGGA ' 
TAAGCCACTG 
AACACTAACT . 
AAGCTTTCAT , 
TGCATTTTCC 1 
ATGTCTGGAT . 
GGGCAAATTT . 
ATACCTGATT 
AAGTTGGTTC 
ATTATATAGT 
TATATATGTT- 
ATAAGATACA . 
TAGATTTTTT 
GAGTTGGTCT 
CTTAGGTTAT 1 
GAAAATGAAT 
TGAGTGTGAT 
GAAATCTTTT 
TAATTCAAAT i 
ATTATTAGAA i 
ATTTAAATCA , 
TTCAACCCTC . 
ACAATTAAGT'C 
GTGATTCTAA 
TTTTATTTCT 
TTAAATTCCA 
AGTTTAAGTT 
AAATAGATAT 
CTCTGGAGCT I 
ACCTATGGTT « 
GTATGCTGGT 
CAGTTCTAGA , 
TCTTAGAAGG 
GAATTGGGAA ' 



• 60 
120 
180 
240 
300 
360 
420 
480 
540 
• 600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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WO 02/086443 

TCCAGTGAGT A GTTT TCTGA AATTGGTAAC TTGGAGA6TA 
CAATTTTGTG- TTTGTTTACT TTTATGTAAA AATTTGATAT 
AAAACCTCAT GCCTTTTCAT TACATCTAAT TTGAACTCTC 
TTTAAAGATG CTTTAATGAA AAGTATTAAG AAAATATATA 
CTTCAGAAAT ' CCATAT ATTT GTCATATTTA TTTTTTTAGA 
AGATGGTATT TAAAATGAAT- GCCCAAAAAT ATCTTGTACC 
TTGGAAGCCQ CCAGCCATTC ATGTAGAGAG TTTATAAGAA 
ATTTTATATT ACTATGGTAT. CTGTGTACCA TATTTCTAAG" 
CTTCTTAAAA CCATAACCTG GCTTGCCTTT TAGTGTTAAA' 
ATAGAGATTC TTCTTTTATG AAGAAGAGCT GACG TAAT TT 
AAGACATTAA' CATAAGTCTC TGAGCAGTGA TACATTTTCA 
CCACATTAAA CAACCACGGC AACACTCAGA CTTGGCACTT 
TGTGCCTGGT ATCGCCTCTG GCATAACTTA 1 CACGAATCGT 
CCTTCATCAA GCACTTGCCA ACACATTCAC CTCTAACTTG 
ACAACATCTG CAACTCTA.CC CTATCAACTG CCAACCTAAA 
CCCCAAACAC AAAACCACTA AATCATAACC ACCACACACG 
CACACAACCA ACACACCACG ACCAAACACC CCACCACAAA 
GACAACACAT CACATACACT CACTACCCCC CCATACTCCC 

Seq ID NO: 234 DHA sequence 

Nucleic Acid AcceBsion #: Eos sequence 

Coding sequence: 27-281 



PCT/US02/12476 



AAATAACGTA 
GTGAATTACA 
AACTTCAGTG 
GATTTGTATG 
AACCTCCTAA 
TTTGTCCAAA 
AATAATTTAA 
TATTCATTAT 
CACAAAATCC 
ATTACCAGTG 
AACATGAAGA' 
TCCTACGAAT' 
CCTCCCTACT 
TACAACCTTA: 
GACCCCCAAC 
CCACACACCA' 
CAAGCTAACA 
ACCCACCA 



TTTTGCTTTT 
CAGTTCTAAT 
CCAGAAGTGC 
TCAGTTTATA 
TTGGATAACT- 
AGTTTATCTG 
AATTGTATGC 
TAAATTGGTA 
AACATTGTAT 
CATCTGCACA 
GTGACAACCA 
CCATCCTATA - 
TGTCTACGCT " 
CCAACTCACC- 
ACAACACAAC 
CACACCCACC 
ACCACAAACA . 



25 | 



AGCAGGAGGA 
GTCTACCGCG 
TTCCTGCCCC 
GGCGCTCACT 
CAGGGTTCCG 
GAGTTTTTCT 
CAGAAAGAAT. 
ATGTTGCAGG 
TTTTCTTCTC 
ATAAAAACTG 
GATGCCAGGA 
TCAAGCCAAG 
CTCAAACCCG 
GGTCAGAACA 
AGACAGCCTG 
. TCTCAGTATC 
CATAAACACA 
GGAAAGGTCT 
TCTTTTGATG 
GCAAAGAAAA 
TTTATTTTTA 
TTTGACCTTG 
GATTAAAACA 
AAATCTCAAG 
AAAAAAAAGC 
GCACTTCTCT 
TCCCATCAAA 
ATAATCCC 



11 

i: 

GAGCTGGOGG 
TAGCAGTTAC 
GTGCTCATTT 
GAAACAGTGT 
ACCAATCCAA- 
TTGCTCTGAT . 
GCAAGGAGAT 
AAGAGCTAGT " 
CACTTGGCAT 
TTCAGCGGTT 
GGAAAGATGC. 
CCAACAGGTG 
GGGAAGCCCA 
CAGCTAAGCA- 
TGACGTTTCA" 
TTACGCCCAG 
TAACAGCAGC 
CCTGTGACTG 
ACTCTATATC 
ATAAAAGACA 
AACTGAATTC 
AAATAATCTT 
TATTAGTAAT 
GCTTTTAAAG 
CCTCCATCTG 
TCTCATTTTC 
GCCAAAGAAA 



21 
I 

GAAGACATGC 
ATCAGACTGA 
GGGGCTGACG 
GTTGCTCCAC 
GAGCCTTGCA 
CTTGGAGACA 
AGACCAACGT, 
CTTTCAGGCT 
ATCAAGAGCC 
CGCCAACAAG 
CAGGGGTAAA- 
TTCTGTTTTT* 
CTCTAGAACC 
GATGGCTTGG 
AAAGCAAAAG 
TGACACGATC 
AGCAATAATT 
TTTTATTTTT 
CAACTCTGAG 
ATTTCCAGTA 
AGCAGAGATT 
TACATTGTAA 
TAATTATTAA" 
CATTTGTACA 
ATTCTCATTT 
CACTGTCTCG 
GAAAAGAAAA 



31 
I 

ACCCCTTGAA 
GACACTTCCT 
CCATTTTAGG 
ACCGCCTTGT 
GAAAGCATTA' 
TCCCTCTGCC 
GAGATTCTCC 
GGGCTGGTGA 
AGGCGTGGAA 
AAGTGGTAAA 
GTGGGAAAAT 
CATCACAGAA 
CATGCTGGTC 
GTCATCAGGA- 
TCCCCTACCA 
TACCCTCAAA 
AAAGATGAGA 
AGGGAAACAG 
GTTTGATTAA 
AGTATGCCAG 
TACATGCATT 
ATTCTTAATG 
AGGAGAATAA 
AATGACTGGA 
TCATTGTCAG 
CAAGCTAGAA 
TTGTTCTGTA 



41 

I 

GACCCAGAGA 
GTTTACAGGA 
CCTCAGCCCA 
TTTGCTTGTT 
ACGTGCTTTT 
TAGTGGAAAC 
TTCATGCACT 
CCTGAGAAAG 
GACTAAAACA 
GTAGCAAAAA 
GGGAACCTGA 
CTAATAAGTG 
ATCCATATCC 
CGTCCATTAC 
GCCAGTGAAG 
ACTTAAAAAA 
TGAGAACAAT 
AGAGGAAGAA 
AGAAATGACC 
TTCGAATTAA 
ACGATGATTA 
ATCAAAACAA 
TTGCAAATAC 
CATTTTTTAA 
TGCAACAACA 
ATTCTCACGA 
CAGATATATG 



51 
I 

GAGGCCGTCT 
GACTATAAAA 
TCTGCACCCA 
GGCGCGCTCT 
CTCTTTGGCA 
ATAAGGAATA 
CAAGAGAAAG 
AATGTCCAGC. 
GGAAATGTTT 
TGGGGATGGA 
AGCCAGGAGG : 
GTGCTGAGGA 
CCAAGGCCCT 
ATCCAAAGGA 
CTACCTGATT- 
AAAAGGGAAA-. 
TAAGAAAAAA' 
GAATGATTTT 
TTGAACCACA 
TGATTTACTT 
ACATCTGAAA- 
GGTTCTCAGT 
AACATTCCTA 
ATTTGAAAAA 
AAAAAGGTAT. 
CTACCTTTGA 
ACATTAAAAA 



Seq ID NO: 235 Protein sequence: 
Protein Accession 8: Eos sequence 



1 11 21 31 41 51 

.MHPLKTQREA VCLPRSSYIR LRHFLFTGDY KIPAPCSFGA DAILGLSPSA PRRSLKQCVA 
PHRLVLLVGA LSGFRPIQEP CRKH 

Seq ID NO: 236 DMA sequence 
Nucleic Acid Accession ft: NM__002075 
Coding sequence: 406. .1428 



CCACAATAGG 
ACAGGATCAG 
AGTCCTTTCE 
ACCACCCTGA 
GGCCAGGCCA 
CGTCGCAGCT 
CCAGCCAGAG 
CAACTGCGTC 
GCTGACGTTA 
CGGACGCGGC 
GATTCTAAGC 
ACCACCAACA 
GCCCCATCAG 
CTCAAATCCC 
CTCTCCTGCT 
TGTGCCTTGT 
GACTGCATGA 
GCCAGTGCCA 



11 

I 

GGCAGACCTG 
ACCCAGAGGC 
AATCTCAGCT 
GCTGAGGAGC 
GGCCAGCTCC 
GAGGGAGTAA 
CCCAAGAGCC 
AGGAAGCGGA- 
CTCTGGCAGA 
GGACGTTAAG 
TGCTGGTAAG 
AGGTGCACGC 
GGAACTTTGT 
GTGAGGGCAA 
GCCGCTTCCT 
GGGACATTGA 
GCCTGGCTGT 
AGCTCTGGGA 



21 
I 

TCCATCCTTC 
AGCTGGTTGG 
CCTGCCTGTA 
ACAGTTTGAG 
TCTGGCAGCA 
GGAGGCTCCC 
AGAGTGACCC 
GCAGCTCAAG 
GCTGGTGTCT 
GGGACACCTG 
TGCCTCGCAA 
CATCCCACTG 
GGCATGTGGG 
TGTCAAGGTC 
GGATGACAAC 
GACTGGGCAG 
GTCTCCTGAC 
TGTGCGAGAG 



31 
I 

TCTGTGGGTC 
GGTTTGTCGA 
CCCTCCCATA 
GCCCCCCCAA 
GAGCCTGGGC 
AGGAACCGGA 
CTCGACCTGT 
AAGCAGATTG 
GGCCTAGAGG 
GCCAAGATTT 
GATGGGAAGC 
CGCTCCTCCT 
GGGCTGGACA 
AGCCGGGAGC 
AATATTGTGA 
CAGAAGACTG 
TTCAATCTCT 
GGGACCTGCC 



41 
I 

CCCTGTACCT 

gaagaaggat 
ctcaccaaac 
ccccccgccg 
aggtgacggg 
gctggaaacc 
cagccatggg 
cagatgccag 
tggtgggacg 

ACGCCATGCA 
TGATCGTGTG 
GGGTCATGAC 
ACATGTGTTC 
TTTCTGCTCA 
CCAGCTCGGG 
TATTTGTGGG 
TCATTTCGGG 
GTCAGACTTT 



51 
I 

TTCTCCCCCA 
TATCCAGATC 
CCTCTTCCCC 
GTCGGGGCCA 
CGGGCGCGGG 
CGGCCGAGGT 
GGAGATGGAG 
GAAAGCCTGT 
AGTCCAGATG 
CTGGGCCACT 
GGACAGCTAC 
CTGTGCCTAT 
CATCTACAAC 
CACAGGTTAT 
GGACACCACG 
ACACACGGGT 
GGCCTGTGAT 
CACTGGCCAC 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 



60 
120 
180 
240 
.300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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10 
15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 

85 



GAGTCGGACA 
GATGACGCTT 
CACGAGAGCA 
TTCGCTGGCT 
GGCATCCTCT 
GCTGTGGCCA 
AAGGGAAGTG 
GGTGTTCTCT 
GGGAGCATGG 
CCATCTCCTC 
GGACAACCTG 
GCCCTAGGAT 
TTGGCCCTGT 
TTCTCCTTTT 
GT 



TCAACGCCAT. 
CCTGCCGCTT 
TCATCTGCGG: 
ACGACGACTT 
CTGGCCACGA 
CAGGTTCCTG * 
GAAGGCAGTG 
TCTATATTCC 
GACTGTGCCT 
CCATGGCCTT 
CCCCTCCCCA 
TCCTCCCCCA- 
GACTATGGCT 
TCTACCTTTT 



CTGTTTCTTC 
GTTTGACCTG 
CATCACGTCC 
CAACTGCAAT 
TAACAGGGTG 
GGACAGCTTC 
AACACACTCA 
GGGTGCCATT 
TTGGGAGGCA 
CCCTCCCCAC 
GCCCTTTGCA 
GAGCCACTAC 
CTGGCACCAC 
TTTCTCTCCT 



CCCAATGGAG 
CGGGCAGACC 
GTGGCCTTCT 
GTCTGGGACT 
AGCTGCCTGG 
CTCAAAATCT 
GCAGCCCCCT 
CCCACTAAGC 
GCATCAGGGA 
AGTCCTCACA 
GGCCCAGCAG 
CTTTGTCCAG 
TAGGGTCCTG 
AAGACACCTG 



AGGCCATCTG 
AGGAGCTGAT 
CCCTCAGTGG. 
CCATGAAGTC 
GAGTCACAGC 
GGAACTGAGG- 
GCCCGACCCC 
TTTCTCCTTT 
CACAGGGGCA 
GCCTCTCCCr 
ACTTGAGTCT: 
GCCTGGGTGG 
GCCCTCTTCT 
CAATAAAGTG 



CACGGGCTCG 

CT6CTTCTGC 

CCGCCTACTA 

TGAGOGTGTG 

TGACGGGATG 

AGGCTGGAGA 

ATCTCATTCA 

GAGGGCAGTG 

AAGAACTGCC ' 

TAATGAGCAA 

GAGGCCCCAG • 

TATAGGGCGT 

TATTCATGCT 

TAGCACCCTG 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



Seq IDNO: 237 Protein sequence: 
Protein Accession fl : NP_002066 



1 

! 

MGEMEQLRQE 
KHWATDSKLL 
CSIYNLKSRE 
VGHTGDCMSL 
ICTGSDDASC 
KSERVGILSG 



11 
I 

AEQLKKQIAD 
VSASQDGKLI 
GNVKVSRELS 
AVSPDFNLFI 
RLFDLRADQE 
HDNRVSCLGV 



21 
I 

ARKACADVTL 
VWDSYTTNKV 
AHTGYLSCCR 
SGACDASAKL 
LICFSHESII 
TADGMAVATG 



31 



41 



51 



AELVSGLEW GRVQMRTRRT LRGHLAKIYA 
KAIPLRSSWV MTCAYAPSGN FVACGGLDNM 
FLDDNNIVTS SGDTTCAIiWD IETGQQKTVF 
WDVREGTCRQ TFTGHSSDIN AICFPPNGEA 
CGITSVAFSL SGRLLFAGYD DFNCNVWDSM 
SWDSFIiKIWN- 



Seq ID NO: 238 DNA sequence 

Nucleic Acid Accession ft: CAT cluster 



1 

I 

TCCCAATGTG 
TACCATTTGC 
ACTGATTCTG 
TGCATTGACC 
TAAGTGACTG 
TTTTG ACGAG 
CAAAAAGGGG 
AAGAAAGAAA 
GGATCAAGAA 
CAGAAGTGAA' 
TAGAAAAGTT 
CAACTACTCA 
CTTGTGTTCC 
ACTGTTGTTT. 
GACATGAAAG. 
TAGGCTAAGT 
AAAATCCCAA 



11 

I 

TNGAACCTAC 
TTTTAAGGCA 
TGAATTATGA 
AGTGTGAAGC 
GAAAGCTGAA 
TATCGGGTGA 
AAAAAAAAGA 
AATAAAATAC 
GTTTGTGTAC 
TCAAAATATT. 
TTTCTGTAAA 
ACTTTCCTAC 
AATAAAGCTT 
GCCAAGTCCT 
TTCATTGGGT 
TATAATACAC 
ATAAAA 



21 
I 

CATAAATTCT 
GATAATCCTC 
AATCTGAAAA 
ACAGTGGAAT 
GAATCACCGG 
CTTTGAGGTG 
GCAACCAAAG 
ACAATATGGA 
ACATAATCTC 
TCAAAATGCT 
AGTCAGATAG. 
TGTAGCACAA 
CATTTACAAA 
AATATAGTTG 
TGCTAAAAAG 
TGTTTTAACA 



31 

I 

TTTCTTAQIG 
CAAGTTTTCT 
GGAATTGGAA 
GAGAATGCGT 
CTTCAGTGAC 
GTCAAGAAAC 
AAAAAAAATC 
CGATGGAGAA 
ATTTTGAGAT 
GTCTTATGAA 
TAAATATTTT 
GAGTAGCTGT 
AACATGCCAT 
CTTAGCAAGT 
TATGTAGAAA 
ATTGTAAAAT 



41 
I 

GACAATCTTA 
AATGATATCT 
GTTGCTAAAA' 
GCCCTGACAC 
ATGGAACCCA" 
CACACTTTAA 
CATAAAATTG 
AAACAGTTAC 
ATATAACTAT 
ACTACAATAT^ 
AGGTTTTGCA 7 
GGTACTGTGC 
GGGCCATATT 
ATTGTGAGCT 
TTCAAAGGAA 
GTAAGAGAAA 



51 
I 

TNCTAANCAA 

GAAACTATTA- 

ATCTATCATT 

CAAAGAAAAA 

GTGATTTGAT 

GAACAATGTC 

CACAGAAGAA. 

ATTTCTTTAT 

TTTTGTCTTT 

TCTCACAGAT 

GTGTCTTTTG 

AAATAAATTG 

TGGCCTGTAC 

ATTTGAGGAA- 

AATTAAAATT 

TTTACAAATA 



Seq ID NO: 239 DNA sequence 

Nucleic Acid Accession Qi NM_0017B6.1 

Coding sequence: 130-1023 



1 
I 

GGGGGGGGGG, 
GTTGTTGTAG 
TGACTAACTA 
GTGTATAAGG 
GAAAGTGAAG 
CTTCGTCATC 
CTCATCTTTG 
CAGTACATGG 
TTTTGTCACT 
GACAAAGGAA 
AGAGTATATA 
TCAGCTCGTT 
GCAACTAAGA 
AGAGCTTTGG 
AAGAATACAT 
GAAAATGGCT 
GGCAAAATGG 
TAGCTTTCTG 
AACTCTTGTC 
GTCTTCTAAT 
ATTCTGTAAA 



11 
I 

GGCACTTGGC 
CTGCCGCTGC 
TGGAAG ATT A 
GTAGACACAA 
AGGAAGGGGT 
CAAATATAGT 
AGTTTCTTTC 
ATTCTTCACT 
CTAGAAGAGT 
CAATTAAACT 
CACATGAGGT 
ACTCAACTCC 
AACCACTTTT 
GCACTCCCAA 
TTCCCAAATG 
TGGATTTGCT 
CACTGAATCA 
ACAAAAAGTT 
TATTTTTGTC 
TTCAAAAATA 
TGTGAAAAAA 



21 

I . 

TTCAAAGCTG 
GGCCGCCGCG 
TACCAAAATA 
AACTACAGGT 
TCCTAGTACT 
CAGTCTTCAG 
CATGGATCTG 
TGTTAAGAGT 
TCTTCACAGA 
GGCTGATTTT 
AGTAACACTC 
AGTTGACATT 
CCATGGGGAT 
TAATGAAGTG 
GAAACCAGGA 
CTCGAAAATG 
TCCATATTTT 
TCCATATGTT, 
• TTATATATAT 
TAACTTAAAA 
AAAAAAAAAA 



31 

I 

GCTCTTGGAA 
GAATAATAAG 
GAGAAAATTG 
CAAGTGGTAG 
GCAATTCGGG 
GATGTGCTTA 
AAGAAATACT 
TATTTATACC 
GACTTAAAAC 
GGCCTTGCCA 
TGGTACAGAT 
TGGAGTATAG 
TCAGAAATTG 
TGGCCAGAAG 
AGCCTAGCAT 
TTAATCTATG 
AATGATTTGG 
ATGTCAACAG 
TTCTTTGTTA 
ATGTAAATAT 
AAAAA 



41 • 
i 

ATTGAGCGGA 
CCGGGATCTA 
GAGAAGGTAC 
CCATGAAAAA 
AAATTTCTCT 
TGCAGGATTC 
TGGATTCTAT 
AAATCCTACA 
CTCAAAATCT 
GAGCTTTTGG 
CTCCAGAAGT 
GCACCATATT 
ATCAACTCTT 
TGGAATCTTT 
CCCATGTCAA 
ATCCAGCCAA- 
ACAATCAGAT 
ATAGTTGTGT 
TCAAACTTCA 
TCTATATGAA 



51 
I 

GAGCGACGCG 
CCATACCCAT. 
CTATGGAGTT 
AATCAGACTA 
ATTAAAGGAA 
CAGGTTATAT 
CCCTCCTGGT 
GGGGATTGTG 
CTTGATTGAT- 
AATACCTATC 
ATTGCTGGGG 
TGCTGAACTA 
CAGGATTTTC 
ACAGGACTAT 
AAACTTGGAT 
ACGAATTTCT 
TAAGAAGATG 
TTTTATTGTT 
GCTGTACTTC 
TTTAAATATA 



Seq ID NO: 240 Protein sequences 
Protein Accession 6: NPjp01777.1 



11 



21 31 |1 51 

MEDYTKIEKI GEGTYGVVYK ' GRHKTTGQVV .AMKKIRIiESE EEGVPSTAIR BISLLKBLRH 
PNIVSLQDVL MQDSRLYLIF EFLSMDLKKY LDSIPPGQYM DSSLVKSYLY QILQGIVFCH 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



60 
120 
160 
240 
300 
360 
420 
480 . 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



60 
120 



280 



WO 02/086443 .... " 

SRRVLHRDLK PQNLIiIDDKG TIKLADFGIiA RAFGIPIRVY THEWTliWYK SPEVLLGSAR 180 

YSTPVDIWSI GTIFAEliATK KPLFHGDSEI DQLFRIFRAL GTPNNEVWPE VESLQDYKNT .240. 
FPKWKPGSLA SHVKNLDENG LDLLSKMLIY DPAKRISGKM ALNHPYFNDL -DNQIKKM 

^Seq ID NO: 241 UNA sequence 
Nucleic Acid Accession g: NM_033379.1 
Coding 'sequence: 132-854 - .. 

1 ; .11,. - *21. "31 41 51 

CGCCCGCGCG CGGGCTCAAC TTTGTAGAGC GAGGGGCCAA CTTGGCAGAG CGCGCGGCCA 60 

GCTTTGCAGA GAGCGCCCTC CAGGGACTAT GCGTGCGGGG ACACGGGATC TACCCATACC 120 

ATTGACTAAC TATGGAAGAT TATACCAAAA TAGAGAAAAT TGGAGAAGGT ACCTATGGAG 180 

TTGTGTATAA GGGTAGACAC AAAACTACAG " GTCAAGTGGT AGCCATGAAA AAAATCAGAC 240 

TAGAAAGTGA AGAGGAAGGG GTTCCTAGTA CTGCAATTCG GGAAATTTCT CTATT AAAGG -, 300 

AACTTCGTCA TCCAAATATA GTCAGTCTTC AGGATGTGCT TATGCAGGAT TCCAGGTTAT. 360 

ATCTCATCTT TGAGTTTCTT TCCATGGATC TGAAGAAATA CTTGGATTCT ATCCCTCCTG" . 420 

GTCAGTACAT GGATTCTTCA CTTGTTAAGG TAGTAACACT CTGGTACAGA TCTCCAGAAG 480 

TATTGCTGGG GTCAGCTCGT> TACTCAACTC CAGTTGACAT TTGGAGTATA GGCACCATAT . 540 

TTGCTGAACT AG CAACTAAG • AAACCACTTT * TCCATGGGGA TTCAGAAATT GATCAACTCT 600 

TCAGGAT TTT- CAGAGCTTTG GGCACTCCCA- ATAATGAAGT GTGGCCAGAA- GTGGAATCTT 660 

TACAGGACTA- TAAGAATACA- TTTCCCAAAT GGAAACCAGG AAGCCTAGCA TCCCATGTCA- 720 

AAAACTTGGA TGAAAATGGC TTGGATTTGC TCTCGAAAAT GTTAATCTAT' GATCCAGCCA- 780 

AACGAATTTC : TGGCAAAATG GCACTGAATC ATCCATATTT TAATGATTTG GACAATCAGA 840 

TTAAGAAGAT GTAGCTTTCT GACAAAAAGT TTCCATATGT TATGTCAACA GATAGTTGTGr 900 

TTTTTATTGT TAACTCTTGT- CTATTTTTGT CTTATATATA TTTCTTTGTT ATCAAACTTC 960 

AGCTGTACTT CGTCTTCTAA' TTTCAAAAAT ATAACTTAAA AATGTAAATA TTCTATATGA 1020 
ATTTAAATAT AATTCTGTAA ATGTGAAAAA AAAAAAAAAA AAAAAA 

Seq ID NO: 242 Protein sequence: 
Protein Accession #: NP_203698.1 

1 11 ,' 21 ; 31 41 51 

MEDYTKIEKI. GEGTYGWYK GRHKTTGQW AMKKIRLESE EEGVPSTAIR EISLLKELRH 60 

PNIVSLQDVL MQDSRLYLIF EFLSMDLKKY LDSIPPGQYM DSSLVKWTL WYRSPEVLLG- 120 

SARYSTPVDI WSIGTIFAEL ATKKPLFHGD -SEIDQbFRI F RALGTPNNEV WPEVESLQDY 180 
KNTFPKWKPG: SLASHVKNLD, ENGLDLLSKM" LIYDPAKRIS GKMALNHPYF NDLDNQIKKM 

Seq ID NO: 243 DNA sequence . 
Nucleic Acid Accession #: AF101051.1 
Coding sequence: 221- BS6 

1 il .21 31 41 51 

GAGCAACCTC AGCTTCTAGT ATCCAGACTC CAGCGCCGCC CCGGGCGCGG ACCCCAACCC 60 

CGACCCAGAG CTTCTCCAGC GGCGGCGCAG CGAGCAGGGC TCCCCGCCTT AACTTCCTCC 120 

GCGGGGCCCA GCCACCTTCG GGAGTCCGGG TTGCCCACCT GCAAACTCTC CGCCTTCTGC 180 

ACCTGCCACC CCTGAGCCAG CGCGGGCGCC CGAGCGAGTC ATGGCCAACG CGGGGCTGCA 240 

GCTGTTGGGC TTCATTCTCG CCTTCCTGGG ATGGATCGGC GCCATCGTCA GCACTGCCCT 300 

GCCCCAGTGG AGGATTTACT CCTATGCCGG CGACAACATC GTGACCGCCC AGGCCATGTA 360 

CGAGGGGCTG TGGATGTCCT GCGTGTCGCA GAGCACCGGG CAGATCCAGT GCAAAGTCTT 420 

TGACTCCTTG CTGAATCTGA GCAGCACATT GCAAGCAACC CGTGCCTTGA TGGTGGTTGG 480 

CATCCTCCTG GGAGTGATAG CAATCTTTGT- GGCCACCGTT- GGCATGAAGT GTATGAAGTG 540 

CTTGGAAGAC GATGAGGTGC AGAAGATGAG ; GATGGCTGTC ATTGGGGGTG CGATATTTCT 600 



TCTTGCAGGT CTGGCTATTT 
ATTCTATGAC CCTATGACCC 
TGGCTGGGCT GCTGCTTCTC 
CCGAAAAACA ACCTCTTACC 



TAGTTGCCAC AGCATGGTAT GGCAATAGAA TCGTTCAAGA 660 

CAGTCAATGC CAGGTACGAA TTTGGTCAGG CTCTCTTCAC 720 

TCTGCCTTCT GGGAGGTGCC CTACTTTGCT GTTCCTGTCC . 780 

CAACACCAAG GCCCTATCCA AAACCTGCAC CTTCCAGCGG 840 

gaaagXctac GTGTGACACA GAGGCAAAAG GAGAAAATCA TGTTGAAACA AACCGAAAAT 90 0 

GGACATTGAG ATACTATCAT TAACATTAGG. ACCTTAGAAT TTTGGGTATT GTAATCTGAA 960 

GTATGGTATT ACAAAACAAA CAAACAAACA AAAAACCCAT GTGTTAAAAT ACTCAGTGCT 1020 

AAACATGGCT TAATCTTATT TTATCTTCTT TCCTCAATAT AGGAGGGAAG ATTTTACCAT 1080 

TTGTATTACT GCTTCCCATT GAGTAATCAT ACTCAAATGG GGGAAGGGGT GCTCCTTAAA 1140 

TATATATAGA TATGTATATA ' TACATGTTTT TCTATTAAAA ATAGACAGTA AAA TACT ATT 1200 

CTCATTATGT TGATACTAGC ATACTTAAAA TATCTCTAAA ATAGGTAAAT GTATTTAATT 1260 

CCATATTGAT. GAAGATGTTT- ATTGGTATAT TTTCTTTTTC GTCCTTATAT ACATATGTAA 1320. 

CAGTCAAATA- TCATTTACTC TTCTTCATTA GCTTTGGGTG CCTTTGCCAC AAGACCTAGC . 1380 

CTAATTTACC AAGGATGAAT TCTTTCAATT CTTCATGCGT GCCCTTTTCA TATACTTATT 1440 

TTATTTTTTA CCATAATCTT ATAGCACTTG CATCGTTATT AAGCCCTTAT TTGTTTTGTG 1500 

TTTCATTGGT . CTCTATCTCC TGAATCTAAC ACATTTCATA GCCTACATTT TAGTTTCTAA 1560 

AGCCAAGAAG AATTTATTAC AAATCAGAAC TTTGGAGGCA AATCTTTCTG CATGACCAAA 1620 

GTGATAAATT CCTGTTGACC TTCCCACACA ATCCCTGTAC TCTGACCCAT AGCACTCTTG 1680 

TTTGCTTTGA AAATATTTGT CCAATTGAGT AGCTGCATGC TGTTCCCCCA GGTGTTGTAA 1740 

CACAACTTTA TTGATTGAAT TTTTAAGCTA CTTATTCATA GTTTTATATC CCCCTAAACT 1800 

ACCTTTTTGT TCCCCATTCC TTAATTGTAT TGTTTTCCCA AGTGTAATTA TCATGCGTTT 1860 

TATATCTTCC TAATAAGGTG TGGTCTGTTT GTCTGAACAA AGTGCTAGAC TTTCTGGAGT 1920 

GATAATCTGG TGACAAATAT TCTCTCTGTA GCTGTAAGCA AGTCACTTAA TCTTTCTACC 1980 

TCTTTTTTCT ATCTGCCAAA TTGAGATAAT GATACTTAAC CAGTTAGAAG AGGTAGTGTG 2040 

AATATTAATT AGTTTATATT ACTCTCATTC TTTGAACATG AACTATGCCT ATGTAGTGTC 2100 

TTTATTTGCT CAGCTGGCTG AGACACTGAA GAAGTCACTG AACAAAACCT ACACACGTAC .2160 

CTTCATGTGA TTCACTGCCT TCCTCTCTCT ACCAGTCTAT TTCCACTGAA CAAAACCTAC 2220 

ACACATACCT TCATGTGGTT CAGTGCCTTC CTCTCTCTAC CAGTCTATTT ; CCACTGAACA 2280 

AAACCTACGC ACATACCTTC ATGTGGCTCA GTGCCTTCCT CTCTCTACCA GTCTATTTCC 2340 

ATTCTTTCAG CTGTGTCTGA -CATGTTTGTG CTCTGTTCCA TTTTAACAAC TGCTCTTACT 2400 

TTTCCAGTCT GTACAGAATG CTATTTCACT TGAGCAAGAT GATGTATGGA AAGGGTGTTG 2460 



281 



WO 02/086443 

GCACTGGTGT CTGGAGACCT GGATTTGAGT CTTGOTGCTA TCAATCACCG TCTGTGTTTG' 2520 
AGCAAGGCAT TTGGCTGCTG TAAGCTTATT GCTTCATCTG TAAGCGGTGG TTTGTAATTC - 2580 
CIGATCTTCC CACCTCACAG TGATGTTGTG GGGATCCAGT GAGATAGAAT. ACATGTAAGT 2640 
GTGGTTTTGT AATTTGAAAA GTGCTATACT AAGGGAAAGA ATTGAGGAAT TAACTCCATA 2700 
CGTTTTGGTG TTGCTTTTCA AATGTTTGAA AATAAAAAAA TGTTAAGAAA TGGGTTTCTT 2760 
GCCTTAACCA GTCTCTCAAG TGATGAGACA' GTGAAGTAAA ATTGAGTGCA CTAAACGAAT . 2820 
AAGATTCTGA GGAAGTCTTA TCTTCTGCAG, TGAGTATGGC CCAATGCTTT CTGTGGCTAA * 2880 
ACAGATGTAA TGGGAAGAAA TAAAAGCCTA CGTGTTGGTA AATCCAACAG CAAGGGAGAT. 2940 
TTTTGAATCA TAATAACTCA TAAGGTGCTA TCTGTTCAGT GATGCCCTCA GAGCTCTTGC .3000 
TGTTAGCTGG CAGCTGACGC TGCTAGGATA GTTAGTTTGG AAATGGTACT TCATAATAAA 3060 
CTACACAAGG AAAGTCAGCC ■ ACCGTGTCTT ATGAGGAATT GGACCTAATA" AATTTTAGTG 3120 
TGCCTTCCAA' ACCTGAGAAT- ' ATATGCTTTT GGAAGTTAAA ATTTAAATGG CTTTTGCCAC - 3180 
ATACATAGAT: CTTCATGATG TGTGAGTGTA ATTCCATGTG GATATCAGTT ACCAAACATT 3240 
ACAAAAAAAT TTTATGGCCC AAAATG ACCA • ACGAAATTGT TACAATAGAA : - TTTA TCCAAT 3300 
TTTGATCTTT TTATATTCTT- CTACCACACC TGGAAACAGA CCAAT AGAC A TTTTGGGGTT 3360 
TTATAATGGG AATTTGTATA AAGCATTACT CTTTTTCAAT AAATTGTTTT TTAATTTAAA 3420 
AAAAGGAAAA- AAAAAAAAAAr AAA 



Seq ID MO: 244 Protein sequence: 
Protein Accession "&: AAD16433.1 

1 ll 21 31 41 51 

MANAGLQLLG FILAFLGWIG AIVSTALPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 60 

QIQCKVFDSL LNLSSTLQAT-RALMWGILI* GVIAIFVATV, GMKCMKCLED; DEVQKMRMAV. 120 

IGGAIFLLAG LAILVATAWY GNRIVQEFYD PMTPVNARYB FGQALFTGWA AASLCLLGGA 180 
LLCCSCPRKT TSYPTPRPYP KPAPSSGKDY V 

Seq ID NO: 245 DNA sequence 

Nucleic Acid Accession #: CAT cluster 

1 ii 21 31 41 ' 51 

I I I I i > 

'ri T l T lTi T T r rmv A T T T . TTTTTCAAGG AGAGCACAAG GAACTTTATT AATGACTTTC 60 

TTAATGGTTA AATGCTGTTT ACCAAGTGAC CCAGAGGCAG CG TGGT TTAG TGGTTTCAAC 120 

AGCATGGTCC CGAGAGTCTG ACAAACCTCA GTTCAAATCC TTCTTTTGTC TTCACTTAGT 180 

TTTTCTTCCT. GAGATTTAGT TTCTTCATCG TTAACAATGA; GGATATTAAT ATGTTTCACA 240 

. CAGTTGTTAT GAAGAATGCA' TATATTAGAA- TGCCTGTAGT > CTCAGCTACT CAGGAGGCTA 300 

AGGTGGGGAG GTCGCTCAAG CCCAGGAATT . CAAAGCTGCA ATGCATTATG ATTACAGCTG - 3 60 

TTAATAGCCA ■ CTGCACTTCA GCCTGGGCAA TGTAGTAAGA TCCCATCTCT GGCTCGGAGG 420 
GTCCTACGCC CACGGAGTCT CGCTGATTGC TAGCACAGCA GTCTGAGATC AAACTGCA 

Seq ID NO: 246 DNA sequence 

Nucleic Acid Accession #: XM_0S8553.2 

Coding sequence: 897-1400 

X 11 21 31 41 51 

AATTTTCAGA AGTTTCGTAT GGGGATGGTT TTATATAAAT TCAGGTTTTT CCCACAATAA 60 

TAAATGTATT TAGTCTCAGT GCTCAATAGA AGAGATTTCT AATAGAAAAG GATTCAAACT 120 

GTGAAACCAT TTCTCTTTTA ATGTTTCACA TTCCTGTTAC AGATTTGTTC TCTTGTGACT 180 

CTGTTATCCA TAATATGGAC AGTTCTTGAG TCCTAACATT GAGA GGTT TT CCCTTAGTGC 240 

ATAGAGGGAA TGAGTATTAA TTGGAGAAGC TTAAAGTATT GCCACTTTAG CACTGAAGAT 300 

TGGGATGAGA- GGAGGTGAAA CCTCACTAGA AAAAGGGACA ATGTTAGTGT GGCCCTTCCT: 360 

GATCATGTTT AAGAAAAGTC ATGAAAATGG ' TGAACTAGTG TTTCCAAGCA TATTGGAAGG 420 

GTTGAGTGTA TACTGTCTGT CAAAGACTTC CAGCATTTCC AGGTCCTAGA GAGGAACAAG. 480 

ACTGGTAACC TGCCTATCTG TATTTTTAAG. AACCCAGGAG GAAAGCTTTA TAATAGAACA 540 

TTATTTCTGT GTTTATGTAT. AAGGGGTTTT TTGTTTTTTT AAAGACAGGA TCTCACTCCA 600 

TTGTCCAGGC CAAGTGCAAT. GGCACGAACC TCATAGCTCC TGGACTTAAG TGATCTGCCT 660 

GCCTTTGCCT CCTGAGTAGC TGGGACTACA GGCATGAGCC CCCATGCCTG GCTAAGTTTGi" 720 

XTTTTTTGTT- TGTTTGTTTG TTTGTTTTTG GGGGGGGTTG TTTTGTTTTT TGTAGAGACG- 780 

TAGTCTTGCT TTGTTGCCAG GCTAGTCTCA AACTCCTGGC TTCAAGTGAT CCTCCTGCCT 840 

CAGCCTCCCA GAGTGCTAGG ATTACAGCAC TTGGATTCAG CTTCTTCATT^ TCCAACATGG 900 

AAGAAACTTA CACCGACTCC CTGGACCCTG AGAAGCTATT GCAATGCCCC TATGACAAAA 960 

ACCATCAAAT CAGGGCTTGC AGGTTTCCTT " ATCATCTTAT CAAGTGCAGA- AAGAATCATC 1020 

CTGATGTTGC AAGCAAATTG GCTACTTGTC CCTTCAATGC" TCGCCACCAG" GTTCCTCGAG 1080 

CTGAAATTAG TCATCATATC TCAAG CTGTG ATGACAGAAG TTGTATTGAG CAAGATGTTG 1140 

TCAACCAAAC CAGGAGCCTT ' AGACAAGAGA CTCTGGCTGA GAGCACTTGG CAGTGCCCTC 1200 

CTTGCGATGA AGACTGGGAT AAAGATTTGT GGGAGCAGAC CAGCACCCCA TTTGTCTGGG 1260 

GCACAACTCA* CTACTCTGAC AACAACAGCC CTGCGAGCAA CATAGTTACA GAACATAAGA- 1320 

ATAACCTGGC TTCAGGCATG CGAGTTCCCA AATCTCTGCC GTATGTTCTG. CCATGGAAAA 13 80 

ACAATGGAAA TGCACAGTAA * CTG AATACCT ATCTCATCAA ATGCCAGACC CTAGAAGACT 1440 

. GTTGCTTCTT CTTCTACCAG. TGGGTTCTCA TTTTCCTCCT AATCTAATTA TAGAATGGTA" 1500 

AACTCCCTGT GACTTTCCAA' ACTGACAAGC ACACTTTTTT CCTCCCCCCT TGAATCCTCA 1560 
TTTAATGCAA GAACCCTCAT ACTCAGAAGC TTCCAAATAA ACCTTTGATA CAGATTG 

Seq ID NO: 247 Protein sequence: 
Protein Accession #: XP_058553.1 

1 - ii 21 - 31 41 51 

I I -' I I I I 

MEETYTDSLD PEKLLQCPYD KNHQIRACRP PYHLIKCRKN HPDVASKLAT CPFNARHQVP 60 

RAEISHHISS CDDRSCIEQD WNQTRSLRQ ETLAESTWOC PPCDEDWDKD LWEQTSTPFV 120 
WGTTHYSDNN SPASNIVTEH KNNLASGMRV PKSLPYVLPW KNNGNAQ 



282 



WO 02/086443. 

Seq -ID-NO: . 248 _DNA sequence 
Nucleic Acid, Accession g: NM_003392 
Coding sequence: 758..18SS • - 



PCTAJS02/12476 
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TTAAGGAAAT 
AACTGATTAT- 
CGCGCTGGTC 
CATCCTCCAC 
TCCACTCGCC 
GGCCCAGGTT 
GAGGGACTGA 
GGAAGGAGGC 
TCGCTGGCGT 

tgggtgqatt: 

GAGAAGCGCA 
TGTATGCTTG 
GTCCATTGGA 
CAAGTTCTTC 
CAATTCTTGG" 
AGGAGCACAG 
CCACTTGTAT 
ATGCCAGTAT 
TTTTGGCAGG 
AGCAGGGGTG 
CTGCAGCCGC 
CGACAACATC 
GCGCATCCAC 
CGAGGCCGGC 
GTCCGGCTCA 
TGATGCCCTG 
GTTGGTACAG 
CCCCAGCCCT. 
CCTGTGCAAC 
GTACGACCAG 
CTACGTCAAG 
GCCACCCAGC 
TGGTTTTTGG 



TTTTTTCCTG 
GGCAATAATG. 
TACAAGACTT 
CTGTGTGGGA 
TGCCATCATA 
TTCAGCTTCT 
AAAACAAAAC 
CATTTTCAAA 
GGTATATCAC. 
AATAGCTCAT 
CTCTTATGTC 
AAGACCCCCA 

. ATGAAATATC 
TACATGAATC 
GCACTGCACC 
ACACTGAGCC 
GCAGCTCCAC 
TGGAAAACAG 
TAGTTTTCTA 

. TATCACTGTT 
GTGTACCTTA 
GGTTTAATGG 
ATATATAAAT 
CTCTGGGGTT 
ATTCCAAAAG 
GCACGACGAA 
TGTTGGGTTG 
TTCTGTTCAC 
ATTCAAAACT 
AACCCATGCC 
CACTACATAG 
CAGGCCATCA 
ACATCTTTTC 
CTCTTAATTT 

. ATAATGATAT 
AAAGCACTAA 
.TACTTT T T T T 
GTTGAGTTTA 
CATTCAGATA 
GTCTTGCGTG 
AATGGAAGAT 



11 

I i 
CCGGGCTGCT 
; GAAACATACG 
CCCGGGGCCT 
CCCCCGCGCT 
TCCCGTGCTC 
GGGGAGAGGA 
GCACAGCACC 
AGCGCCTGGC 
GCCCCGCGCA 
AATTTGGAAA 
GTCAATCAAC 
AAAATTATCT 
ATATTAAGCC 
CTAGTGGCTT 
TGGTCGCTAG 
CCTCTCTGCA 
CAGGACCACA 
CAATTCCGAC 
GTGATGCAGA 
GTGAACGCCA 
GCCGCGCGCC 
GACTATGGCT 
GCCAAGGGCT 
CGCAGGACGG 
TGTAGCCTGA 
AAGGAGAAGT 
GTCAACAGCC 
GACTACTGCG 
AAGACGTCGG 
TTCAAGACCG 
TGCAAGAAGT 
ACTCAGCCCC 
TTTTTAGAAA;, 
TTACCATCTA 
GGGGTGGGAA 
CTTTTGGATA 
CATGGTACAC 
TGGGATGGGT 
ATGACCAAAA 
AAAACAAAAC 
ATGATAATTT 
ATGTCTCATT 
GAAATTTGGG 
AAGATGTTGA- 
ATGATTCTGG 
CTGTATTTTC ' 
CCATTCACAG 
AGAGCAGACA 
CTCTCTGATT 
TGGGTCCCCT 
TTCACTACTT 
TGTTTTAATG 
ATGATCCTGT 
AAACTGTTCC 
TGCCTGATAT- 
ATAAATATAA 
ATCTCTCTGT 
TTTTTTGAGT 
GCAACCTCGT 
AAGATATCTT 
TTTGTGGAGA- 
CAGAAGCATC 
TATTAGAAAT 
ATAGCTTTTT 
AATATGTTCT 
ATACCCCCCC 
ATTGCATAAT 
TCACATCCCC 
TTAGTTTAAA 
ATTTGCTAAA 
ACAATCCTAG 
TTATGTATAT 
ATTTGTATAT 
AGAATATAAA 



21 
I 

CTTCCCCATC- 
ATGTTAATTC 
CGCCCCCCAC: 
GGCCACCCCG 
CTCTCGCCCA 
CGGAGGGTGG: 
AACTAGAGAG 

accagggctt; 
caggatccca 
aagaaactgc- 
agtaaactta- 
gagagggaat 

CAGGAGTTGC 
TGGCCATATT 
GTATGAATAA 
GCCAACTGGC 
TGCAGTACAT 
ATCGACGGTG 
TAGGCAGCCG, 
TGAGCCGGGC 
CCAAGGACCT. 
ACCGCTTTGC 
CCTACGAGAG: 
TGTACAACCT 
AGACATGCTG> 
ACGACAGCGC 
GCTTCAACTC 
TGCGCAATGA 
AGGGCATGGA- 
TGCAGACGGA 
GCACGGAGAT 
GCTCCCAGGA 
TATTTTTTAT 
AGAACTCTGT 
CCACGAAAAA 
GTATAGAATG 
ATCCAGAAGG 
AGGTTCCAGT. 
TGAGTTGTAA 
AAACCTCCCT 
ACAATGGAAG 
CTCCTCAAAT 
CAGCAGGGAG 
TTTGAAGCTG. 
ACACTAGATT 
TTAGGGATAC 
GTTTCTCAGC 
ACCTATTTGA- 
CCTCCGTGTT 
TTGGTTGTAG 
AGGGATTTTT 
ACAGAACTTG 
GTTTAGATTA 
CAGTGTACTT 
CTCAAAGTCT 
ATATATCTCA 
CTAGAGCATT 
CTTGAGCTTG 
TTCTGAGGAA 
TTTTTCTTTT 
GGGCATTACT' 
AGCAATGTTT 
GACAGTACTT 
TTTTTTTTTT 
TATCTCAGAC 
TTAGGAGGTT 
GATATCCACA 
TCAGTTGCAG 
ATGTCACTTT 
TCAGATTGTT 
CTTTTAAAAG 
CTTCTAGCCT 
TTCACTGGTT 
ATAAAACGTT 



31 
I 

TGGAAGTGGC 

GGAGCTGCAT 

CCCCTGCCCT 

CCTCCTTGGC 

TGGAATTAAT 

CCGCAGCGGG 

GGGTCAGGGG 

TGACTCAACA 

GCGAAAATCA. 

CTATATCTTG, 

AGAGACCCCC 

AAACATCTTT; 

TTTGGGGATG 

TTTCTCCTTC 

CCCTGTTCAG 

AGGACTTTCT- 

CGGAGAAGGC 

GAACTGCAGC 

CGAGACGGCC 

GTGCCGCGAG 

GCCGCGGGAC 

CAAGGAGTTC 

TGCTCGCATC 

GGCTGATGTG: 

GCTGCAGCTG 

GGCGGCCATG- 

GCCCACCACA 

GAGCACCGGC; 

TGGCTGCGAG- 

GCGCTGCCAC- 

CGTGGACCAG 

CCCGCTTATT 1 

TTTTCCCCAA 

GGTTTATTAT 

TATTTATTTT . 

AAGGGGGAAA 

TAAAGAAATA 

TGAAAGAGGG 

ATTCTCTGGT 

TCCCCAGCAG" 

GACAAGAATG 

ATTCCATTTG 

gaaagtcccc 
ttataagaat: 
ttttgtttgg; 

TTGGTTAGTA' 
CCAAGCAACA 
GGAAAAACAG 
GTGATGTGAT 
GACAGGAAAT 
GTTTCCTAAA 
GCTAATGGAA 
TCCACTCATG 
GAACAGTTGC 
TTTGTACATA 
TTGCAGCCAG; 
GTTGTCCTTC 
GGCTGTGGCC 
GAAGCTTGAG 
CTGCCTCACC 
TGTTCGTTAT. 
CTCTTTTCTT- 
ATTAATTGAG 
TTTTTTTTAA 
TTACGTTGTT 
GGGCTTTCAT* 
TCAGCCAACT 
TGAATTGTGA 
TTTGGTTTTT 
CCTTTTTAGT 
AAACTATTTA 
TTATTCTGTA 
TAAAAAACAA 
ACTTGTAAAA 



41 
I 

TTTCCCCACA 
TTCCCAGCTG 
TCCCTCCCGC 
AGCCTCTGGC • 
TCTGGCTCCA 
TTCCTGAGTG 
GTGCGGGACT 
GAATTGAGAC 
GATTTCCTGG 
CCATCAAAAA 
GATGCTCCCC 
TCCTTCTTCC 
GCTGGAAGTG' 
GCCCAGGTTG 
ATGTCAGAAG' 
CAAGGACAGA 
GCGAAGACAG 
ACTGTGGATA 
TTCACATACG ' 
GGCGAGCTGT 
TGGCTCTGGG 
GTGGACGCCC 
CTCATGAACC 
GCCTGCAAGT 
GCAGACTTCC 
CGGCTCAACA 
CAAGACCTGG 
TCGCTGGGCA 
CTCATGTGCT 
TGCAAGTTCC 
TTTGTGTGCA 
TATAGAAAGT 
GAATTGCAAC 
TAATAT TATA 
GTGGATCTTT 
TAACACATAC 
CATTTTCTTT 
TGGTAGAAAT 
GCAAGATAAA 
GGCTGCTAGC 
TCATATTCTC 
CAGACAGACC 
AGAAATTAAA 
TGGGATTCCA 
GGAGGTTGGC 
AATTATAATA 
AGGTAATTGC 
TGAAATCCAC 
GCTGGCCACG 
GAAACATTAG 
ACTTTTATTT 
TTCACAGAGG 
CTTCTCCTAT 
ATTTATAAGG 
ACATATATAT 
TGATTTAGAT 
ACTGCAGTCC 
CCGCTGTGAT 
TTCTGACTCA 
CCTTTGTCTC 
AGACATGGAC 
AGTTCATTCT 
TCCCTAAGGA 
TAAGGACACC 
TTAAAAGTTT 
ATCACCTCAG 
GTGGCTCTTT 
GCAAAAGATC 
ATTATACAAA 
GACTCATGTT 
ATGTAAAATA 
CTTTTAATGT 
ACATCGAAAG 
AAAAAAAA 



51 

I; 

TCGGCTCGTA 

GGCACTCTCG 

GTCCTGCCCC 

GGCAGCGCGC 

CTTGTTGCTC 

AATTACCCAG 

CGAGCGAGCA 

ACGTTTGTAA 

TGAGGTTGCG 

ACTCACGGAG ' 

TGGTTTAACT 

CTCTCCAGAA 

CAATGTCTTC 

TAATTGAAGC 

TATATATTAT 

AGAAACTGTG 

GCATCAAAGA 

ACACCTCTGT 

CCGTGAGCGC-r 

CCACCTGCGG *' 

GCGGCTGCGG 

GCGAGCGGGA 

TGCACAACAA 

GCCATGGGGT 

GCAAGGTGGG 

GCCGGGGCAA 

TCTACATCGA 

CGCAGGGCCG 

GCGGCCGTGG 

ACTGGTGCTG . 

AGTAGTGGGT 

ACAGTGATTC 

CGGAACCATT - 

ATTATTATTT 

GAAAAGGTAA - 

CCTAACTTAG - 

TTCTCAAATA 

CTATTCACAA 

AGGTCTTGGG 

TTGCTTTCTG 

AAGGAAAAAA 

GTCATATTCT 

AAATTTAAAA 

GATTTGTAAA 

TTGAACATAA 

GTAGAAATAA 

GTGCCATTCA 

CTTCCTCTTC 

TTTCCAAACG 

GAGCTCTGCT 

TGAGGAGCAG 

TGTTGCAGCG 

TGTACTGCAG 

GGGGAAATGT 

ATATATACAT 

TTACAGCTTA 

AGTTGGGATT 

CATACCCTGA 

CTGAAATGCG 

CAACCTCCAT 

GTTAAGAGAT 

GCAGAATGGA 

ATATTCAGCC 

TCTTTCCAAA 

GGAAAGATAC 

CCAACTGTGG 

AATTTATTGC 

TTGAAAGCAA- 

AACCATGAAG 

TATGAAGAGA 

TTCTACATGT 

ACATATTTCT 

GCTTATTCCA 
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Seq ID NO: 249 Protein sequence : 
Protein Accession fh NP_003383 . 
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WO 02/086443 

MAGSAMSSKF FLVALAIFFS FAQWIEANS KWSLGMNNPV 
SQGQKKLCHL YQDHMQYIGE GAKTGIKECQ YQFRHRRWNC 
AFTYAVSAAG WNAMSRACR EGELSTCGCS RAARPKDLPR 
FVDARERERI- HAKGSYESAR ILMNLHNNEA GRRTVYNLAD 
LADFRKVGDA LKEKYDSAAA MRLNSRGKLV QVNSRFNSPT 
GSLGTQGRLC NKTSEGMDGC ELMCOGRGYD QFKTVQTERC 
QFVCK 

Seq ID NO: "250 DNA' sequence 
Nucleic Acid Accession gi NMJJ1405B 
' Coding sequence : 5 6 .. 1324 



QMSEVYIIGA 
STVDNTSVFG 
DWLKGGCGDN 
VACKCHGVSG 
TQDLVYIDPS 
HCKFKWCCYV 



TGACTTGGAT 
TCGGCCAGAT* 
CGTCATCTTC 
GAGATATAAT 
ACTATATGCT 
TGAATCAATG 
TCAGGTTATC 
TAGATTTCAC' 
TGAAAAGCTG. 
AAAAATCAAC 
TAAAACTCTA 
GCCCTGGCAG 
TGCCACATGG 
GACTGCTTCC 
AATTGTCCAT. 
TTCTAGCCCTV 
TGAGTTTCAA 
TTACAGTCAA- 
TGAACCTCAA 
AGGAAAAACA 
AGAT ATCTGG : 
GCCTGGTGTT 
CTAAGAGAGA- 
CCATTTTTAG 
AATAAACTGT 



11 
I 

GTAGACCTCG 
GTGGTGAGGG. 
ATATCCCTGA- 
CAAAAGAAGA 
GAGTTTGGCA 
GTGAAAAATG 
AAGTTCAGTC 
TCTACTGAGG 
CAAGATGCTG 
AAGACAGAAA 
GGTCAGAGTC 
GCTAGCCTGC 
CTTGTGAGTG 
TTTGGAGTAA 
GAAAAATACA 
GTTCCCTACA 
CCAGGTGATG. 
AATCATCTTC 
GCTTACAATG. 
GATGCATGCC 
TACCTTGCTG' 
TATACTAGAG, 
AAAGCCTCAT 
AGATACAGAA 
TTGCTTGATG 



21 
I 

accttcacag 
' ctaggaaaag; 
.ttgtcctggc; 
cctacaatta" 

GAGAGGCTTC 
CATTTTATAA 
AACAGAAGCA- 
ATCCTGAAAC 
TAGGACCCCC 
CAGACAGCTA 
TCAGGATCGT 
AGTGGGATGG 
CTGCTCACTG 
CAATAAAACC 
AACACCCATC 
CAAATGCAGT 
TGATGTTTGT 
GACAAGCACA 
ACGCCATAAC 
AGGGTGACTC 
GAATAGTGAG 
TTACGGCCTT 
GGAACAGATA 
TTGGAGAAGA 
CAAAAAAAAA 



31 
I 

GACTCTTCAT 
AGTTTGTTGG 
AGTGTGCATT 

ctatagcaca 
taacaatttt; 

ATCTCCATTA 
TGGAGTGTTG" 
TGTAGATAAA- 
TAAAGTAGAT- 
TCTAAACCAT. 
TGGTGGGACA 
GAGTCATCGC 
TTTTACAACA 
TTCGAAAATG 
ACATGACTAT- 
ACATAGAGTT 
GACAGGATTT 
GGTGACTCTC 
TCCTAGAATG*. 
TGGAGGACCA- 
CTGGGGAGAT 
GCGGGACTGG 
ACATTTTTTT 
CTTGCAAAAC 
A 



41 
I 

TGCTGGTTGG 
GAACCCTGGG 
GGACTCACTG 
TTGTCATTTA 
ACAGAAATGA- 

agggaagaat: 
gctcatatgc 
attgttcaac 
cctcactcag: 

TGCTGCGGAA 
GAAGTAGAAG 
TGTGGAGCAA 
TATAAGAACC 
AAACGGGGTC" 
GATATTTCTC 
TGTCTCCCTG 
GGAGCACTGA 
ATAGACGCTA 
TTATGTGCTG 
CTGGTTAGTT 
GAATGTGCGA 
ATTACTTCAA 
TTGTTTTTTG 
AGCTAGATTT 



QPLCSQLAGL 

RVMQIGSRET 

IDYGYRFAKE . 

SCSLKTCWLQ 

PDYCVRNEST 

KCKKCTEIVD 



51 
I- 

CAATGATGTA 

TTATCGGCCT 

TTCATTATGT-: 

CAACTGACAA- 

GCCAGAGACT * 

TTGTCAAGTC 

TGTTGATTTG 

TTGTTTTACA- 

TTAAAATTAA 

CACGAAGAAG 

AGGGTGAATG 

CCTTAATTAA 

CTGCCAGATG. 

TCCGGAGAAT 

TTGCAGAGCT 

ATGCATCCTA ■ 

AAAATGATGG.' 

CAACTTGCAA 

GCTCCTTAGA 

CAGATGCTAG 

AACCCAACAA 

AAACTGGTAT 

GGTGTGGAGG 

GACTGATCTC 



Seq ID NO: 251 Protein sequence: 
Protein Accession #: NP_054777 



1 
I 

MYRPDWRAR 
DKLYAEFGRE- 
ICRFHSTEDP 
RSKTLGQSLR 
RWTASFGVTI 
SYEFQPGDVM 
LEGKTDACQG 
GI 



11 . 
I 

KRVCWEPWVI 
ASNNFTEMSQ 
ETVDKIVQLV 
IVGGTEVEEG 
KPSKMKRGLR 
FVTGFGALKN 
DSGGPLVSSD 



21 
I 

GIjVIFISLIV 
RLESMVKNAF- 
LHEKLQDAVG 
EWPWQASLtQW 
RIIVHEKYKH 
DGYSQNHLRQ 
ARDIWYLAGI 



31 



f 



LAVCIGLTVH 
YKSPLREEFV 
PPKVDPHSVK 
DGSHRCGATIi 
PSHDYDISLA 
AQVTLIDATT 
VSWGDECAKP 



41 
I 

YVRYNQKKTY 
KSQVIKFSQQ 
IKKINKTETD 
INATWLVSAA 
ELSSPVPYTN- 
CNEPQAYNDA 
NKPGVYTRVT 



51 
I 

NYYSTLSFTT 
KHGV1AHMLL 
SYLNHCGGTR 
HCFTTYKNPA 
AVHRVCLPDA 
ITPRMLCAGS 
ALRDMITSKT 



Seq ID NO: , 252 DNA sequence 

Nucleic Acid Accession ft: NM_003504.2 

Coding sequence: 71-1771 



1 
I 

ggcacgaggc 
cgccgtggct. 
gagggtcctt; 

GGCCTTGTTC 

acttgaaact 
tggagctaat 
gtgtgacacc 

ACTCATTAAA 
AGAGGAGGAT. 
CACACGGTTA 
GGAGGCCCGG 
GTCAGCCATG 
GTGGTGGGCC 
ATACGTGACT. 
GGATGAGGAG 
CCTGGTGCTC 
AGCCAGGTTC 
' CATGGGTCTT 
GGAGAATTTG- 
CGTGCAGACT 
CTTTGCCACC 
CATCCAGGCT 
ACTCGCCAAG 
CCTOGTCATC 
CATGCTGTTC 
TGTGTGTTCG 



11. 
I 

CTCGTGCCGC 
ATGTtCGTGT 
CTCTTCGTGG 
CAGTGTGACC 
GCATTTCTTG 
GTAGACCTAT 
CATAGGCCAG 
CAAGATGATG 
GAAGAGCATT 
GAAGAGGAGA 
AGAAGAGACA 
GTGATGTTTG 
ATCGTTGGAC 
GATGTTGGTG 
AACACACTCT 
TACCAGCACT. 
AAGCTGTGGT 
CCCCTGAAGC 
CGGGAAATGA 
TTCAGCATTC 
ATGTCTTTGA 
CTGGACAGCC 
AAGCAGCTGC 
TCCCAGGGGC 
TCTAGGCCGG 
ACAAAGAACC 



21 
I 

CGGGCTCTTG 
CCGATTTCCG 
CCTCGGACGT 
ACGTGCAATA 
AGCATAAAGA 
TGGATATTCT" 
TCAATGTCGT 
ACCTTGAAGT 
CAGGAAATGA 
TAGTGGAGCA 
TCCTCTTTGA 
AGCTGGCTTG. 
TAACAGACCA 
TCCTGCAGCG" 
CCGTGGACTG 
GGTCCCTCCA 
CTGTGCATGG 
AGGTGAAGCA" 
TTGAAGAGTC 
ATTTTGGGTT 
TGGAGAGCCC 
TCTCCAGGAG 
GAGCCACCCA 
CTTTCCTGTA 
CATCCCTAAG 
.GGCGCTGCAA 



31 
I 

GTACCTCAGC 
CAAAGAGTTC 
GGATGCTCTG 
TACGCTGGTT: 
ACAGTTTCAT. 
TCAACCTGAT. 
CAATGTATAC 
TCCCGCCTAT 
CAGTGATGGG 
AACCATGCGG 
CTACGAGCAG 
GATGCTGTCC 
GTGGGTGCAA 
CCACGTTTCC 
CACACGGATC 
TGACAGCCTG 
ACAGAAGCGG 
GAAGTTCCAG 
TGCAAATAAA 
CAAGCACAAG 
CGAGAAGGAT 
TAACCTGGAC 
GCAGACCATT 
CTGCTCTCTC 
CCTGCTCAGC 
ACTGCTGCCC 



41 
I 

GCGAGCGCCA 
TACGAGGTGG 
TGTGCGTGCA 
CCAGTTTCTG 
TATTTTATTC 
GAAGACACTA 
AACGATACCC 
GAAGACATCT 
TCAGAGCCTT 
AGGAGGCAGC 
TATGAATATC 
AAGGACCTGA 
GACAAGATCA 
CGCCACAACC 
TCCTTTGAGT 
TGCAACACCA 
CTCCAGGAGT 
GCCATGGACA 
TTTGGGATGA* 
TTTCTGGCCA 
GGCTCAGGGA 
AAGCTGTACC 
GCCAGCTGCC 
ATGGAGGGCA 
AAACACCTGC 
CTGGTGATGG 



51 
I 

GGCGTCCGGC 
TCCAGAGCCA 
AGATCCTTCA 
GGTGGCAAGA 
TCATAAACTG 
TATTCTTTGT 
AGATCAAATT 
TCAGGGATGA 
CTGAGAAGCG 
GGCGAGAGTG 
ATGGGACATC 
ATGACATGCT 
CTCAAATGAA 
ACCGGAACGA 
ATGACCTCCG. 
GCTATACCGC" 
TCCTTGCAGA • 
TCTCCTTGAA 
AGGACATGCG 
GCGACGTGGT 
CAGATCACTT 
ATGGCCTGGA 
TTTGCACCAA 
CTCCAGATGT 
TCAAGTCCTT 
CTGCCCCCCT 
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GAGCATGGAG CATGGCACAG TGACCGTGGT GGGCATCCCC 
CAGGAAGAAC TTTTTTGGGA GGGCGTTTGA GAAGGCAGOG 
GCTGCACAAC *- CATTTTGACC TCTCAGTAAT TGAGCTGAAA 
TCTGGACGCA CTTATTTCCC • TCCTGTCCTA GGAATTTGAT 
ATTTATGTAA "CTGGCTTTCA TTTAGATTGT' AAGTTATGGA 
CCATTTTTTA TTAAATAAAA TGCTTATTTT AGGCTCCGTC 
AAAAAAAAAA AA 

Seq ID NO: 253 Protein sequences 
Protein Accession 8: NP_003495.1 



PCT/US02/12476 



CCAGAGACCG ACAGCTCGGA 1620 

GAAAGCACCA GCTCCCGGAT 1680* 

GCTGAGGATC GGAGCAAGTT 1740 

TCTTCCAGAA TGACCTTCTT 1800 

CATGATTTGA GATGTAGAAG 1860 

CCCAAAAAAA AAAAAAAAAA 1920 



1 
I 

mfvsdfrkef 
aflehkeqfh- 
qdddlevpay- 
rrdilfdyeq; 
dvgvlqrhvs 
klwsvhgqkr 
fsihfgfkhk' 
kqlratqqti- 

TKNRRCKLLP. 
HFDLSVIELK 



11 
I 

YBWQSQRVIi 
YFILINCGAN 
EDIFRDEEED 
YEYHGTSSAM 
RHNHRNEDEE 
LQEFLADMGL 
FLASDWFAT. 
ASCLCTNLVI 
LVMAAPLSME 
AEDRSKFLDA 



21 
I 

LFVASDVDAL 
VDLLDILQPD 
EEHSGNDSDG 
VMFELAWMIiS 
NTIiSVDCTRI 
PLKQVKQKFOJ 
M5LMESPEKD 
SQGPFLYCSL 
HGTVTWGIP 
LISliS 



31 
I 

CACKILQALF 
EDTIFFVCDT 
SEPSEKRTRL 
KDLNDMLWWA- 
SFEYDLRLVL^ 
AMD I SLKENL 
GSGTDHFIQA* 
MEGTPDVMLF 
PETDSSDRKN 



41 
I 

QCDHVQYTLV 
HRPVNWNVY 
EEEIVEQTMR 
IVGLTDQWVQ 
YQHWSLHDSL 
REMIEESANK- 
LDSLSRSNLD 
SRPASLSLLS 
FFGRAFEKAA 



Seq ID NO: 254 DNA sequence 
Nucleic Acid Accession ft: NM_022337 
Coding sequences 48.. 683 



51 
I 

PVSGWQELBT 
NDTQIKLLIK 
RRQRREWEAR 
DKITQMKYVT- 
CNTSYTAARF 
FGMKDMRVQT 
KLYHGLELAK 
KHLLKSFVCS- 
ESTSSRMLHN 



60 
120 
180 
240 
300 
360 
420 
480 
540 



30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



1 
I 

GGCTGCGCTT 
ACAAGGAGCA- 
TCATCAAGCG- 
ACTTCGCGCT; 
ATATCGCAGG' 
GTGCATTTAT . 
AAAATGATTT 
TGGCCAACAA 
AGTTCTGCAA 
ACATTGATGA' 
TGGAGTCTAT 
GCTCTGGCTG 
TTGTTCCACA 
CACATGTGGC 
GTTCTTTCTA 
TCTGTTACAA-' 
TTATTTGCTT 

aactagctgt 
aatatattct 
gacctccatx 
acaggtgtgc 
aactgaatat: 
ctcaagctgt 
gcaagtgaac 



11 

I 

CCCTGGTCAG 
CCTGTACAAG,. 
CTACGTGCAC 
CAAGGTGCTC 
TCAAGAAAGAV 
TGTCTTCGAT 
GGACTCCAAG 
ATGTGACCAa 
GGAGCACGGT' 
AGCCTCCAGA 
TGAGCCGGAC 
TGCCAAATCC 
AATTGTGCCT 
AAGCCAAAGA 
TGCTTTCCTC 
ACTTCTGTCA 
CTTTTAATCA 1 
CAAGTCAAGG 
CTGATGGCCT' 
CTCGGCAGAC 
TATATTGTCC 
TGTATGAAAA' 
GGGGCTCCTC 
AATAAAACAT 



21 ' 
I 

GCACGGCACG 
TT GCTGGTGA 
CAGAACTTCT 
CACTGGGACC 
TTTGGAAACA 
GTCACCAGGC 
TTAAGTCTCC 
GGGAAGGATG 
TTCGTAGGAT 
TGCCTGGTGA 
GTCGTGAAGC 
TAGTAGGCAC 
CTATTTTTAC. 
TCTATGCCTC 
ACCATCATCA 
TGTAGCTGAC 
GCAAAGGCCT 7 
ACTGGCTTTC 
GACAGGCCTA 
CTAAGAGTTG 
TTGTCCTAAC 
GACATGCCTC 
TATACATGCT 
TAAAAGATAA 



31 
I 

TCTGGCCGGC 
TTGGCGACCT 
CCTCGCACTA- 
CGGAGACTGT 
TGACGAGGGT; 
CAGCCACATT 
CTAATGGCAA* 
TGCTCATGAA: 
GGTTTGAAAC 
AACACATACT 
CCCATCTCAC 
CTTTGCTGGT- 
CATTTTGGGT 
TGTTTTTTCA 
CAGTGTTTAC 
CAAAATCCTG 
CAAGTCTTAA 
ACCTTGCCCT 
TTAAGTAGAT" 
CCTCTGAGTT 
TGTCACTTGC 
CATATGTGCC 
ATACATGTAA 
AA 



41 . 
I 

CGCCAGGATG 
GGGCGTGGGG; 
CCGGGCCACA: 
GGTGCGCCTG 
CTATTACCGA- 
TGAAGCAGTG, 
ACCGGTTTCA 
CAATGGCCTC 
ATCAGCAAAG" 
TGCAAATGAG 
ATCAACCAAG 
GTCTGGTAGG 
AAACGTCAGG 
ATGAGAGAGA 
AAACTTTTGA 
CAGGGCCACA 
AATAAAAGGG 
GGTGTCTTTT 
GTGATATTTT 
AGCTCTTTGG 
CATGGCCTGA 
TTTCTGTTAG 
TATATATTAT 



51 
i 

CAGGCCCCGC 
AAGACCAGTA 
ATOGGCGTGG 
CAGCTCTGGG 
GAAGCTATGG . 
GCAAAGTGGA 
GTGGTTTTGT 
AAGATGGACC 
G AAAAT ATAA ■' 
TGTGACCTAA 
GTTGCCAGCT 
AATGACCTCA 
ATAGATATAC 
AATAGCAAAT 
AAATATTTAG 
GTCGGCACTG . 
GAGAAGAACA . 
TCCAGATTTC 
CTTCCAAGAT 
AATCGTGAAC 
ATGTTGGCTT 
CTCTCTTTGA 
ATATATTTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



Seq ID NO: 255 Protein sequence: 
Protein Accession #: NP_071732 



11 



21 



31 



41 



51 



I I I I I 

MQAPHKEHLY KUjVIGDLGV GKTSIIKRYV HQNFSSHYRA TIGVDFALKV LHWDPETWR 
LQLWDIAGQE RFGNMTRVYY REAMGAFIVP DVTRPATFEA VAKWKNDLDS KLSLPNGKPV 
SWLLANKCD QGKDVLMNNG LKMDQFCKEH GFVGWFETSA KENINIDEAS RCLVKHILAN 
ECDLMESIEP DWKPHLTST KVASCSGCAK S 

Seq ID NO: 256 DNA sequence 
Nucleic Acid Accession #: NM_016321 
Coding sequence: 25.. 1464 



GGAACCGCCC 
CCGCTGGCGG 
GGTGTTCGTG 
GAACTTGAGC 
CGTGATGGTC 
CGCCGTGGGC 
GGGCTGGTTC 
CGCTGACTTC 
CCCCATTCAG 
CATTCTCCTT 
TGGCGCCTAC 
CAAGGAGAGA 
CCTGTGGATG 
-CCGAGCCGCC 



11 

I- 

GCTGCCAGCC 
CTGCCGCTCA 
CGCTACGACT 
GACATGGAGA 
TTCGTGGGCT 
TTCAACTTCC 
CACTTCTTAC 
TGCGTGGCCT 
CTGCTCATCA 
AACCTGCTAA 
TTTGGGCTCA 
CAGAATTCTG 
TACTGGCCCA 
ATCAACACCT 



21 - 
i 

CGGCCAGGCA 
CCTGCCTGCT 
TCGAGGCCGA 
ACGAATTCTA 
TCGGCTTCCT 
TGTTGGCAGC 
AAGACCGCTA 
CTGTCTGCGT 
TGACTTTCTT. 
AGGTGAAGGA- 
CAGTGACCCG 
TGTACCAGTC 
GCTTCAACTC 
ACTGCTCCTT 



31 
I 

CCCCTGCAGC 
CCTGCAGGTG 
CGCCCACTGG 
CTATCGCTAC 
CATGACTTTC 
CTTCGGCATC 
CATCGTCGTG 
GGCCTTTGGG 
CCAAGTGACC 
TGCAGGAGGC 
GATCCTCTAC 
GGACCTCTTT 
AGCCATATCC 
GGCAGCCTGC 



41 
I 

ATGGCCTGGA 
ATTATGGTGA 
TGGTCAGAGA 
CCAAGCTTCC 
CTGCAGCGCT 
CAGTGGGCGC 
GGCGTGGAGA 
GCAGTTCTGG 
CTCTTCGCTG 
TCCATGACCA 
CGACGCAACC 
GCCATGATTG 
TACCATGGGG 
GTGCTTACCT 



51 
1 

ACACCAACCT 

TTCTCTTCGG 

GGACGCACAA 

AGGACGTGCA 

ACGGCTTCAG- 

TGCTCATGCA 

ACCTCATCAA 

GTAAAGTCAG 

TGAATGAGTT 

TCCACACATT 

TAGAGCAGAG 

GCACCCTCTT 

ACAGCCAGCA 

CGGTGGCAAT 



60 
120 
180 



60 
120 
180' 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



285 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 ; 

ATCCAGTGCC CTGCACAAGA AGGGCAAGCT GGACATGGTG 
CGCAGGAGGG "GTGGCCGTGG GTACCGCTGC TGAGATGATG 
CATCATCGGC TTOGTCTGCG GCATCATCTC CACCCTGGGT 
CCTGGAGTCC CGGCTGCACA TCCAGGACAC ATGTGGCATT 
TGGCATCATA GGCGGCATCG TGGGTGCTGT GACAGOGGOC 
TGGAAAAGAA ; . GGGCTTGTCC ATTCCTTTGA CTTTCAAGGT 
AAGAACACAG" GGAAAGTTCC AGATTTATGQ TCTCTTGGTG 
GGGTGGCATC ATTGTGGGGC TCATTTTGAG ATTACCATTC 
GAACTGCTTT GAGGATGCGG TCTACTGGGA GATGCCTGAA 
CCCTGAGGAC. CCCACCTTCA AGCCCTCAGG ACCCTCAGTA 
CCCACTACCC ATGGCTTCCT CGGTACCCTT GGTACCCTAG 
GCAGGCTCCA CAGACTSTCC ' TGGGGCCCAG . AGGAGCTGGT 
AAGAGTGAGC. AAGCAGCACC CCCACCTGCT GGCTTGGCCT 
CCTCCCCTTC. ATCCCAGGGG GTCTGMCTGA GAATGGAGAA 

atccaagccg; ggttctggct GCAGAAGTTC TGCCTCTGCC. 
AGAAAAACAG '■ GCTCAAAGTG GGGCTGGGAC CTGGTGGGTG 
ACAACTTAGC TGCCAGTCAC CACCTATGAG GCTCTTCTAC 
AGCATCTCCT. ATGCTCCCTG GGTCCCCCAG ACCTCTCTGT 
CAGGAATAAA CATTCTTGTT GTCCTTTGTA AAAAAAAAAA 

Seq ID NO: 257 Protein sequence t 
Protein Accession 8: NP_057405 



PCT7US02/12476 



l 
I 

MAWNTN LRWR 
PSFQDVHVMVf 
GVENLINADF 
SMTIHTFGAYr 
YHGDSQHRAA' 
W1PYGALIIG 
SASLEVYGKE 
WGQPSDENCF 



11 
I 

LPLTCLLLQVt 
FVGFGFLMTF 
CVASVCVAFG 
FGLTVTRILY 
INTYCSLAAC 
FVCGIISTLG 
GLVHSFDFQG 
EDAVYWEMPE 



21 
I 

IMVIIiFGVFV 
LQRYGFSAVG 
AVLGKVSPIQ 



VLTSVAISSA 
FVYI.TPFLES 
FNGDWTARTQ 
GNSTVYIPED 



31 
I 

RYDFEADAHW 
FNFLLAAFGI- 
LLIMTFFQVT 
QNSVYQSDLF 
LKKKGKLDMV 
RLHIQDTCGI 
GKFQIYGLLV 
PTFKPSGPSV 



CACATCCAGA 
CTCATGCCTT 
TTTGTATACC 
AACAATCTGC 
TCCGCCAGCC 
TTCAAOGGGG 
ACCCTGGCCA 
TGGGGACAAC 
GGGAACAGCA 
CCCTCAGTAC 
GCTCCCAGGG 
GCTGACCTAG 
CAAGGTGCCT- 
GGAGAAGCTA 
TGGGGTCTTG 
AACCTGAGCT 
CCCGTGCCTG 
GTTGTGTGCG 
AAAAAAAA 



41 
I 

WSERTHKNLS 
QWALLMQGWF 
LFAVNEFILL 
AMIGTLFLWM 
HIQNATLAGG 
NNLKGIPGII 
TLAMALMGGI 
PSVPMVSPLP 



ATGCCACGCT 
ACGGTGCCCT 
TGACCCCATT . 
ATGGCATTCC 
TTGAAGTCTA : 
ACTGGACCGC 
TGGCCCTGAT 
CTTCAGATGA 
CTGTCTACAT * 
CCATGGTGTC 
CAGGTGAGGA 
CTAGGGATGC 
CCACCCCTGC 
CAAAGTGGGC 
GCCACATTGG 
CTCCCAGGAG 
CACCTCGGCC 
TGGCAGCCTC 



51 

I. 

DMENBFYYRY 
HFLQDRYIW 
NLLKVKDAGG 
YWPSFNSAIS 
VAVGTAAEMM- 
GGIVGAVTAA 
1VGLILRLPF 
MASSVPLVP 



900 
- 960 • 
.1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 • 
1620 
1680 
1740 
1800 
1860 
1920 



Seq ID NO: 258 DNA sequence 

Nucleic Acid Accession #: NM_002358.2 

Coding sequence: 75.. 692 



1 - 
I 

GGGAAGTGCT 
TTGTGTCCCT 
GCGCCGAAAT 
GCATATATCC 
CTACTGATCT 
TATACAAGTG 
TCCTGGAAAG. 
CCAGAGAAAA 
CAGCTACGGT- 
ATACAGACAA 
CCAATTCTGA 
TGGTGGCCTA 
TAATTTTGAA 
TTGGTTAATT 
CCTTTATTTT 
CATTGTTCAA" 
GATAGTAACT. 
GTTTTGGTCA 
AAAGGAAGTC 
AACAATGAAA 
TTGAATCAGT 
ATATTTGTAC 
TTATAAAATC 
AAAAAAAAAA 



11 
I 

GTTGGAGCCG 
GGCCATGGCG 
CGTGGCCGAG 
ATCTGAAACC 
TGAGCTCATA 
TTCAGTTCAG 
ATGGCAGTTT 
GTCTCAGAAA 
GACATTTCTG 
AGATTTGGTT 
GGAAGTCCGC 
CAAAATTCCT. 
ATGTCGTTTT 
TTTACATGGA 
TTTGGTACCT 
AAGGAACCAG 
GTAGATGGAA 
AGTAGTTTGA 
TAAATATTCA 
TATTGCTGTA 
TTCCAATTAT 
TGTTTAATGT 
AAGTTTTAAG 



21 
I 

CTGTGGTTGC 
CTGCAGCTCT 
TTCTTCTCAT 
TTTACTCGAG. 
AAATACCTAA- 
AAACTGGTTG. 
GATATTGAGT 
GCTATCCAGG 
CCACTGTTGG 
GTACCTGAAA 
CTTCGTTCAT 
GTCAATGACT 
CCTGAAATCA 
GAAAACCAAA- 
ATTTGACTTA 
GAGGTTTTTT 
AAACTTGTGC 
CTCAGTATAG 
GAATCTTTGT 
TAGCTCCTTT 
TTGACTTTAA 
TCTGTGATAC 
TGAAAGTGAG 



31 
I 

TGTCCGCGGA 
CCCGGGAGCA 
TCGGCATCAA 
TGCAGAAATA 
ATAATGTGGT- 
TAGTTATCTC 
GTGACAAGAC 
ATGAAATCCG 
AAGTnrCTTG 
AATGGGAAGA 
TTACTACTAC 
GAGGATGACA 
GGTCATCTAT 
ATGATACTTA 
CCATGGAGTT 
TGTCAACATT. 
TATAAAGCTA 
GTAGGGAGAT 
TAAGGTCCTG 
TGACCTTCAT 
TTTATGTAAC 
AGAACTCTTA 
GAAATAAAGT 



41 

I 

GTGGAAGCGC 
GGGAATCACC 
CAGCATTTTA 
CGGACTCACC 
GGAACAACTG 
AAATATTGAA 
TGCAAAAGAT 
TTCAGTGATC 
TTCATTTGAT 
GTCGGGACCA 
AATCCACAAA 
TGAGGAAAAT 
AGTTGATATG 
CTGAACTGTG 
AACATCATGA 
GTGATGTATA 
GATGCTTTCC 
ATTTAAGTAT 
AAAGTAACTC 
TTCATGTATA 
TTGAACCTAT 
AAAATGTTTT 
TAAGTTTGTT 



51 
I 

GTGCTTTTGT 

CTGCGCGGGA 

TATCAGCGTG 

TTGCTTGTAA 

AAAGATTGGT 

AGTGGTGAGG 

GACAGTGCAC 

AGACAGATCA " 

CTGCTGATTT 

CAGTTTATTA. 

GTAAATAGCA 

AATGTAATTG- 

TTTTATTTCA 

TGTAATTGTT ' 

ATTTATTGCA 

TTCCTTTGAA 

TAAATCAGAT 

AAAATACAAC 

ATAATCTATA 

GTTTTCCCTA 

GAAGCAATGG 

TTCATGTGTT 

TT AAAAAAAA 



Seq ID NO: 259 Protein sequence: 
protein Accession ft: NPJ302349.1 



a 11 21 31 41 51 

I I |il> 

KALQLSREQG ITLRGSAEIV ABFFSFGINS ILVQRGIYPS ETFTRVQKYG LTLLVTTDLE 
LIKYLNNWE QLKDWLYKCS VQKLVWISN IESGEVLERW QFDIECDKTA KDDSAPREKS 
QKAIQDEIRS VIRQITATVT FLPLLEVSCS FDliLIYTDKD LWPEKWEES GPQFITNSEE 
VRLRSFTTTI HKVNSMVAYK IPVND 

Seq ID NO: 260' DNA sequence 
Nucleic Acid Accession ft: NM_001211 
Coding sequence: 43.. 3195 

i 11 j 1 j 1 t r 

AAAGGCCTGC AGCAGGACGA GGACCTGAGC CAGGAATGCA GGATGGCGGC GGTGAAGAAG 
GAAGGGGGTG CTCTGAGTGA AGCCATGTCC CTGGAGGGAG ATGAATGGGA ACTGAGTAAA 
GAAAATGTAC AACCTTTAAG GCAAGGGCGG ATCATGTCCA CGCTTCAGGG AGCACTGGCA 
CAAGAATCTG CCTGTAACAA TACTCTTCAG CAGCAGAAAC GGGCATTTGA ATATGAAATT 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 
160 



60 
120 
160 
240 



286 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

CGATTTTACA > CTGGAAATGA 
CAGAACTATC CTCAAGGTGG. 
GAAGCACTAC; AAGGAGAAAA 
AAATTAGGGC, GTTTATGCAA 
ATTGGTGTTT CACTTGCTCA 
AACTTTAGGA AAGCAGATGC 
GAAAGACTAC AGTCCCAGCA 
GCACTTGAGA AAGAAGAAGA* 
CTAGCTGAAC TAAAGAGCAA' 
GGTGCTCTCA : AGGCTCCAAG 
CAAAATAATA GTAGAATT AC 
TTGTCTAAGC CTACAGTCCA 
' GAGCTGCAAGf CAGGCCCTTG 
ACAGCTTCAC TGATAGCTGT 
ACTGCACAAC AGCCAGTTAT 
AGCACCAG AA AGCCTGGAAA 
CAAGCGTCTG AGGAGAAGAA- 
GTAGGGGAAT TCTCCTTTGA ; 
CAAAGGGAAG CCGAGCTATT 
GAAGAGATGG AGAAGAAGCT 
CAGCAAGAAG AGACGATGCC 
CAGAAAATAC CAGGAATGAC 
GAAACTTCAC TTGOGGAGAA 
CCTTTCTCCA TTTTTGATGA 
GATCCCCCAC GAGTTTTAGC 
ATCACCTCAA ATGAAGATGT 
TTGAGCGAGG ATGCCATTAT 
GACACTTGTG ACTTTGCCAG 
TCCTTGAAGG ATCTCCCTTC 
AAGACCTCTG AGGACCAGCA 
AAGAAGCTGA" GCCCAATTAT 
GGTTCTTCTG; CCTCGGTTGC 
CTAGAACTTA CTAATGAGAC 
CGCAGACAGC TACTGAAGTC 
GACAGACCAA- TGCCTAAGTT 
TGCATTAAAC" GAGAATACCT 
AACTCTGCAG AATTAACAGT 
ATCAACCTCA AGTTAAAGGA 
CAATATCAAG. ATGGCTGTAT 
CTTCTCCAAC ACAGTGAATA 
TTGACAATAG. TGGAGATGCT 
TGTCTGATTC TCAGAAACAG- 
TTGAAGATAG TGGACTTTTC 
CTCAGCGGCT TTCGGACTGT 
TCTCCCTACC AGGTAGACCT 
GAACACCTAC AGGTCTTCTG 
CTAAAAGATG . GTGAATTGTG 
GCCACAGTGT CTGTTCTTGG 
TTCCAAAGTC ACCTGAACAA 
TTGCTCTTTC AGTGAGCTAG 
GTATTGTGGA ACACTGAAAC 
CTACCATTGC' TGTTCTACTT 
ACAG TGAT AT ACTTACTCAT 
CAGACTCATT ACAAATGGTT 
CTTTTCCCAT TTGTAATTTG 
TAAAATAGTA TCTGTTAAAA 



PCT/US02/12476 



CCCTCTGGAT. 
GAAAGAGAGT 
AOGATATTAT . 
TGAGCCTTTG. 
GTTCTATATC 
GAT ATTTCAG : 
CCGACAATTC 
GGAGGAAGTT 
AGGGAAAAAG 
CCAGAACAGA 
TGTTTTTGAT 
GCCATGGATA 
GAACACAGGC 
ACCCGCTGTG- 
GACACCATGT 
GGAAGAAGGA 
AGAGAAGATG; 
AGAAATTCGG 
GACCAGTGCA 
AAAAGAAATC 
TACAAAGGAG 
TCTATCCAGT 
CATTTGGCAG 
GTTTCTTCTT- 
TCAACGAAGA 
GTCTCCAGAT. 
CACAGGCTTC 
AGCAGCTCGT. 
TGATCCTGAG 
GACAGCTTGT: 
TGAAGACAGT- 
AAGCACCTCC . 
TTCAGAAAAC 
CCTACCAGAG. 
GGAAATTGAG ' 
AATATGTGAA 
AATAAAGGTA 
ACGTTTAAAT 
TGTTTGGCAC 
TATTACCCAT 
ACACAAAGCA 
AATCCACGAT 
CTACAGTGTT 
ACAGATCCTG 
GTTTGGTATA 
GGATGGGTCC 
GAATAAATTC 
GGAGCTTGCA 
AGCCTTATGG 
GCAATCAAGT 
TGTATGTGCT 
TTTGGTACAG 
GGCCTTGTCT 
ACCTTGTTAT 
TAAAATGTTC 
AAAAAAAAAA 



GTTTGGGATA 

AATATGTCAA 

AGTGATCCTC 

GATATGTACA 

TCATGGGCAG 

GAAGGGATTC 

CAAGCTCGAG- 

TTTGAGTCTT- 

ACAGCAAGAG. 

GGACTCCAAA 

GAAAATGCTG 

GCACCCCCCA' 

AGGTCCTTGG 

CTTCCCAGTT- 

AAAATTGAAC 

GATCCTCTAC 

ATGTATTGTA 

GCTGAAGTTT- 

GAGAAGAGAG: 

CAAACTACTC 

ACAACTAAAC 

TCTGTTTGTC 

GAACAACCTC 

TCAGAAAAGA 

CCCCTTGCAG 

GTTTGTGATG 

AGAAATGTAA- 

TTTGTATCCA 

AGACTGTTAC 

GGCACTATCT. 

CGTGAAGCCA 

TCCATCAAAT 

CCTACTCAGT 

TTAAGTGCCT 

AAGGAAATTG 

GATTACAAGT 

TCTTCTCAAC 

GAAGATTTTG 

CAATATATAA 

GAAATAACAG. 

GAAATAGTCC 

CCCTATGATT. 

GACCTTAGGG 

GAAGGACAAA 

GCAGATTTAG: 

TTCTGGAAAC 

TTTGTGGGGA V 

GCAGAAATGA 

AAGGTAGGGA 

CTCACAGATT 

GTAATTTAAT: 

GTATATTTTG 

AACTTTTGTG. 

TTAACCCATT- 

TCTTATGATC 

AAAAAAAAAA 



GGTATATCAG 
OGTTATTAGA 
GATTTCTCAA 
GTTACTTGCA 
AAGAATATGA 
AACAGAAGGC 
TGTCTCGGCA 
CTGTACCACA 
CTCCAATCAT.. 
ATCCATTTCC 
ATGAGGCTTC. 
TGCCCAGGGC. 
AACACAGGCC 
TCACTCCATA' 
CTAGTATAAA' 
AAAGGGTTCA 
AGGAGAAGAT. 
TCCGGAAGAA' 
CAGAAATGCA- 
AGCAAGAAAG 
TGCAAATTGC 
AAGTAAACTG; 
ATTCTAAAGG 
AGAATAAAAG' 
TTCTCAAAAC 
AATTTACAGG 
CAATTTGTCC 
CTCCTTTTCA" 
CGGAAGAAGA 
ACAGTCAGAC 
CACACTCCTC 
GTCTTCAAAT 
CACCATGGTG 
CTGCAGAGTT. 
AATTAGGTAA 
TATTCTGGGT 
CTGTCCCATG 
ATCATTTTTG. 
ACTGCTTCAC. 
TGTTGATTAT 
ATGGTGACTT 
GTAACAAGAA 
TGCAGCTGGA' 
AGATCCTGGC 
CACATTTACT 
TTAGCCAAAA 
TTCTGAATGC 
ATGGGGTTTT. 
AGTTAACTAG 
GCTGCCTCAG 
TTAGGACACA 
ACGTCACTGA- 
AAGAACTATT 
TGTCTCTACT 
ACCATGTATT 
AAA 



CTGGACAGAG 
AAGAGCTGTA 
•TCTCTGGCTT 
CAACCAAGGG' 
AGCTAGAGAA 
TGAACCACTA 
AACTCTGTTG 
ACGAAGCACA 
"CCGTGTAGGA- 
TCAACAGATG 
TACAGCAGAG 
CAAAGAGAAT 
TCGTGGCAAT 
TGTGGAAGAG 
CCACATCCTA 
GAGCCATCAG ' 
TTATGCAGGA; 
ATTAAAAGAG 
GAAACAGATT " 
AACAGGTGAT 
TTCCGAGTCT • 
TTGTGCCAGA" 
TCCCAGTGTA 
TCCTCCTGCA 
CTCAGAAAGC' 
AATTGAACCC • 
TAACCCAGAA 
TGAGATAATG ' 
TCTAGATGTA . 
TCTCAGCATC* 
TGGCTTCTCT 
TCCTGAGAAA 
TTCACAGTAT 
GTGTATAGAA 
TGAGGATTAC J 
GGCGCCAAGA 
GGACTTTTAT- 
CAGCTGTTAT 
CCTTCAGGAT 
TTATAACCTT • 
GAGTCCAAGG 
CAATCAAGCT 
TGTTTTTACC 
TAACTGTTCT 
ATTGTTCAAG 
TATTTCTGAG 
CAATGATGAG 
TGACACTACA- 
TCCTGGGGCT 
AGCAATGGTT 
TTTAGATGCA 
TATTTTTTAT 
TTATTCTAAA 
TTTCCCTGTA 
TTGTAAATAA 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
• 900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 



Seq ID NO*. 261 Protein sequences 
Protein Accession #: NP_001202 



I 

MAAVKKEGGA 
AFEYEIRFYT 
FLNLWLKLGR 
QKAEPLERLQ 
PIIRVGGALK 
PRAKENELQA 
SINHILSTRK 
RKKLKEQREA 
QIASESQKIP 
NKSPPADPPR 
ICPNPEDTCD 
SQTLSIKKLS 
PWCSQYRRQL 
FWVAPRNSAE 
CFTLQDLLQH 
NKNNQALKIV 
HLLLFKEHLQ 
GVFDTTFQSH 



11 
I 

LSEAMSLEGD 
GNDPLDVWDR 
LCNEPLDMYS 
SQHRQFQARV 
APSQNRGIiQN 
GPWNTGRSLE 
PGKEEGDPLQ 
ELLTSAEKRA 
GMTLSSSVCQ 
VLAQRRPIiAV 
FARAARFVST 
PIIEDSREAT 
LKSLPELSAS 
LTVIKVSSQP 
SEYITHEITV 
DFSYSVDLRV 
VPWDGSFWKL 
LKKALWKVGK 



21 
1 

EWELSKENVQ. 
YISWTEQNYP 
YLHNQGIGVS 
SR0TLLALEK 
PFPQQMQNNS 
HRPRGNTASL 
RVQSHQQASE 
EMQKQIEEME 
VNCCARETSL 
LKTSESITSN 
PFHEIMSLKD 
HSSGFSGSSA 
AELCIEDRPM 
VPWDFYINLK 
LIIVNLLTIV 
QLDVFTLSGF 
SQNISELKDG 
LTSPGALLFQ 



31 

I 

PLRQGRIMST 
QGGKESNMST 
LAQFY1SWAE 



RITVFDENAD 
IAVPAVLPSF 
EKKEKMMYCK 
KKLKEIQTTQ 
AENIWQEQPH 
EDVSPDVCDE 
LPSDPERLLP 
SVASTSSIKC 
PKLEIEKEIE 
LKERLNEDFD 
EMLHKAEIVH 
RTVQILEGQK 
ELWNKPFVRI 



41 

I 

LQGALAOESA 
LLERAVBALQ 
EYEARENFRK 
VPQRSTLAEL 
EASTAELSKP 
TPYVEETAQQ 
EKIYAGVGEF 
QERTGDQQEE 
SKGPSVPFSZ 
FTGIEPLSED 
EEDLDVKTSE 
LQIPEKLELT 
LGNEDYCIKR 
HFCSCYQYQD 
GDLSPRCLIL 
ILANCSSPYQ 
LKANDEATVS 



51 
I 

CNNTLQQQKR 
GEKRYYSDPR 
ADAIFQEGIQ 
KSKGKKTARA 
TVQPWIAPPM 
PVMTPCKIEP 
SFEEIRAEVF 
TMPTKETTKL 
FDEFLLSEKK 
AIITGFRNVT 
DQQTACGTIY 
NETSENPTQS 
EYLICEDYKL 
GCIVWHQYIN 
RNRIHDPYDC 
VDIiFGIADLA 
VLGEIAAEMN 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



Seq ID NO: 262 DNA sequence 
Nucleic Acid Accession # S -NM_003784 
Coding sequences 365..1S07 



11 



41 



287 




60 

120 . . 



PCT7US02/12476 



ATTAGGTAGT GGTAAAACAG GCTCCCTTCG AAGCTCTCCT TCATCACCTT CCTAAGTGCA 180 

TGTACAGGGA AGCTCTCCTT ' CATCACCTTC CTAAGTGCAT GGGGGAAAAT ACCTAGGGCT 240 

CAACAGTCTT GAGAAGTGTG GAAACATTTT CTTTGTGAGT GAGAACAGAT- CACCTAGAGA 300 

: AAGGAAACCA GATTCCCATC ACTGCTTCTG GGTATCAGAT. GCTAGCGCTG CACTCCATTT 360 

TGCAATGGCC TCCCTTGCTG, CAGCAAATGC AGAGTTTTGC TTCAACCTGT- TCAGAGAGAT 420 

. GGATGACAAT - CAAGGAAATG- GAAATGTGTT CTTTTCCTCT CTGAGCCTCT TCGCTGCCCT 480 

GGCCCTGGTC CGCTTGGGCG CTCAAGATGA CTCCCTCTCT CAGATTGATA AGTTGCTTCA 540 

TGTTAACACT' GCCTCAGGAT ATGGAAACTC TTCTAATAGT CAGTCAGGGC TCCAGTCTCA . 600 

ACTGAAAAGA GTTTTTTCTG ATATAAATGC ATCCCACAAG GATTATGATC TCAGCATTGT 660 

GAATGGGCTT' TTTGCTGAAA AAGTGTATGG CTTTCATAAG GACTACATTG AGTGTGCCGA 720 

AAAATTATAC GATGCCAAAG TGGAGCGAGT TGACTTTACG AATCATTTAG ' AAGACACTAG 780 

ACGTAATATT- AATAAGTGGG TTGAAAATGA AACACATGGC AAAATCAAGA ACGTGATTGG 840 

TGAAGGTGGC ATAAGCTCAT CTGCTGTAAT GGTGCTGGTG AATGCTGTGT' ACTTCAAAGG 900 

CAAGTGGCAA- TCAGCCTTCA CCAAGAGCGA AACCATAAAT TGCCATTTCA: AATCTCCCAA. 960 

GTGCTCTGGG AAGGCAGTCG CCATGATGCA TCAGGAACGG AAGTTCAATT. TGTCTGTTAT 1020 

TGAGGACCCA TCAATGAAGA TTCTTGAGCT CAGATACAAT GGTGGCATAA ACATGTACGT 1080 

TCTGCTGCCT GAGAATGACC TCTCTGAAAT TGAAAACAAA CTGACCTTTC AGAATCTAAT 1140 

GGAATGGACC AATCCAAGGC GAATGACCTC TAAGTATGTT GAGGTATTTT TTCCTCAGTT 1200 

CAAGATAGAG: AAGAATTATG AAATGAAACA ATATTTGAGA GCCCTAGGGC TGAAAGATAT 1260 

CTTTGATGAA- TCCAAAGCAG ATCTCTCTGG GATTGCTTCG GGGGGTCGTC TGTATATATC 1320 

' AAGGATGATG CACAAATCTT ACATAGAGGT CACTGAGGAG GGCACCGAGG CTACTGCTGC 1380 

CACAGGAAGT. AATATTGTAG AAAAGCAACT CCCTCAGTCC ACGCTGTTTA GAGCTGACCA 1440 

CCCATTCCTA TTTGTTATCA : GGAAGGATGA CATCATCTTA TTCAGTGGCA AAGTTTCTTG . 1500 

CCCTTGAAAA TCCAATTGGT TTCTGTTATA GCAGTCCCCA CAACATCAAA GRACCACCAC 1560 

AAGTCAATAG ATYTGRGTTT AATTGGAAAA ATGTGGTGTT TCCTTTGAGT ■ TTATTTCTTC 1620 

CTAACATTGG TCAGCAGATG ACACTGGTGA- CTTGACCCTT CCTAGACACC TGGTTGATTG 16 BO 

TCCTGATCCC TGCTCTTAGC ATTCTACCAC CATGTGTCTC ACCCATTTCT' AATTTCATTG 1740 

TCTTTCTTCC CACGCTCATT TCTATCATTC TCCCCCATGA CCCGTCTGGA AATTATGGAG 1800 

RGTGCTCAAC TGGTAAGGAG AACGTAGAAG TAGCCCTAGG GATCCTTTTT GAAACT CTAC 1860 

AGTTATCGCA' GATATTCTAG' CTTCATTGTA AGCAATCTAG GAAATAAGCC CTGCTGCTTT 1920 

CTAGAAATAA GTGTGAAGGA TAAATTTTCT TTGTTGACCT ATGAAGATTT TAGAGTTTAC 1980 

CTTCATATGT TTGATTTTAA ATCAGTGTAT AATCTAGATG GTAAAAAATG TGA AATT GGG 2040 

ATTAGGGACC TACCAAAATA TTTCATTAAT GCTTTCAATT GACAAATTTT GGCCTTTCTT 2100 

TGATAAGACA- ATATGTACAT GTTTTTTCAA ATATTAAAGA TCTTTTAACT GTTGGCAGTT 2160 

GTTATCTACA GAATCATATT TCATATGCTG TGTAGTTTAT AAGTTTTTCC TCTATTTATC 2220 
AGAATAAAGA- AATACAACAT ACCTGTAAA 

Seq ID NO t 263 Protein sequence: 
Protein Accession ih NPJ303775 



1 11 21 31 41 .51 

I I I I ! 1 

MASLAAANAE FCPNLPREMD DNQGNGNVFP SSLSLPAALA LVRLGAQDDS LSQIDKLLHV 60 

NTASGYGNSS NSQSGU2SOL - KRVFSDINAS HKDYDLSIVN GLFAEKVYGP HKDYIECAEK 120 

LYDAKVERVD FTNHLEDTRR NINKWVENET KGKIKNVIGE GGISSSAVMV LVNAVYFKGK 180 

WQSAFTKSET INCHFKSPKC SGKAVAMMHQ ERKFNLSVIE DPSMKILELR YNGGINMYVL 240 

IiPENDLSEIE NKLTFQNLME WTNPRRMTSK YVEVFFPQPK IEKNYEMKQY LRALGLKDIF 300 

DESKADLSGI ASGGRLYISR MMHKSYIEVT EEGTEATAAT GSNIVEKQLP QSTLFRADHP 360 
FLFVIRKDDI ILFSGBCVSCP 

Seq ID NO: 264 DNA sequence 
Nucleic Acid Accession #: AB052906 
Coding sequence: 74-814 

1 11 21 31 41 51 

AAAACCTTGA GGTGATTCAT CTTCCAGGCT CTCCTTCCAT CAAGTCTCTC CTCCCTAGCG '60 

CTCTGGGTCC TTAATGGCAG CAGCCGCCGC TACCAAGATC CTTCTGTGCC TCCCGCTTCT 120 

GCTCCTGCTG TCCGGCTGGT CCCGGGCTGG GCGAGCCGAC CCTCACTCTC TTTGCTATGA 180 

CATCACCGTC ATCCCTAAGT TCAGACCTGG ACCACGGTGG TGTGCGGTTC AAGGCCAGGT. 240 

GGATGAAAAG ACT TTT CT T C ACTATGACTG TGGCAACAAG ACAGTCACAC CTGTCAGTCC 300 

CCTGGGGAAG AAACTAAATG TCACAACGGC CTGGAAAGCA CAGAACCCAG TACTGAGAGA 360 

GGTGGTGGAC ATACTTACAG AGCAACTGCG TGACATTCAG CTGGAGAATT ACACACCCAA 420 

GGAACCCCTC ACCCTGCAGG CCAGGATGTC TTGTGAGCAG AAAGCTGAAG GACACAGCAG 480 

■ TGGATCTTGG CAGTTCAGTT TCGATGGGCA GATCTTCCTC CTCTTTGACT CAGAGAAGAG 540 

AATGTGGACA ACGGTTCATC CTGGAGCCAG AAAGATGAAA GAAAAGTGGG AGAATGACAA 600 

GGTTGTGGCC ATGTCCTTCC ATTACTTCTC AATGGGAGAC TGTATAGGAT GGCTTGAGGA 660 

CTTCTTGATG GGCATGGACA GCACCCTGGA GCCAAGTGCA GGAGCACCAC TCGCCATGTC. 720 

CTCAGGCACA ACCCAACTCA GGGCCACAGC CACCACCCTC ATCCTTTGCT GCCTCCTCAT 780 

CATCCTCCCC TGCTTCATCC TCCCTGGCAT CTGAGGAGAG TCCTTTAGAG TGACAGGTTA 840 

AAGCTGATAC CAAAAGGCTC GTGTGAGCAC GGTCTTGATC AAACTCGCCC TTCTGTCTGG 900 

CCAGCTGCCC ACGACCTACG GTGTATGTCC AGTGGCCTCC AGCAGATCAT GATGACATCA 960 

TGGACCCAAT AGCTCATTCA CTGCCTTGAT TCCTTTTGCC AACAATTTTA CCAGCAGTTA 1020 

TACCTAACAT ATTATGCAAT TTTCTCTTGG TGCTACCTGA TGGAATTCCT GCACTTAAAG 1080 

TTCTGGCTGA CTAAACAAGA ' TATATCATTT TCTTTCTTCT CTTTTTGTTT GGAAAATCAA 1140 

GTACTTCTTT GAATGATGAT CTCTTTCTTG CAAATGATAT TGTCAGTAAA ATAATCACGT 1200 

TAGACTTCAG ACCTCTGGGG ATTCTTTCCG TGTCCTGAAA GAGAATTTTT AAATTATTTA 1260 

ATAAGAAAAA ATTTATATTA ATGATTGTTT CCTTTAGTAA TTTATTGTTC TGTACTGATA 1320 
TTTAAATAAA GAGTTCTATT TCCCAAAAAA AAAAAAAAAA A 

Seq ID NO: 265 Protein sequence: 
Protein Accession #: BAB61048.1 



288 



WO 02/086443 ■ ; PCT/US02/12476 

1 11 21 . 31 41 51 

MAAAAATKIL LCLPLLLLLS GWSRAGRADP HSLCYDITVI PKPRPGPRWC AVQGQVDEKT 60 

- FLHYDCGNKT VTPVSPLGKK . LNVTTAWKAQ NPVLREWDI LTEQLRDIQL ENYTPKEPLT 120 - . 

5 LQARMSCBQK AEC2ISSGSWQ FSPDGQIFLL FDSEKRMWTT VHPGAKKMKE . KWENDKWAM 180 - 

SPHYFSMGDC IGWLEDFLKG MDSTLBPSAG APLAMSSGTT QLRATATTLI LCCLIillLPC 240 

FILPGI ■ - - 

! • 

- Seq ID NO: 266- DNA sequence " ■ - ~ ' 

10 Nucleic Acid Accession #: XM_084B53.1 

Coding sequences 127-444 ■ ^ - 

1 . 11* 21 31 41 51 

1 5 ATTGATGATA TATTTAACGA AATCAAATTT GGTGAATATG TGGACACTGG AAAGCTAATC 60 

GACAAGATCA ACTTACGAGA . TTTCCT AAAA GTGTACCTTA ACCACAAGCC ACCTTTTGGT 120 

AACACCATGA GTGGCATCCA CAAGAGCTTT GAGGTGCTCG* GTTATACCAA CTCCAAAGGG- 180 - 

AAAAAGGCCA TTCGAAGAGA GGACTTCCTG AGACT GCTCG * TTACTAAAGG TGAGCATATG 240 

ACGGAGGAGG AGATGTTGGA; TTGCTTTGCT, TCACTGTTTG GCCTGAATCC CGAGGGATGG" 300 

20 AAATCCGAGC CTGCAACCTG" CTCCGTCAAA GGTTCAGAAA- TTTGCCTTGA" AGAAGAACTT- 360 

CCAGACGAAA TCACTGCAGA AATATTCGCG' ACTGAAATTC TTGGCTTAAC CATT TCAGA A" 420 , 

GATTCCGGCC AGGATGGTCA GTGAAGTTAC CAGGAATGTT TAAAGCACAA AGGACTTTGG 480 

GTGTGTGTGC ATGCACATGT GTGTGTTTTC CATGAGGCAC TGCTTTTTAT- GCATTTCCCT 540 
CCCCCCTCTC ATCTTTAGAA CATTTAGACA TTAAAGCAAG TTTCTGGTGA GCAATG 



25 



Seq ID NO: 267 Protein sequence: 
Protein Accession -8: XP_084853.1 



30 1 11 21 31 41 51 

MSGIHKSFEV LGYTNSKGKK AIRREDFLRL LVTKGEHMTE EEMLDCFASL FGLNPEGWKS 60 
EPATCSVKGS EICLEEELPD EITAEIFATE ILGLTISEDS GQDGQ 

35 Seq ID NO: 268 DNA sequence 

Nucleic Acid Accession »: NM_001898 
Coding sequence: 57-4 82 

! il 21 31 41 51 ■ ■ 

40 | | | I I I 

GGCTCTCACC CTCCTCTCCT GCAGCTCCAG CTTTGTGCTC TGCCTCTGAG GAGACCATGG 60 

CCCAGTATCT GAGTACCCTG CTGCTCCTGC TGGCCACCCT AGCTGTGGCC CTGGCCTGGA 120 

GCCCCAAGGA GGAGGATAGG'- ATAATCCCGG GTGGCATCTA TAACGCAGAC CTCAATGATG 180 

AGTGGGTACA GCGTGCCCTT CACTTCGCCA TCAGCGAGTA TAACAAGGCC ACCAAAGATG 240 

45 ACTACTACAG ACGTCCGCTG CGGGTACTAA GAGCCAGGCA ACAGACCGTT GGGGGGGTGA 300 

ATTACTTCTT CGACGTAGAG GTGGGCOGCA CCATATGTAC CAAGTCCCAG CCCAACTTGG' 360 

ACACCTGTGC CTTCCATGAA CAGCCAGAAC TGCAGAAGAA ACAGTTGTGC TCTTTCGAGA 420 

TCTACGAAGT- TCCCTGGGAG AACAGAAGGT CCCTGGTGAA ATCCAGGTGX CAAGAATCCT 480 

AGGGATCTGT GCCAGGCCAT TCGCACCAGC CACCACCCAC TCCCACCCCC TGTAGTGCTC 540 

50 CCACCCCTGG ACTGGTGGCC CCCACCCTGC GGGAGGCCTC. CCCATGTGCC TGCGCCAAGA 600 

GACAGACAGA GAAGGCTGCA GGAGTCCTTT GTTGCTCAGC AGGGCGCTCT GCCCTCCCTC 660 

CTTCCTTCTT GCTTCTAATA GCCCTGGTAC ATGGTACACA CCCCCCCACC TCCTGCAATT 720 
AAACAGTAGC ATCGCC 

55 Seq ID NO: 269 Protein sequence: 
Protein Accession ft:NP_001889.1 

1 11 21 31 41 51 

60 MAQYLSTLLL LLATLAVALA WSPKEEDRII PGGIYNADLN DEWVQRALHF AISEYNKATK 60 
DDYYRRPLRV LRARQQTVGG VNYFFDVEVG RTICTKSQPN LDTCAFHEQP ELQKKQLCSF 120 
EIYEVPWENR RSLVKSRCQE S 

Seq ID NO: 270 DNA sequence 
65 Nucleic Acid Accession #: XMJ)93210 
Coding sequence: 13-1854 

1 11 21 31 41 51 

70 ATGGCAAGCG CCGGAATCTC CTCAGCTGCC GTTTCACAAA AGAGGTACCA GGTCCGCACC 60 

AAACGAGCAC ACAAGCAGCA CCAGGAGCTG CAGAAGAAGG AGGCGGCAGC GATGGACCAG 120 

GGCAGAGGGA ATGGGGAGGG GGCATCCTAC CCCATATCTG AGGTGCGACT GCGGGACGTA 180 

GAGOGGACTG GGCCTTTCCC GTTGGCGCGT GGCCTCAATC AGGACTTCTT GCCCACGTGC 240 

GCCTTCAAAA CGGTAAGAGC TGCAACTGAA- CGTGTGAGAC ATGGTGCAGA TAGGCTGAGA • 300 

75 GGCGGCGGGA GAGATGCCCA TGAACTCAAG TACCCGGACA CGCCCTCCAC TTCTACCACC 360 

ACGAGTAACA CCGCCCCCAC GGGACCGCTC TCGAGGTCCC CCAAGCCAAG GACGCAAGGA 420 

GGAACGCCCC GGCGCGCGGC CAGCAGCGGC GGGCACCGGC CCAATGGCCA CGGAACTCAG 480 

CACTGGCAGT CGGCCCTCCT CACACCGCAG GCGTGCAGTG TGGCCGACGG AGCCTCCCGG- 540 

GCCGAGGACC CAGCTAGGCC GTCACCCCGG TTGCTCCCAC GGGAAGGGGC ACCAGGCAAA 600 

80 CTGCCCAAGG CCCCGAGCCC AGGCTCCCTG GCGGAGGCCT CCGCTGGTCC CGCCCAGATC 660 

ATGGCCGCCA CCAGGCTCCC GAGCCATGGC TTCCTGTCCG GGAACGGCCC GGCGTCCTGG 720 
CTGTCCAGCT AG ' 

Seq ID NO: 271 Protein sequence: 
85 protein Accession 8* XP_O93210 

1 11 21 31 41 51 



289 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 
I I " I I I I 

MLRKGEQKRK RARKXNDFLP TCAFKTVRAA TERVRHGADR LRGGGRDAHB LKYPDTPSTS 
TTTSNTAPTG PLSRSPKPRT QGGTPRRRPA AAGTRANGHG TQHWQSALLT PQACSVADGA 
SRAEDPARPS PRLLPREGAP GKLPKAPSPG SLAEASAGLL AHVRLQNADA QRVSISQALP 
PNSSVGRKEE RPGAGQQRRA PAPMATELST GSRPSSHRRR AVWPTEPPGP RTQLEPSPRL 
XiPREGAPGKL PKAPSPGSIA EASAGPAQIM AATRLPSRGP LSGNGPASWL SS 

Seq ID NO: 272 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 1..732 



PCT/US02/12476 



i 
I 

GGATACTGTG 
TGAAAAAGCT 
TAATGTGGAG 
ATACCCACTT- 
ATGATTTTGT 
TAAATTATTT 
TTAGTATCAC 
AAAATTGCAG 
TTAAGCC 



11 
I 

TCACTCAAAG 
TTTTTTCCCA- 
GAAATTATTC 
GAAGCCTCTG 
CTTGTTTCTG 
TTATTTATCT 
AATTTATGGG 
AAGTCATAGG 



21 

I. 

TAATGGGAGG 
CTTTTAACTT 
TTTCTCATTG. 
TAGAAATGTC 
CAGTGAGAAA : 
TTCATATAGT- 
AGAGGGTTTT 
ACTGTCATGT 



31 
I 

GAGAGAGAAC 
GCTTTAGCGT 
GAGATTACAG 
TCGTCCTCCG 
TTACATCCAT 
TCTTACAATT. 
TTGTATTTTT 
ATTGCAGCTC 



41 
I 

AGGGAGGGTA 
TAAGAGTACT 
AATATATCTA 
GTTGTATTTC 
AGCAAAGACA 
TCTAAAAAAT 
AAGCATATGT 
TGAGAACCAA 



Seq ID NO: 273 Protein sequence: 
Protein Accession ft: Eos sequence 

1 11 21 31 

i 11 1 

MGGRENREGR DAFBKAPFPT FNIiL 

Seq ID NO: 274 DNA sequence 

Nucleic Acid Accession ft: NM_003976.2 

Coding sequence: 299-961 



41 
I 



51 



1 
I 

CTCTGAGCTT 
CATGGAGTTG 
CTACTTCTGC 
GGGTGGCAGG 
CAGGAGGGTG 
GGAACTTGGA 
TGCCCTGTGG 
GGGCTCCGCG 
CGCCGGCCAC 
GCCGOCGCAG. 
CGGGGGCCGC 
GGGCTGCCGC 
CGACGAGCTG. 
CGACCTCAGC 
GCCCGTCAGC 
CAACAGCACC 
AGGGCTCGCT 
CCTCCOGCAG 
AGGCCCCTAC 
CAGCCCCAGA 
GGAGCCCTTC 
CCCTCCTCTG 
ACAGCATTTG- 
CCTGTACTCA 



11 
I 

CTCTGAGCCT 
TGAAAGAATA 
TGGGTTGAGT 
CCGGTCCCCC 
GGGGAACAGC 
CTTGGAGGCC 
CCCACCCTGG 
CCCCGCAGCC 
CTGCCGGGGG 
CCTTCTCGGC 
GCGGCGCGGG 
CTGCGCTCGC 
GTGCGTTTCC 
CTGGCCAGCC 
CAGCCCTGCT 
TGGAGAACCG 
CCAGGGCTTT 
AGTCCCACTA 
CGGTGGGTGA 
GCCCTCACCC 
GGACCCACTT 
ATGAACACTA 
AAGGACACAT 
CTCATGGGAG 



21 
I 

TGTTTGCTCA 
GCTGCAAAGC 
CT AG CTGTGT 
ACAAAAGATA- 
TCAACAATGG 
TCTCCACGCT 
CCGCTCTGGC 
CTGCCCCCCG 
GACGCACGGC 
CCGCGCCCCC 
CTGGGGGCCC 
AGCTGGTGCC 
GCTTCTGCAG 
TACTGGGCGC 
GCCGACCCAC 
TGGACCGCCT 
GCAGACTGGA 
GCCAGCGGCC 
TGGATATCAT 
TGCGGATCCC 
CTCACAGACT 
CAGTGGCTGA 
ATTGCAGTTG 
CTGGCCCC 



31 

I 

TCTGGAAAAA 
ACCTAACACA 
AGGCCCCTTG 
ACTCATCTCT 
CTGATGGGCG. 
GTCCCACTGC 
TCTGCTGAGC 
CGAAGGCCCC 
CCGCTGGTGC 
GCCGCCTGCA 
GGGCAGCCGC. 
GGTGCGCGCG 
CGGCTCCTGC 
CGGGGCCCTG 
GCGCTACGAA 
CTCCGCCACC 
CCCTTACCGG 
TCAGCCAGGG 
CCCCGAACAG 
AGCCTAAAAG 
CTGGCACTGG" 
GGCATCAGCC 
CTTGGTTGAA 



41 
I 

GGGGATTAAA 
TAGTAAGGTT 
TTCCTCACCT 
TAATTTGCAA 
CTCCTGGTGTi 
CCCTGGCCTA: 
AGCGTCGCAG 
CCGCCTGTCC 
AGTGGAAGAG 
CCCCCATCTG, 
.GCTCGGGCAG: 
CTCGGCCTGG 
CGCCGCGCGC 
CGACCGCCCC 
GCGGTCTCCT 
GCCTGCGGCT 
TGGCTCTTCC 
ACGAAGGCCT 
GTGAAGGGAC 
ACACCAGAGA 
CCAGGCCTCG 
CCCGCCCAGG 
AGTGCCTGTG 



51 
I 

CCATTTACCT 
CCCAGTGCAG 
GGAGAAACTG 
GCTGCCTCAA 
TGATAGAGAT 
GGCGGCAGCC 
AGGCCTCCCT 
TGGCGTCCCC 
CCCGGCGGCC 
CTCTTCCCCG 
CGGGGGCGCG 
GCCACCGCTC 
GCTCTCCACA- 
OGGGCTCCCG 
TCATGGACGT 
GCCTGGGCTG; 
TGCCTGGGAC 
CAAAGCTGAG 
AACTGACTAG 
CCTCAGCTAT 
AACCTGGGAC 
CCCTGTAGGG 
CTGGAACTGG 



Seq ID NO: 275 Protein sequence: 
Protein Accession ft: NP_003967.1 



11 



51 



21 31 41 

I I I I I I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGS APRS PAP REGPPPVLAS 
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 
RPVSQPCCRP TRYEAVSPMD VNSTWRTVDR LSATACGCLG 

Seq ID NO: 276 DNA sequence 

Nucleic Acid Accession ft: NM_057091.1 

Coding sequence: .783-1445 



I 

ACTGGCCGCT 
. GGACCCCCAA 
TCGCTCCCCG 
CGCGTGTCTA 
CTCCATATCC 
CAAGCTAGGG 
CGGGGCAGGG 
CACCGGACGG 
CAGACAAGGC 



11 

I • 

GAGAGAAGAA 
ATCTGCACGT 
CCCTCACTCA 
CAAACTCAAC 
GAGGGGCCCC 
GGGACTGGAT 
GCGCTCCCAG 
CTGCGGCGGC 
CCGGGGGCTC 



21 
I 

TCGGGTGGAG 
ACCAGCAGTC 
CTTTCTCCCG 
TCCCGGTTTC 
TCOCAGCATC 
CCGACGGGTG 
CCCCACCCCG 
GGGCAGGAGG 
CGCCAGCAGC 



31 
I 

CAGAGAGCAG 
AGCCGCCCCA 
CCCTCGGCCC 
CGTGCCTCTC 
TACCCCCCTC 
GAGCAGCCAG 
GGATCTGGTG 
CTGCTGAGGG 
AGGTCCCTCG 



41 
I 

CTGCTGCAGG 
CGCAGGGACC 
GGCCTCCCAG 
CACCGCTCGA 
CCAACCTCGG 
GTGAGCCCCG 
ACGCTGGGGC 
ATGGAGTTGG 
GGCCCCAGCC 



51 
I 

GCAGACAGCC 
GGCTTACCCC 
CTCTCTACTT 
GTTCTCTACT 
GGGACCTAGC 
AAAGGTGGGG 
TGGAATTTGA 
GCCCGGCCCC 
CTCGCTGCCA 



60 
120 
180 
240 



51 
I 

GGGATGCTTT 
TACCAGCTAA 
TTCATCTTGA 
TAAAACCTAC. 
AAAGTCTTTT 
TAACACTCAT 
GGCTTATATA 
TGCCTGAAAC 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 



290 



WO 02/086443. 

CCCGGGCCTG GAGCCCCACA CCCGAGGGTG CAGACTGGCT GCCAAGGCCA CACTTTTGGC 600 

TAAAAGAGGC ACTGCCAGGT GTACAGTCCT GGGCATGCGC TGTTTGAGCT TCGGGGGAGA 660 

GCCCAGCACT GGTCCCCGGA AAGGTGCCTA GAAGAACAAG GTGCAGGACC CCGTGCTGCC 720 

TCAACAGGAG'- GGTGGGGGAA CAGCTCAACA ATGGCTGATG GGCGCTCCTG GTGTTGATAG 780 

AGATGGAACT TGGACTTGGA GGCCTCTCCA CGCTGTCCCA CTGCCCCTGG CCTAGGCGGC 840 

AGCCTGCCCT GTGGCCCACC CTGGCCGCTC TGGCTCTGCT: GAGCAGCGTC GCAGAGGCCT 900 

CCCTGGGCTC CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT 960 

CCCCCGCCGG CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC 1020 

GGCCGCCGCC GCAGCCTTCT CGGCCOGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC 1080 

CCCGCGGGGG CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG. 1140 

CGCGGGGCTG CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC 1200 

GCTCCGACGA : GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC 1260 

CACACGACCT CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT 1320 

CCOGGCCCGT CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG 1380. 
ACGTCAACAG CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG - 1440 

' GCTGAGGGCT CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG 1500 

GGACCCTCCC GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC- 1560 

TGAGAGGCCG CTACCGGTGG GTGATGGATA .TCATCCCCGA ACAGGTGAAG GGACAACTGA 1620 
CTAGCAGCCC: CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAGV 1680 

CTATGGAGCC CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG" 1740 

GGACCCCTCC TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT 1800 

AGGGACAGCA' TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA 1860 
CTGGCCTGTA CTCACTCATG GGAGCTGGCC CC 

Seq ID NO: 277 Protein sequence: 
Protein Accession #: NP_003967.1 

! 11 21 31 41 51 

MELGLGGL5T-. LSHCPWPRRQ PALWPTIAAb ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60 

PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120 

RGCRLRSQLV PVRALGLGHR SDELVRFRPC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 

Seq ID NO: 278 DNA sequence 

Nucleic Acid Accession #: NM_057160.1 

Coding sequences 1-714 

.1 . 11 21 31 41 SI 

ATGCCCGGCC TGATCTCAGC CCGAGGACAG CCCCTCCTTG AGGTCCTTCC TCCCCAAGCC 60 

CACCTGGGTG CCCTCTTTCT CCCTGAGGCT CCACTTGGTC TCTCCGCGCA GCCTGCCCTG 120 

TGGCCCACCC TGGCCGCTCT GGCTCTGCTG AGCAGCGTCG CAGAGGCCTC CCTGGGCTCC 180 

GCGCCCCGCA GCCCTGCCCC CCGCGAAGGC CCCCCGCCTG. TCCTGGCGTC CCCCGCCGGC 240 

CACCTGCCGG GGGGACGCAC GGCCCGCTGG TGCAGTGGAA GAGCCCGGCG GCCGCCGCCG 300 

CAGCCTTCTC GGCCCGCGCC CCCGCCGCCT GCACCCCCAT CTGCTCTTCC CCGCGGGGGC 3 60 

CGCGCGGCGC GGGCTGGGGG CCCGGGCAGC CGCGCTCGGG CAGCGGGGGC GCGGGGCTGC 420 

CGCCTGCGCT- CGCAGCTGGT GCCGGTGCGC GCGCTCGGCC TGGGCCACCG CTCCGACGAG 480 

CTGGTGCGTT TCCGCTTCTG CAGCGGCTCC TGCCGCCGCG CGCGCTCTCC ACACGACCTC 540 

AGCCTGGCCA GCCTACTGGG CGCCGGGGCC CTGCGACCGC CCCCGGGCTC CCGGCCCGTC 600 

AGCCAGCCCT- GCTGCCGACC CACGCGCTAC GAAGCGGTCT CCTTCATGGA CGTCAACAGC 660 

ACCTGGAGAA CCGTGGACCG CCTCTCCGCC ACCGCCTGCG GCTGCCTGGG CTGAGGGCTC 720 

GCTCCAGGGC TTTGCAGACT. GGACCCTTAC CGGTGGCTCT TCCTGCCTGG GACCCTCCCG 780 

CAGAGTCCCA CTAGCCAGCG GCCTCAGCCA GGGACGAAGG CCTCAAAGCT GAGAGGCCCC 840 

TACCGGTGGG TGATGGATAT CATCCCCGAA CAGGTGAAGG GACAACTGAC TAGCAGCCCC 900 

AGAGCCCTCA CCCTGCGGAT CCCAGCCTAA AAGACACCAG AGACCTCAGC TATGGAGCCC 960 

TTCGGACCCA CTTCTCACAG ACTCTGGCAC TGGCCAGGCC TCGAACCTGG GACCCCTCCT 1020 

CTGATGAACA CTACAGTGGC TGAGGCATCA GCCCCCGCCC AGGCCCTGTA GGGACAGCAT 1080 

TTGAAGGACA CATATTGCAG TTGCTTGGTT GAAAGTGCCT GTGCTGGAAC TGGCCTGTAC 1140 

TCACTCATGG GAGCTGGCCC C 

Seq ID NO: 279 Protein sequence: 
Protein Accession «: NP_476501.l 

1 11 f f I 1 I 1 

MPGLISARGQ PLLEVLPPQA HLGALFLPEA PLGLSAQPAL WPTLAALALL SSVAEASLGS 60 

APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSALPRGG 120 

RAARAGGPGS RARAAGARGC RLRSQLVPVR ALGLGHRSDE LVRFRFCSGS CRRARSPHDL 180 
SLASLLGAGA LRPPPGSRPV SQPCCRPTRY EAVSFMDVNS TWRTVDRLSA TACGCU3 

Seq ID NO: 280 DNA sequence 

Nucleic Acid Accession #: NM_057090.1 

Coding sequence: 29-715 

1 11 21 31 j 1 j 1 

CTGATGGGCG CTCCTGGTGT TGATAGAGAT GGAACTTGGA CTTGGAGGCC TCTCCACGCT 60 

GTCCCACTGC CCCTGGCCTA GGCGGCAGGC TCCACTTGGT CTCTCCGCGC AGCCTGCCCT 120 

" GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT CCCTGGGCTC 180 

. CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT CCCCCGCCGG 240 

CCACCTGCCG GGGGGACGCA ' CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC GGCCGCCGCC 300 

GCAGCCTTCT CGGCCOGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC CCCGCGGGGG 360 

CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG CGCGGGGCTG 420 

CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC GCTCCGACGA 480 

GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC CACACGACCT 540 

CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT CCCGGCCCGT 600 

CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG ACGTCAACAG 660 



291 



CACCGCCTGC 
CCGGTGGCTC 
AGGGACGAAG 
ACAGGTGAAG 
AAAGACACCA 
CTGGCCAGGC 
AGCCCCCGCC 
TGAAAGTGCC 



GGCTGCCTGG 
TTCCTGCCTG 
GCCTCAAAGC 
GGACAACTGA 
GAGACCTCAG 
CTCGAACCTG 
CAGGCCCTGT 
TGTGCTGGAA 



GCTGAGGGCT 
GGACCCTCCC 
TGAGAGGCCC 
CTAGCAGCCC 
CTATGGAGCC 
GGACCCCTCC 
AGGGACAGCA 
CTGGCCTGTA 



WO 02/086443 

CACCTGGAGA" ACCGTGGACC GCCTC TCCGC 
. CGCTCCAGGG CTTTGCAGAC TGGACCCTTA 
. GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC 

CTACCGGTGG GTGATGGATA 1 TCATCCCOGA 

CAGAGCCCTC AOCCTGCGGA TCCCAGCCTA 

CTTCGGACCC ACTTCTCACA GACTCIGGCA 

TCTGATGAAC ACTACAGTGG CTGAGGCATC 

TTTGAAGGAC" ACATATTGCA GTTGCTTGGT 

CTCACTCATG GGAGCTGGCC CC 

Seq ID NO: 281" Protein sequence: 
Protein Accession ft: NP_47643l.l ■ 

1 H \. 21 31 41 SI 

I I - I I I I 

MELGLGGLST LSHCPWPRRQ APLGLSAQPA UflPTLAALAI* LSSVAEASLG SAPRSPAPRE 
GPPP.VLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG. 
SRARAAGARG. CRLRSQfcVPV RALGLGHRSD ELVRFRFCSG SCRRARSPKD LSLASLIX3AG 
AliRPPPGSRP VSQPCCRPTR YEAVSFMDVN STWRTVDRLS ATACGCLG 



720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 



Seq ID NO r 282- DMA sequence 

Nucleic Acid Accession &: Eos sequence 



11 



CTACTGCACC 
ATAGCAATTT. 
CCGTGAACCT 
AGGTAACGTTv 
GATCTAATCA 
GCTAAAAATG 
TATTCTTCAT- 
CACrCTTAAG, 
GAATCTCAGG 



TGCCCTCTGT 
CCTTTGGTTT. 
TATGAATGCT 
TTCTTTTGCT; 
ATGATCTAGT 
AACTTGAAAC 
CCTGGCTGTA 
GATTTTATGT 
ATACCTACAT 



21 
I 

TTCCTTTGGA 
TTAAGACTTC 
GCTTAAAAAT 
CTCATCTTGG 
CAGAAGCTAC 
ACGGAAGTAG 
ATAATAGCCA 
GTATTATTCA 
TTATCACTTT 



31 

I 

AATCTCTTAC 
TACATTGCTT 
AATGTCAAAA 
TTTCCATATA 
TTCACTGGCT. 
TGGTTGGTCC 
TTATTTGTTA 
AATTGCTATT 
TTCAATATAT 



41 
I 

CTTTCATTAG 
TTTCTTTTAT 
TATGTTTTAG 
CTATTTTTGG 
AACAGTGATC 
AGTTTGAAAG 
TGCCTTTGTT 
ACTGTTCTTT 
ATGTATTTCT 



SI 
I 

GGTTTCTTTC 

tatctgtgct 
ctgcctactc 
ttttttgtga 
atgttcatgt 
ctcttattag 
atgtagcaga 

TTATAGTTGA 
TATT 



60 
120 
180 
240 
300 
360 
420. 
480 



Seq ID NO: 283 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 564-1481 



1 - 
1 

GAGACTTTTA" 
CAAAGAGGTC; 
GAAGTGCTAC 
ACTTGGATCA 
CGACOGATGC 
CTTTCCCTGA 
GCAAACAAGG- 
GGGGAGACAC 
AAGGGCTGGC: 
CCTCCCGGCC 
GCCTGGCGCG 
ACTTGGCGGG- 
CCGGTTTCTC 
CTATGACGGG 
CGCTGAGCTG 
AGTTTGACGA 
TGGATGAAGA 
AGGGGTGCGC 
GCTTACCAGA 
AACGTGACCG 
AAGAAATGGA 
TTCAGAAAAA 
AGGAAAAAGC 
ATCAAGAATG 
AAAACAACAG 
ATGGTTGGAA 
TGGAAAACTT 
AATTCCGTGG 
GAAGAGTAAA 
AGCCAGGAGC 
AATAACATGC; 
CAGTCAATAA 
TGGAGTCCTT 
GTTGATCTTG 
ATTTAGCTTG 



11 

I 

ATCATCTATC 
AAAAATTCAC 
CAAAACACGC 
AGTTCCCTCC 
TCACAATTCT. 
TTCAGTGGCA- 
GACAGGCCCT 
CGTACTAAAC 
TAAGGGAGGC 
TCCACGCGTC 
CGCAGGGCCC 
CCAGATGTGG 
TGCTGACTGC 
CGCACGTGGG 
CAGCAATTCC 
GGACGACGGT 
TGCCCATGAT. 
CAGCACTCAG 
AAGCCGCCTG 
GCTGCAACTG 
AGAACGTGAA 
GAATGAGCAA 
AGCAAAGGAA 
GTTAAAGAAA 
CAAGCTGAAA 
AATGCGAAAC 
ACAGGTTTTT 
AAACCAATTC 
AGACCTGTGA 
AATCTTTGCC 
TTTTATCTGG 
CTCAACACTT 
AGAGTTGAGG 
GCATGTTGTT 
TATTAAAAGT 



21 
I 

CCTTGTGCTT 
ATGTGTAGAC 
AATGACTGTC 
CCTCTCCTCA 
GACCTCGTAA 
GGTAACATATV 
CAAAGTTGTC 
AAGCTTGCAA- 
CGGGGCGGAG 
GCCATGGCAA 
GCTGGGAAAG 
GCGGCGGGGC 
AGACCCAGGT 
CAGGGGCTGG; 
ACCAGGTCGC 
GACGGGGAGG 
TCAGAGGCCA 
GTTGAATCAG 
ACACCATGGG 
AAAGCTCTAG 
AAAAGAAAGA 
AAAAGAAAAG 
CTGGAGAAAG 
AAAAATGCTG 
TACAGGAGAA- 
ATAAACCTCG 
ACAGTGGAAA 
ATATGCCACC 
TAAGTCAGCC 
TTGGAACTCT 
AGCTATTTAA 
AATGTGATTA 
AAGATATTTT- 
TTGCAGAATA 
ACACTGTAAT 



31 
I 

TACGCAGACC 
AAATTAGGTC 
CTAAAAGTGC. 
AAATATATCG 
TTATATAGGG 
TTCATGTACA 
GGTAGGGAGC 
ACAGCAGGCA 
GAAGCCAAGC 
CGGGGGGTCT 
CGCGTCCCCG 
GCTGGGGGCC 
CTCGGCCCTC 
AGGTGGTGCG- 
TGTTGTCTCC 
ATGAGGAAGA 
AAGTGGCGAG 
AAAATAACCA 
AGGTGTGGTT 
AGGAATTAAA 
TAATTGCTGA 
AAAGAGAACA 
AATACTTGCA" 
AAGAATGTGA 
AAAGGAAATA 
TCCAGCTGCA 
TTCCTATCCA 
TCCCAAAGAA 
ACACAAGTCA: 
GTGCAGAATA 
TTTAAAAATC 
TTGACAAATA 
CTGGATTTTG 
AGTGGCTGAA 
ACCAATAAAA 



41 
I 

CTACAATACA 
CCTTAAGATG 
GTTCTGGGAT 
ACTTGTGCTG.; 
GGTGGTTTTG 
AAATGAACTG 
CAGGACCCCG; 
CCTTCCTGCC 
TCTGCAGGCC 
GTGCTGGCCG 
CCGCGGCTCC 
TACTTTTCCC 
CTCGGACTCC 
CTCGCCGTCG 
CCTTGGCCAC 
CGTGGATGAT 
CCTGAGAGGA 
AGAAGAACAG 
TATTGGCAAA 
TCAACAACTA 
AGAAAAGCAC 
AAAAATTAAT 
AGAAAAAGCA 
GAGGAAGAAG 
GCAGAAAAAA 
AAGAGCTATG. 
GAACCAGCCT 
GCTAAGGATC. 

tcatctctgg: 

CAAAGATAGC 
AGAAATTGTT 
GCAATTTTTG 
GTTTTTATAA 
TATGTAAGAA 
CTAACAATTT 



51 

! 

CTAGAGGCTT 
CCAGGCAAAC 
ACACCTGTAA 
AAAGAAATCA- 
GTTTCTGCGT 

caacaccacg 
ccagtggcgt: 

ACTGAGGAGG 
CTGACAAAGT- 
GGATTGGCCG' 
GCCAGTTTGA- 
TCTTCCTACG 
TGCTCAGTCC. 
CCGCCGCTGC 
CAGAGCTTCC. 
GAGGAAGACG 
ATGGAGTTAC 
AAACAGGTGC 
GAAAAAGAAG 
GAAAAAAGAA 
AAGGAATGGG 
AAAGAAATGG 
AAAGAAAAAT 
AAAGAAAAGA- 
AGTTTCAAGA 
GTTATGCCAA 
TTTATAATCC 
TATCAGGAAG 
TAATTCATAA 
GTATGTGGAA- 
TTTTACTGCT 
CATTTGTATA 
ACTTTTTAAG 
TTGTGTTTGT 
TTCTTG 



60 
120 
180 
. 240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680- 
1740 
1600 
1860 
1920 
1980 
2040 



Seq ID NO: 284 Protein sequence: 
Protein Accession Eos sequence 



I 11-21 31 41 SI 

I i i • • I l I 

MATRGLCWPG LAGLARAGPA GKARPRRGSA SLNIAGQMWA AGRWGPTFPS SYAGFSADCR 
PRSRPSSDSC SVPMTGARGQ GLEWRSPSP PLPLSCSNST RSLLSPLGHQ SFQFDEDDGD 
GEDEEDVDDE EDVDEDAHDS EAKVASLRGM BLQGCASTQV ESENNQEEQK QVRIiPESRLT 
PWEVWFIGKE KEERDRLQUK ALEELNQQIiE KRKEMEEREK RKIIAEEKHK EWVQKKNEQK 
RKEREQKINK EMEEKAAKEL EKEYLQEKAK EKYQEWLKKK NAEECERKKK EKKNNSKXKY 



60 
120 
180 
240 
300 
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WO 02/086443 * \\ ■ 

- RRKRK " " ' " . ■ 

Seq ID NO: 285 DNA sequence • • m 

Nucleic' Acid; Accession ft : Eos sequence ■ 
Coding sequences 1-1746 

i ■ M 2 . x • 31 I 1 j 1 . ':. 

ATGCCACTGA AGCATTATCT CCTTTTGCTG GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG; 60 

GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGG CCTCCCAGGT. GGAGTGCACC 120 

GGGGCACGCA. TTGTGGCGGT GCCCACCCCT CTGCCCTGGA ACGCCATGAG CCTGCAGATC 180 

CTCAACACGC ACATCACTGA ACTCAATGAG TCCCCGTTCC TCAATATCTC AGCCCTCATC 240 

GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG- CGCATCACGC CTGGGGCCTT;, CCGAAACCTG 300 

GGCTCGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC 360 

TTCCAGGGCC ■ TGGACAGCCT TGAGTCTCTC CTTCTGTCCA* GTAACCAGCT: GTTGCAGATC 420 

CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 480 

• CTGGAATACA TCCCTGACGG AGCCTTCGAC CACCTGGTAG GACTCACGAA^ GCTCAATCTG 540 

GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG r . CAATCTCCAG* 600 

GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT: TGATGGGCTT. 660 

GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC 720 

TTCCACAACA ACCACAACCT. CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780 

CCACCCAGCA TCTTCATGCA GCTGCCCCAG- CTCAACCGTC. TTACTCTCTT TGGGAATTCC 840 

CTCAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA- GCTTTGGCTC 900 

TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTGCAG 960 

GTCCTGATTC TTAGCCGCAA TCAGATCAGC TTCATCTCCC CGGGTGCCTT CAACGGGCTA 1020 

ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGGACCTGGA CGGGAATGTC 1080 

TTCCGCATGT TGGCCAACCT GCAGAACATC. TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140 

CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA' TCCAGCTGCA GAACAACCAG 1200 

CTGGAGAACT TGCCCCTCGG' CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGCGGCTG, 1260 

TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC TCCGCAACTG GCTCCTGCTC 1320 

AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT- TCAGCCCAGC CAATGTCCGA 1380 

GGCCAGTCCC TCATTATCAT CAATGTCAAC. GTTGCTGTTC- CAAGCGTCCA: TGTCCCTGAG 1440 

GTGCCTAGTT ACCCAGAAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 1500 

TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG . ACTACACTGA TCTGACTACC 1560 

ATTCAGGTCA- CTGATGACCG ■ CAGCGTTTGG GGCATGACCC. AGGCCCAGAG . CGGGCTGGCC 1620 

ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 1680 

TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT ' GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740 

TGTTAAAGAG GCAGGCTGGA GCAGGGCTGG - GGAATGATGG GACTGGAGGA CCTGGGAATT 1800 

TCATCTTTCT. GCCTCCACCC CTGGGTCCAT GGAGCTTTCC CGTGATTGCT CTTTCTGGCC ' 1860 

CTAGATAAAG GTGTGCCT AC CTCTTCCTGA CTTGCCTGAT TCTCCCGTAG ' AGAAGCAGGT 1920 

CGTGCCGGAC CTTCCTACAA TCAGGAAGAT AGATCCAACT GGCCATGGCA AAAGCCCTGG- 1980 

GGATTTCCGA TTCATACCCC TGGGCTTCCT TCGAGAGGGC TCTTCCTCCA AATCCTCCCC 2040 

ACCTGTCCTC CAAGAACAGC CTTCCCTGCG. CCCAGGCCCC CTCCGGGCCT CTGTAGACTC 2100 

AGTTAGTCCA CAGCCTGCTC ACTTCGTGGG AATAGTTCTC CGCTGAGATA GCCCCTCTCG 2160 

CCTAAGTATT ATGTAAGTTG ATTTCCCTTC TTTTGTTTCT CTTGTTTGTG CTATGGCTTG 2220 

ACCCAGCATG TCCCCTCAAA TGAAAGTTCT.' CCCCTTGATT TTCTGCTCCT GAAGGCAGGG 2280 

TGAGTTCTCT CCTCAAAGAA GACTTCAAAC CATTTAACTG GTTTCTTAAG AGCCGTCAAT 2340 

CAGCCTGGTT TTGGGGATGC TATGAAAGAG AGAAGGAAAA' TCATGCCGCT CAGTTCCTGG 2400 

AGACAGAAGA GCCGTCATCA GTGTCTCACT TGTGATTTTT ATCTGGAAAA GGAAGAAACA 2460 

CCCCAGCACA GCAAGCTCAG CCTTTTAGAG AAGGATATTT CCAAACTGCA* AACTTTGCTT 2520 

TGAAAAGTTT AGCCCTTTAA GGAATGAAAT CATGTAGAAT TTTGGACTTC TAAAAACATT 2580 

. AAAATCAGCT TATTAATACG. GGATAGAGAA AGAAATCTGG TGCCTGGGGG TCCCTGTGTT 2640 

CACCCCTAGA GTTTGTTTTA> AAATTTTTAA TTGAAGCATG. TGAAGTGTAC STGCAGAAAA 2700 

GTGGGAACAT GATAGTGTAT GGCTTGGTGG ATTTTCACAA ACTGAACATA CCTGTGTAAT 2760 

CAGCATCTAG ACCCAGACCC AGAGCATCAC AAATATCCCC CATCCTGGGC TTTTCCCAGA 2820 

GGAGATGGGG GCTTCTGAAG ATGGACTTAC CTGGGACCTG'/ CCCCCCATGA : GCCAGGACGG - 2880 

TCCCCCCACA GTCAGCCTGT GCAAAGGCCC OGTGGCCAGG GGTGGAGGAG- AATATGTGGG 2940 

TGTGGACAGG ATGGGAGACT GTGGCCTGAA" CAGGAGATTTS TATTATATCT GGAGACCCTG 3000 

AGAGACCCTG AGACCTGGGG CACCATGGCT GGCCAGGTCA GAAGCATCCT GACTGCAGAG 3060 

GTCCGTGCAG CCACACCCTC TTCCCTGCCA GCAAGTTGTC TGCGGCTCAT CGGAGGCCCC 3120 

TCCGCCTGGA GCCTTCTATG GACGTGATAT GCCTGTATCT GTTTTTAATT TTCATTCTTC 3180 

ACTTAGGGGA AGTGAAATCG CTCAGAGATG AGATCCTTTA ATTGAAAACG AAGTGTAACG 3240 

GAATCTAGTG TCTTTCTAAT GTGGTAAAAT TCTCCATCAA CATCACAGTC AGCTGGCAGC 3300 

TGAACTTCAG AATCTCACTT ACAGCAGGCG ACACGGGGGT ACACCGATGG GTCACACTGG 3360 

GTCTGGGGGC TCCCTGGAGC TCCTCCTGCG TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 3420 

TCCAGGGTTA TTCTCCTCCT CGAGTCACAG TCACACGAAT ACCTGCCTTC TCTGGCTTTC 3480 

CTGCTATACA CATATTCACA TGGCGCTCAA GAAGTTAGGC TCATGGCAAC GTGTGTCTTT 3540 

CTCTGGACAA CTGGCCCAGT TTACAGTGAA ATGGAGAATT' TCAGGTCTCC ACGTCTGCCC 3600 

AGGAAAGAAC TTCAGCTGAC TCCACGGGGA TCTGGAAATC CACGACCAAT CCCGATCGGC 3660 

TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGCTTTGG AAATCCACCA CCAATCCCGA 3720 

TCGGCTCTTA TTAGCTCCCC GCTCCACAAG ACACCTGTGA TCTGGAAATC TACCACCAAT 3780 

CCCGATCGGC TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGACATCC TCCAGGGCCA 3840 

CAGGAGCACG TGCTGACCAG TTTTCCCTTC CAGTTCCTGC ACAAAAAGTG TCCAGAGGGC 3900 

. TGTTTGCAAA CACTAGTGCA CTTTGTAGCT TTTCACCCTC TGTCCCAGGG AATCTAGGAG 3 960 

AGATGAGGCC CGTCAGAGTC AAGAGATGTC ATCCCCCCAG GGTCTCCAAG GCATTTCCAC 4020 

ACTATTGGTG GCACCTGGAG GACATGCACC AAGGCTTGCC AGAGCCAACA GGAAGTGAGC 4080 

CCAGAGCATG GCACATGAGC ATCACCCGCT GATGGTGGCC TGCTGTGCCT GGTGCCAACA 4140 

GGGGCATCCC GGCCCGTACC CCTCCAGACA- GGAAGCATGG GTTTGCCCAC AGACCTGTCG 4200 

GGTGCTCCTG TGAGTGGCCT CCAGATGTCT TTGTGCATAG GCACAAGTGG GCCAGGGCTG 4260 

GAGGGAGGTG GGAAACCTCA TCATCCGGTG GGCCCTGCCA ATCTTAACCC AGAACCCTTA 4320 

GGTATTCCTG GCAGTAGCCA TGACATTGGA GCACCTTCCT CTCCAGCCAG AGGCTGACCT 4380 

GAGGGCCACT GTCCTCAGAT GACACCACCC AGGAGCACCC TAGGTGAGGG GTGAGGGCCC 4440 

CCTTATGTGA ACCTCTTGCC TCTTCCTTTC TCCCATCAGA GTGGTTGGAT GGAGCCATTG 4500 

GCCTCCTTTT CTTCAGCGGG CCCTTCAACC TCTCTGCACC ATGTTGTCTG GCTGAGGAGC 4560 

TACTAGAAAA GCTGAGTGGA GTCTCCTTTC CAACAGGATG ATGCATTTGC TCAATTCTCA 4620 

GGGCTGGAAT GAGCCGGCTG GTCCCCCAGA AAGCTGGAGT GGGGTACAGA GTTCAGTTTT 4680 

CCTCTCTGTT TACAGCTCCT TGACAGTCCC ACGCCCATCT GGAGTGGGAG CTGGGAGTTA 4740 
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10 
15 
20 
25 
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40 
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GTGTTGGAGA 
TGCACAGATA 
GGTAGGAGTG 
GGTGTTCAAT 
TTGTCTTGGG 
TTAGTCTTGG 
GGAAAAAATA 
TGGGCTGTAT. 
AACTTTTCAT 
GAACTTCCAA 
AGTTGGTC6A 
GCCCCCAGAT 
GGAAGGAAGC 
CTCCTTCCGC 
CTTCATGCTG 
GCCCCAGTGC 
AGCCCTGGTG 
GGTGTACAGA 



AGAAACAACA 
CTCTTCAAGC 
CCGCCTCTAC 
AGGCTGGGAG 
CTTTCGTCAT 
TCATCAGAAC 
AACTCTTCCA 
GTATATTGTT 
GGACACAATT 
ACTCAGGAAG 
CAGATGTTAG 
CCCACAGTCA 
CATGGCTGTG 
CCCAGGTTTC 
CCTTCAAAGC 
TTGGCGATGC 
GGCAGGGTTG- 
ATCAACAATA 



AAAGCCAATT 
ACTGGACGTG 
CCACTTGTGA 
TTTTATTTAT 
TAAACCAAAG 
CTCACTTGGT 
TCCCTTAAAG- 
CTTCCTCCTT 
TCCACAACCT 
TTTGCAGAGA 
ATGTATCCTA 
GAACTGAATC 
GTTCAGAGAG 
TTCTTCTCTT 
TAGATCATGT 
ATTTACAGAT- 

AATAATATAC 



AGAACCACTA 
GATTCTCTCT 
TGGGGTACAG 
CTCTTCAAAC 
GAAATGGAAG 
ACCATATAGA 
AATAGAATAG 
AGAATTTAGA 
TTCAGATGCT 
GCAGACAGCT. 
GCTTTTAGCC 
TGCGTTGTTG 
GGTGGGCTGG 
AAGGAGAGAT 
TTGCCTTGCT 
TTCTAGGCCC 
CTTCTGCTGG 
ATGTAT 



TTTTTAAAAA 
CTAGCCCTCA 
AGGCACTTGC 
TTTGTACAAG 
CCATTCCCCT 
TCAAAAGCTT 
TTTGTCCCTC 
GATACAAGAG 
GATGTAGAGC 
AGAGATAACT 
ATAAACCACT 
GGAAGCCAGC 
CAAGCCACTT 
TGTTCTCACC 
TAGAGAATTA 
TCAGGGTTTT 
ATGCTGCTTG 



Seq ID NO: 286 Protein sequence: 
Protein Accession #: NP_570843.1 



l 
I 

MPLKHYLLLI* 
LNTHITELNE 
FQGLDSLESL 
GKNSLTHISP. 
FHNNHNLQRL 
YDNHISSLPD 
FRMLANLQNI 
YDNP WRCDSD ' 
VPSYPETPWY 
IAAIVIGIVA 



11 
I 

VGCQAWGAGL 
SPFLNISALI 
LLSSNQLLQI 
RVFQHLGNLQ 
YLSNNHISQL. 
NVFSNLRQLQ- 
SLQNNRLRQL 
ILPLRNWLLL 
PDTPSYPDTT 
LACSLAACVG 



21 
I 

AYHGCPSECT 
ALRIEKNELS- 
QPAHFSQCSN 
VLRLYENRLT 
PPSIFMQLPQ 
VLILSRNQIS 
PGNIFANVNG 
NQPRLGTDTV: 
SVSSTTELTS 
CCCCKKRSQA 



31 
I 

CSRASQVECT 
RITPGAFRNL 
LKELQLHGNH 
DIPKGTFDGIi 
LNRLTLFGNS 
FISPGAFNGL 
LMAIQLQNNQ 
PVCFSPANVR- 
PVEDYTDLTT 
VLMQMKAPNE 



41 
I 

GARIVAVPTP 
GSLRYliSLAN 
LEY I PDGAFD 
VNLQELALQQ 
LKELSLGIFG 
TELRELSLHT 
LENLPLGIFD 
GQSLIIINVN 
IQVTDDRSVW 
C 



GTGCTTACTG 
GCACCCCTGC 
TCTTCTGCAT 
AGCTCATGGC 

TGTAACCACA 

TCATGGGAAT. 

TTCTACTTAG 

TATTGGGAAA 

CGGGACCCAG 

CAAAGATTCA 

AGTGGCCTTG- 

CCGGGGAAAA 

AACCCGCTGC 

CTGCAAATCA 

GTAGAGTGTG 

TAATCCATTT 



51 
I 

LPWNAMSLQI 
NKLOVIiPIGI. 
HLVGLTKLNL 
NQIGLLSPGti 
PMPNLRELWL 
NALQDLDGNV 
HLGKLCELRL 
VAVPSVHVPE 
GMTQAQSGLA 



Seq ID NO: 287 DNA sequence 
Nucleic Acid Accession #: NM_002362 
Coding sequence: 1-.954 



ATGTCTTCTG 
GAGGCCCTGG 
TCCTCCTCCT 
GGTCCTCCCC 
TGGAGGCAAC 
GACGCAGAGT 
CTGCTCCGCA- 
ATCAAAAATT- 
ATGATCTTTG 

acctgcctgg: 
ggccttctga 
gaaatctggg 
ggggagccca 
caggtacccg 
gaaaccagct 
gcctacccat 



11 
I 

AGCAGAAGAG 
GCCTGGTGGG 
CTCCTCTGGT. 
AGAGTCCTCA 
CCAATGAGGG 
CCTTGTTCCG 
AGTATCGAGC 
ACAAGCGCTG. 
GCATTGACGT 
GCCTTTCCTA 
TAATCGTCCT 
AGGAGCTGGG 
GGAAACTGCT 
GCAGTAATCC 
ATGTGAAAGT 
CCCTGCGTGA 



21 
I 

TCAGCACTGC 
TGCACAGGCT. 
CCCTGGCACC 
GGGAGCCTCT 
TTCCAGCAGC 
AGAAGCACTC 
CAAGGAGCTG 
CTTTCCTGTG 
GAAGGAAGTG: 
TGATGGCCTG 
GGGCACAATT 
TGTGATGGGG 
CACCCAAGAT 
TGCGCGCTAT 
CCTGGAGCAT 
AGCAGCTTTG 



31 
I 

AAGCCTGAGG 
CCTACTACTG 
CTGGAGGAAG 
GCCTTACCCA 
CAAGAAGAGG' 

agtaacaagg; 
gtcacaaagg 
atcttcggca 
gaccccgcca 

CTGGGTAATA 
GCAATGGAGG 
GTGTATGATG 
TGGGTGCAGG. 
GAGTTCCTGT 
GTGGTCAGGG 
TTAGAGGAGG 



41 
I 

AAGGCGTTGA 
AGGAGCAGGA 
TGCCTGCTGC 
CTACCATCAG 
AGGGGCCAAG 
TGGATGAGTT 
CAGAAATGCT 
AAGCCTCCGA 
GCAACACCTA 
ATCAGATCTT 
GCGACAGCGC 
GGAGGGAGCA 
AAAACTACCT 
GGGGTCCAAG 
TCAATGCAAG 
AAGAGGGAGT 



51 
I 

GGCCCAAGAA 
GGCTGCTGTC 
TGAGTCAGCA 
CTTCACTTGC. 
CACCTCGCCT 
GGCTCATTTT 
GGAGAGAGTC- 
GTCCCTGAAG; 
CACCCTTGTC 
TCCCAAGACA 
CTCTGAGGAG 
CACTGTCTAT 
GGAGTACCGG ' 
GGCTCTGGCT 
AGTTCGCATT 
CTGA 



4 BOO 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 ' 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 



60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



60 
65 
70 
75 
80 
85 



Seq ID NO: 288 Protein sequence: 
Protein Accession 8: NP_002353.1 

1 11 21 31 41 51 

I I I I I I 

MSSEQKSQHC KPEEGVEAQE EALGLVGAQA PTTEEQEAAV SSSSPLVPGT LEEVPAAESA 
GPPQSPQGAS ALPTTISFTC WRQPNEGSSS QEEEGPSTSP DAESLFREAL SNKVDELAHF 
LLRKYRAKEL VTKAEMLERV IKNYKRCFPV. IFGKASESLK MIFGIDVXEV DPASNTYTLV 
TCLGLSYDGL LGNNQIFPKT GLLIIVLGTI AMEGDSASEB EIWEELGVMG VYDGREHTVY 
GEPRKLLTQD WVQENYLEYR QVPGSNPARY EFLWG PRAIA ETSYVKVLEH WRVNARVRI 
AYPSLREAAL LEEEEGV 

Seq ID NO: 2B9 DNA sequence 
Nucleic Acid Accession #: NM_002362 
Coding sequence: 46.. 13 44 



1 
I 

CGGCGGCCGC 
GGCGACCTGA 
CATCAGCGCG 
CTCAACAGAC 
TTGACCAGAA 
CAGCCCATCG 
GGCCCCAGCA 
GTTCTACCTG 
AAATCCCATC 



11 
I 

GCCCTGGTTG 
AGCAGGCGCT 
GCAGCAGCAC 
ATAATATTGT 
ATGTGCAGTC 
ATTTGAGTGC 
GTGAAAATCT 
CAGCTGAATT 
rCCTCGATTA 



21 
I 

GGTCCCCACT 
TCCCTGTGTG 
TGCAAAGAAA 
GTTTGGTGAT. 
TGTGTCTATT 
ATGCACTGTT 
GGAGGAAGAG 
CCATGGGCTT 
TGTGATGACA 



31 
I 

GCTCTCGGGG 
GCCGAGTCGC 
GAAGACATAA 
TACACATGGA 
ATTGACACAG 
GCACTTCACA 
ACAGAAAACA 
TGGGACAGCT 
ACTrTACTGT 



41 
I 

GCGCCATGGA 
CAACGGTCCA 
ACCTGAGTGT 
CTGAGTTTGA 
AATTAAAGGT 
TTTTCCAGCT 
TAATTGCAGC 
TGGTATACGA 
TTTCAGACAA 



51 
I 

CGAGGCCGTG 
CGTGGAGGTG 
TAGAAAGCTA 
TGAACCTTTT 
TAAAGACTCA 
GAATGAAGAT 
AAATCACTGG 
TGTGGAAGTC 
GAACGTCAAC 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 



294 



WO 02/086443 ...... 

AGCAACCTCA TCACCTGGAA CCGGGTGGTG CTGCTCCACG GTCCTCCTGG CACTGGAAAA 600 

. ACATCCCTGT GTAAAGCGTT AGCCCAGAAA TTGACAATTA GACTTTCAAG CAGGTACCGA 660 

TATGGCCAAT taattgaaat AAACAGCCAC AGCCTCTTTT CTAAGTGGTT TTCGGAAAGT .. 720. 

GGCAAGCTGQ TAACCAAGAT GTTTCAGAAG ATTCAGGATT TGATTGATGA TAAAGACGCC 780 

CTGGTGTTCG TGCTGATTGA TGAGGTGGAG AGTCTCACAG COGCCCGAAA TGCCTGCAGG 840 

GOGGGCACCG AG CCATCAG A TGCCATCCGC GTGGTCAATG CTGTCTTGAC CCAAATTGAT 900 

CAGATTAAAA GGCATTCCAA' TGTTGTGATT CTGACCACTT CTAACATCAC CGAGAAGATC 960 

GACGTGGCCT TCGTGGACAG GGCTGACATC AAGCAGTACA TTGGGCCACC CTCTGCAGCA 1020 

GCCATCTTCA. AAATCTACCT CTCTTGTTTG GAAGAACTGA TGAAGTGTCA GATCATATAC 1080 

CCTCGCCAGC AGCTGCTGAC CCTCCGAGAG. CTAGAGATGA- TTGGCTTCAT; TGAAAACAAC 1140 

GTGTCAAAAT TGAGCCTTCT TTTGAATGAC ATTTCAAGGA"- AGAGCGAGGG: CCTCAGCGGC 1200 

CGGGTCCTGA'. GAAAACTCCC CTTTCTGGCT: CATGCGCTGT ATGTCCAGGC CCCCACCGTC 1260 

ACCAT AGAGG * GGTTCCTCCA . GGCCCTGTCTj CTGGCAGTGG- ACAAGCAGTT. TGAAGAGAGA 1320 

AAGAAGCTTG: CAGCTTACAT CTGATCCTGG GCTTCCCCAT CTGGTGCTTT TCCCATGGAG 1380 

AACACACAAC CAGTAAGTGA GGTTGCCCCA CACAGCCGTC TCCCAGGGAA TCCCTTCTGC 1440 

AAACCAAACG TTACTTAGAC TGCAAGCTAG AAAGCCACCA- AGGCCAGGCT" TTGTTAAAAG 1500 

AAGTGTATTC TATTTATGTT GTTTT AAAAT GCATACTGAG. AGACAAACAT CTTGTCATTT 1560 

TCACTGTTTG TAAAAGATAA TTCAGATTGT TTGTCTCCTT- GTGAAGAACC ATCGAAACCT 1620 

GTTTGTTCCC AGCCCACCCC CAGTGGATGG GATGCATAAT GCCAGCAAGT TTTGTTTAAC 1680 

AGCAAAAAAG GAAGATTAAT GCAGGTGTTA TAGAAGCCAG AAGAGAAACT GTGTCACCCT 1740 

AAAGAAGCAT ATAATCATAG CATTAAAAAT- GCACACATTA CTCCAGGTGG AAGGTGGCAA - 1800- 

TTGCTTTCTG- ATATCAGCTC GTTTGATTTA GTGCAAAAAT GTTTTCAAGA CTATTTAATG. : 1860 

GATGTAAAAA AGCCTATTTC TACATTATAC CAACTGAGAA AAAAATGGTC GGTAAAGTGT' 1920 

TCTTTCATAA- TAAATAATCA AGACATGGTC CCATTTGCAG GAAAAGTGCA- GACTCTGAGT 1980 

GTTCCAGGGA AACACATGCT GGACATCCCT, TGTAACCCGG TATGGGCGCC CCTGCATTGC 2040 

TGGGATGTTT CTGCCCACGG TTTTGTTTGT- GCAATAACGT " TATCACATTT CTAATGAGGA- 2100 

TTCACATTAA TATAATATAA- AATAAATAGG TCAGTTACTG GTCTCTTTCT GCCGAATGTT 2160 
ATGTTTTGCT TTTATCTCAC AGTAAAATAA" ATATAATTAA AAA 

Seq ID NO: 290 Protein sequence: 
Protein Accession #i NP_004228 

1 11 21 31 41 51 

MDEAVGDLKQ ALPCVAESPT- VHVEVHQRGS, STAKKEDINI* SVRKLLNRHN IVFGDYTWTE 60 

FDEPFLTRNV QSVSIIDTEL KVKDSQPIDL SACTVALHIF QLNEDGPSSE NLEEETENII 120 

AANHWVLPAA' EFHGLWDSLV' YDVEVKSHLL DYVMTTLLFS DKNVNSNIiIT- WKRWLLHGP 180 

PGTGKTSLCK ALAQKLTIRL SSRYRYGQLI EINSHSLFSK' WFSESGKLVT KMFQKIQDLI 240 

DDKDALVFVL IDEVESLTAA- RNACRAGTEP SDAIRWNAV LTQIDQIKRH SNWIIiTTSR. 300 

ITEKIDVAFV-- DRADIKQYIG PPSAAAIFKI YLSCLEELMK OQIIYPRQQL LTLRELEMIG 360 

FIENNVSKLS- LLLNDISRKS. EGLSGRVLRK LPFLAHALYV QAPTVTIEGF LQALSLAVDK 420 
QFEERKKLAA YI 

Seq ID NO: 291 DNA sequence 

Nucleic Acid Accession #: NMJ)02658.1 

Coding sequence: 77-1372 

1 11 21 31 |1 SI 

GTCCCCGCAG CGCCGTCGCG CCCTCCTGCC GCAGGCCACC GAGGCCGCCG CCGTCTAGCG 60 

CCCCGACCTC GCCACCATGA GAGCCCTGCT GGCGCGCCTG: CTTCTCTGCG TCCTGGTCGT 120 

GAGCGACTCC AAAGGCAGCA ATGAACTTCA TCAAGTTCCA TCGAACTGTG ACTGTCTAAA 180 

TGGAGGAACA TGTGTGTCCA ACAAGTACTT CTCCAACATT CACTGGTGCA ACTGCCCAAA 240 

GAAATTCGGA GGGCAGCACT GTGAAATAGA TAAGTCAAAA ACCTG CTATG AGGGGAATGG; 300 

TCACTTTTAC CGAGGAAAGG CCAGCACTGA CACCATGGGC CGGCCCTGCC TGCCCTGGAA 360 

CTCTGCCACT GTCCTTCAGC AAACGTACCA- TGCCCACAGA TCTGATGCTC TTCAGCTGGG 420 

CCTGGGGAAA - CATAATTACT GCAGGAACCC AGACAACCGG AGGCGACCCT GGTGCTATGT ' 480 

GCAGGTGGGC CTAAAGCCGC TTGTCCAAGA GTGCATGGTG CATGACTGCG CAGATGGAAA 540 

AAAGCCCTCC TCTCCTCCAG AAGAATTAAA ATTTCAGTGT. GGCCAAAAGA CTCTGAGGCC 600 

CCGCTTTAAG ATTATTGGGG GAGAATTCAC CACCATCGAG AACCAGCCCT GGTTTGCGGC 660 

CATCTACAGG AGGCACCGGG GGGGCTCTGT CACCTACGTG TGTGGAGGCA GCCTCATCAG 720 

CCCTTGCTGG GTGATCAGCG CCACACACTG CTTCATTGAT TACCCAAAGA AGGAGGACTA 780 

CAT CGTCT AC CTGGGTCGCT CAAGGCTTAA CTCCAACACG CAAGGGGAGA TGAAGTTTGA- 840 

GGTGGAAAAC CTCATCCTAC ACAAGGACTA CAGCGCTGAC ACGCTTGCTC ACCACAACGA 900 

CATTGCCTTG CTGAAGATCC GTTCCAAGGA GGGCAGGTGT GCGCAGCCAT CCCGGACTAT 960 

ACAGACCATC TGCCTGCCCT CGATGTATAA CGATCCCCAG TTTGGCACAA GCTGTGAGAT 1020 

CACTGGCTTT GGAAAAGAGA ATTCTACCGA CTATCTCTAT CCGGAGCAGC TGAAAATGAC 1080 

TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTACG GCTCTGAAGT 1140 

CACCACCAAA ATGCTATGTG CTGCTGACCC CCAATGGAAA* ACAGATTCCT GCCAGGGAGA 1200 

CTCAGGGGGA CCCCTCGTCT- GTTCCCTCCA' AGGCCGCATG ACTTTGACTG GAATTGTGAG 1260 

CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT. 1320 

CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC: 1380 

AGGGAGGAAA CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 1440 

TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAGATGGAT TTGCCTGTGG 1500 

CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC' 1560 

CAGACCCTCT" GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 1620 

TGTCTTTTTC TGGACTGAAG CCTGCAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 16B0 

GGCTCGAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA. 1740 



GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 1800 

ACACTAACGA CTTCAGGGCA GGGCTCTGAT 1860 

TGTGTGTATG TTTGCACACT TGTTGTGTGG 1920 

TGTCTGATTG TTAAGTCTAA ATATTTCCTT 1980 

m , ^ iVJi0 ^ lvJ4uniu TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 2040 

CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 2100 

ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAGTT 2160 

ATCCCTTCCT TTTAGCCTAG TTCATCCAAT CCTCACTGGG TGGGGTGAGG ACCACTCCTT 2220 

ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 2280 



AATGAATAAT- TTCCCAATTA GGAAGTGTAA 
AATGTGGGAG CAGCGGTTTG GGGAGCAGAG 
ATTCCATGAA TGTATCAGGA AATATATATG 
GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG 
AAACTGTGTG GACTGTGATG CCACACAGAG 



295 



WO 02/086443 

ATCAATAAAA TGTGATn'TT CTGA" 

Seq ID NO: 292 Protein* sequence : 
Protein Accession 8 :NP_002649.1 

1 11 ; 21 . 31 41 51 . . 

iL^LARIJ^L CVLVVSDSKG SNEUHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 60 

HCEIDKSKTC YEGNGHFYRG KASTDTKGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120 

YCRNPDNRRR* PKCYVQVGLK PLVQECMVHD CADGKKPSSP ■ PEELKFQCGQ KTLRPRFKI I 180 

"GGEFTTIENQ PWPAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCFIDYP KKEDYIVYLG 240 

RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL. 300 

PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTWKLI SHRECQQPHY YGSEVTTKML • 360 

CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 420 
SHTXEENGLA L 

Seq ID NO: 293 DNA sequence " 
•Nucleic Acid Accession #: NM_001498 
Coding sequence: 9 3.. 2 00 6 

1 ll 21 31 41 51 

I I I - I I I 

GGCACGAGGC TGAGTGTCCG TCTCGCCCCC GGAAGCGGGC GACCGCCGTC AGCCCGGAGG 60 

AGGAGGAGGA. GGAGGAGGAG. GAGGGGGCGG CCATGGGGCT GCTGTCCCAG GGCTCGCCGC 120 

TGAGCTGGGA GGAAACCAAG. CGCCATGCCG^ ACCACGTGCG GCGGCACGGG ATCCTCCAGT 180 

TCCTGCACAT- CTACCACGCC GTCAAGGACC GGCACAAGGA* CGTTCTCAAG TGGGGCGATG 240 

AGGTGGAATA CATGTTGGTA TCTTTTGATC ATGAAAATAA AAAAGTCCGG TTGGTCCTGT 300 

CTGGGGAGAA AGTTCTTGAA- ACTCTGCAAG"; AGAAGGGGGA AAGGACAAAC CCAAACCATC 360 

CTACCCTTTG GAGACCAGAG. TATGGGAGTT ACATGATTGA AGGGACACCA GGACAGCCCT 420 

ACGGAGGAAC AATGTCCGAG TTCAATACAG TTGAGGCCAA* CATGCGAAAA CGCCGGAAGG 480 

AGGCTACTTC TATATTAGAA GAAAATCAGG CTCTTTGCAC AATAACTTCA TTTCCCAGAT 540 

TAGGCTGTCC TGGGTTCACA- CTGCCCGAGG . TCAAACCCAA CCCAGTGGAA GGAGGAGCTT • 600 

CCAAGTCCCT CTTCTTTCCA GATGAAGCAA" TAAACAAGCA CCCTCGCTTC AGTACCTTAA 660 

CAAGAAATAT'- CCGACATAGG AGAGGAGAAA AGGTTGTCAT CAATGTACCA ATATTTAAGG 720 
ACAAGAATAC " ACCATCTCCA TTT ATAGAAA CATTTACTGA GGATGATGAA GCTTCAAGGG 780 

CTTCTAAGCC GG AT CAT ATT TACATGGATG CCATGGGATT TGGAATGGGC AATTGCTGTC: 840 

TCCAGGTGAC ATTCCAAGCC TGCAGTATAT CTGAGGCCAG- ATA CCTTT AT GATCAGTTGG- 900 

CTACTATCTG • TCCAATTGTT ATGGCTTTGA' GTGCTGCATC TCCCTTTTAC CGAGGCTATG- 960 

TGTCAGACAT TGATTGTCGC TGGGGAGTGA 1 TTTCTGCATC TGTAGATGAT AGAACTCGGG- 1020 

AGGAGCGAGG ACTGGAGCCA TTGAAGAACA: ATAACTATAG GATCAGTAAA TCCCGATATG- 1080 

ACTCAATAGA' CAGCTATTTA TCTAAGTGTG. GTGAGAAATA TAATGACATC GACTTGACGA: 1140 

TAGATAAAGA GATCTACGAA CAGCTGTTGC AGGAAGGCAT. TGATCATCTC CTGGCCCAGC 1200 

ATGTTGCTCA- TCTCTTTATT AGAGACCCAC TGACACTGTT TGAAGAGAAA ATACACCTGG 1260 

ATGATGCTAA- TGAGTCTGAC CATTTTGAGA ATATTCAGTC CACAAATTGG CAGACAATGA 1320 

GATTTAAGCC CCCTCCTCCA AACTCAGACA TTGGATGGAG AGTAGAATTT CGACCCATGG 1380 

AGGTGCAATT AACAGACTTT GAGAACTCTG CCTATGTGGT GTTTGTGGTA CTGCTCACCA- 1440 

GAGTGATCCT TTCCTACAAA* TTGGATTTTC TCATTCCACT GTCAAAGGTT GATGAGAACA 1500 

TGAAGGTAGC ACAGAAAAGA GATGCTGTCT TGCAGGGAAT GTTTTATTTC AGGAAAGATA 1560 

TTTGCAAAGG TGGCAATGCA GTGGTGGATG GTTGTGGCAA GGCCCAGAAC AGCACGGAGC. 1620 

TCGCTGCAGA GGAGTACACC CTCATGAGCA TAGACACCAT CATCAATGGG AAGGAAGGTG 1680 

TGTTTCCTGG ACTGATCCCA ATTCTGAACT CTTACCTTGA AAACATGGAA GTGGATGTGG 1740 

ACACCAGATG TAGTATTCTG AACTACCTAA AGCTAATTAA GAAGAGAGCA TCTGGAGAAC 1800 

TAATGACAGT. TGCCAGATGG ATGAGGGAGT TTATCGCAAA CCATCCTGAC TACAAGCAAG I860 

ACAGTGTCAT AACTGATGAA ATGAATTATA GCCTTATTTT GAAGTGTAAC CAAATTGCAA 1920 

ATGAATTATG TGAATGCCCA GAGTTACTTG GATCAGCATT TAGGAAAGTA AAATATAGTG, 1980 

GAAGTAAAAC TGACTCATCC AACTAGACAT TCTACAGAAA GAAAAATGCA TTATTGACGA" 2040 

ACTGGCTACA GTACCATGCC TCTCAGCCCG TGTGTATAAT- ATGAAGACCA AATGATAGAA' 2100 

CTGTACTGTT TTCTGGGCCA' GTGAGCCAGA AATTGATTAA GGCTTTCTTT GGTAGGTAAA 2160 

TCTAGAGTTT ATACAGTGTA ' CATGTACATA GTAAAGTATT TTTGATTAAC AATGT ATTTT 2220 

AATAACATAT CTAAAGTCAT- CATGAACTGG CTTGTACATT TTTAAATTCT -TACTCTGGAG 2280 

CAACCTACTG TCTAAGCAGT TTTGTAAATG TACTGGTAAT TGTACAATAC TTGCATTCCA 2340 

GAGTTAAAAT GTTTACTGTA AATTTTTGTT CTTTTAAAGA CTACCTGGGA CCTGATTTAT 2400 

TGAAATTTTT- CTCTTTAAAA ACATTTTCTC TCGTTAATTT TCCTTTGTCA TTTCCTTTGT 2460 

TGTCTACATT AAATCACTTG AATCCATTGA AAGTGCTTCA AGGGTAATCT TGGGTTTCTA 2520 

GCACCTTATC TATGATGTTT CTTTTGCAAT TGGAATAATC ACTTGGTCAC CTTGCCCCAA 2580 
GCTTTCCCCT CTGAATAAAT ACCCATTGAA CTCTGAAAAA AAAAAAAAAA AAAA 

Seq ID NO: 294 Protein sequence: 
Protein Accession # : NP_001489 

1 11 . * . 21 . 3 1 41 5 * 

KGLLSQGSPL SWEET KRHAD HVRRHGILQF LHIYHAVKDR HKDVLKWGDE VEYMLVSFDH 60 

ENKKVRLVLS GEKVLETLQE RGERTNPNHP TLWRPEYGSY MtEGTPGQPY GGTMSEFNTV 120 

EANMRKRRKE ATSILEENQA I*CTITSFPRL GCPGFTLPEV KFNPVEGGAS KSLFFPDEAI 180 

NKHPRFSTLT RNIRHRRGEK WINVPIFKD KNTPSPFIET FTEDDEASRA SKPDHIYMDA 240 

MGFGMGNCCI* QVTFQACSIS EARYLYDQLA TICPIVMALS AASPFYRGYV SDIDCRWGVI 300 

SASVDDRTRE ERGLEPLKNN NYRISKSRYD SIDSYLSKCG EKYNDIDLTI DKEIYEQLLQ 360 

EGIDHLLAQH VAHLFIRDPL TLFEEKIHLD DANESDHFEN IQSTNWQTMR FKPPPPNSDI 420 

GWRVEFRPME VQLTDFENSA YWFWLLTR VILSYKLDFL IPLSKVDENM KVAQKRDAVL 480 

OGMFYFRKDI CKGGNAWDG CGKAQNSTEL AAEEYTLMSI DTIINGKEGV FPGLIPILNS 540 

YLENMEVDVD TRCSILNYLK LIKKRASGEL MTVARWMREF IANHPDYKQD SVITDEMNYS 600 
LIItKCNQIAN ELCECPEl»iiG SAFRKVKYSG SKTDSSN 



296 
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15 
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85 



VVO02/086443 

Seq ID NO: 295 DNA sequence ■ 

Nucleic Acid Accession 8 : Bos sequence 

Coding sequence: 247-816 



PCT/US02/12476 



I 

AGTGTTCGGC 
GGCCAAACGG 
CCTAGGGGGC 
• GGGAGGCGCC 
GAAACAATGA 
CCCAGGGAAT 
AAACAAGGAG 
ATGACAGGAC 
TTCAGCAAAG' 
ACCAGCAGTT 
CAACGAGAAA 
CAAAAATATG 
AAGCGATTTT 
AAGCACCTTA 
CACACCCCAA 
TTCTACAATG 
CTTCCAGAGG 
TTGAAAGGAT' 
TAACCCATTA 



11 
I 

TGGGGCAGGC 
GATCGGTGCT 
ACATTTCCCA 
ACAACTTCAC 
CCGATAAAAC 
GTGACAGTCC 
CAGGAGACAG 
ATGCTATTCC' 
ATAGGATGAT- 
TCTCTGGAGA 
TTAATGCTGA 
AAAAAATCTT 
TTGAATCCAT 
AGAAGAAACT 
ATGCATAATC 
GAGCAGGATA 
CTAAGAAATT 
AACTTGTGTT 
GGTAAATACT 



21 
I 

AOGCTGTGGC 
TCTGGTGAGA 
CAACTCCCAG 
TGCCATTTTG 
AGAGAAGGTG 
TTCGTATCAG 
CCTTATTGCA 
ACCCAGCCAA 
GCAGAAACCT. 
TGACCTAGAA 
TATAAAACGT- 
CGAAATGCTT 
CATCAAGGAA 
GAAACGTATG 
TCGTTAATGA- 
TTGCTGAAGT 
TCTGTTAGTA 
TTGGTTATTT 
ATTACAGTCG 



31 
I 

TGGCTACTTC 
CGCCTCCCCA 
AGGGCAGGTT 

tgaggtgccg; 
gctgtagatc 
aaaaggcaga 
ggctctgcca 

TTGGATTCTC 
GGTAGCAATG 
TGCAGAGAAA 
AAATTAGTGA 
GAAGGAGTGC 
GCAGCAAGAT 
ATTTGAGAAT 
TTGAGGAGAG 
CTCCTGGCAT 
AAAGATGTTC 
TGTATTCCCA 
TGGTTTCTGC 



41 

I 

CCTTCCTCCC 
TGCACATCAC 
■TCTAGAAAGT 
CCGTCTCTCC 
CTGAAACTGT 
GGATGGCCCT- 
TGTCCAAAGA 
AGATTGATGA 
CACCTGTGGG 
CAGCCTCCTC 
AGGAACTCCG 
AAGGACCTAC 
GTATGAGACG 
ACTTGTCCCT 
AAAAGGATCA 
ATGTTACCGA 
TTTTTCCCAA 
CCTGTGCTGG 
A 



51 
I. 

ATCCCCCTTG 

TCCCAGGTGC 

GCCACCAGTG 

TCCAGCAAGG • 

GTTTAAACGT 

GTTGGCAAGG 

AAAGAAGCTT 

CTTCACTGGT 

AGGAAACGTT 

TCCCAAAAGC 

ATGCGTTGGA . 

tgcagtcagg; 

AGACTTTGTT 
GGAGGATTAT 
GATTGCTGTT 
ATCAAATAGC- 
AGCATTTTAT 
TAGATATTAT 



Seq ID NO: 295 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41* .5i 

! | . | - I I I 

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 
GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQR 
. EINADIKRKL. VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 
UQCKLKRMI 

Seq ID NO: 297 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 247-815 



AGTGTTCGGC 
GGCCAAACGG 
CCTAGGGGGC 
GGGAGGCGCC 
GAAACAATGA ' 
CCCAGGGAAT 
AAACAAGGAG 
ATGACAGGAC 
TTCAGCAAAG 
ACCAGCAGTT. 
CAACAAGAAA 
- CAAAAATATG 
AAACGATTTT, 
AAGCACCTTA 
CACACCCCAA 
TTCTACAATG 
CTTCCAGAGG 
TTGAAAGGAT 
TAACCCATTA 



11 
I 

TGGGGCAGGC 
GATCGGTGCT. 
ACATTTCCCA 
ACAACTTCAC 
CCGATAAAAC 
GTGACAGTCC 
CAGGAGACAG: 
ATGCTATTCC 
ATAGGATGAT^ 
TCTCTGGAGA 
TTAATGCTGA 7 
AAAAAATCTT 
TTGAATCCAT 
AGAAGAAACT 
ATGCATAATC 
GAGCAGGATA 
CTAAGAAATT 
AACTTGTGTT 
GGTAAATACT 



21 
I 

ACGCTGTGGC 
TCTGGTGAGA 
CAACTCCCAG 
TGCCATTTTG. 
AGAGAAGGTG 
TTCGTATCAG 
CCTTATTGCA" 
ACCCAGCCAA 
GCAGAAACCT 
TGACCTAGAA" 
TATAAAACGT 
CGAAATGCTT 
CATCAAGGAA 
GAAACGTATG 
TCATTAATGA 
TTGCTGAAGT 
TCTGTTAGTA 
TTGGTTATTT 
ATTACAGTCG 



31 
I 

TGGCTACTTC 
CGCCTCCCCA 
AGGGCAGGTT 
TGAGGTGCCG- 
GCTGTAGATC 
AAAAGGCAGA^ 
GGCTCTGCCA- 
TTGGATTCTC 
GGTAGCAATG 
TGCAGAGAAA 
AAATTAGTGA 
GAAGGAGTGC 
GCAGCAAGAT 
ATTTGAGAAT 
TTGAGGAGAG: 
CTCCTGGCAT 
AAAGATGTTC 
TGTATTCCCA 
TGGTTTCTGC 



41 
i 

CCTTCCTCCC 
TGCACATCAC 
TCTAGAAAGT 
CCGTCTCTCC 
CTGAAACTGT, 
GGATGGCCCT- 
TGTCCAAAGA 
AGATTGATGA 
CACCTGTGGG 
CAGCCTCCTC 
AGGAACTCCG 
AAGGACCTAC 
GTATGAGACG 
ACTTGTCCCT 
AAAAGGATCA 
ATGTTACCGA 
TTTTTCCCAA 
CCTGTGCTGG 
A 



51 
I 

ATCCCCCTTG 
TCCCAGGTGC 
GCCACCAGTG. 
TCCAGCAAGG 
GTTTAAACGT 
GTTGGCAAGG 
AAAGAAGCTT 
CTTCACTGGT 
AGGAAACGTT 
TCCCAAAAGC 
ATGCGTTGGA: 
TGCAGTCAGG 
AGACTTTGTT 
GGAGGATTAT 
GATTGCTGTT 
ATCAACTGGC 
AGCGTTTTAT- 
TAGATATTAT 



Seq ID NO: 2 98 Protein sequence: 
Protein Accession #: Eos sequence 



11 



21 
I 



31. 



41 



51 



MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 
GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR BTASSPKSQQ 
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 
LKKKLKRMI 

Seq ID NO: 299 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 247-815" 



1 
I 

AGTGTTCGGC 
GGCCAAACGG 
CCTAGGGGGC 
GGGAGGCGCC 
GAAACAATGA 
CCCAGGGAAT 
AAACAAGGAG 
TGACAGGACA 



11 
I 

TGGGGCAGGC 
.GATCGGTGCT 
ACATTTCCCA 
ACAACTTCAC 
CCGATAAAAC 
GTGACAGTCC 
CAGGAGACAG 
TGCTATTCCA 



21 
I ' 

ACGCTGTGGC 
TCTGGTGAGA 
CAACTCCCAG 
TGCCATTTTG 
AGAGAAGGTG 
TTCGTATCAG 
CCTTATTGCA 
CCCAGCCAAT 



31 
I 

TGGCTACTTC 
CGCCTCCCCA 
AGGGCAGGTT 
TGAGGTGCCG 
GCTGTAGATC 
AAAAGGCAGA 
GGCTCTGCCA 
TGGATTCTCA 



41 
I 

CCTTCCTCCC 
TGCACATCAC 
TCTAGAAAGT 
CCGTCTCTCC 
CTGAAACTGT 
GGATGGCCCT 
TGTCCAAAGC 
GATTGATGAC 



51 
I 

ATCCCCCTTG 
TCCCAGGTGC 
GCCACCAGTG 
TCCAGCAAGG 
GTTTAAACGT 
GTTGGCAAGG 
AAAGAGCTTA 
TTCACTGGTT 
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WO 02/086443 ■ * 

TCAGCAAAGA TAGGATGATG CAGAAACCTG GTAGCAATGC ACCTGTGGGA GGAAACGTTA 
CCAGCAGTTT CTCTGGAGAT GACCTAGAAT GCAGAGAAAC AGCCTCCTCT CCCAAAAGCC 
AACAAGAAAT TAATGCTGAT ATAAAACGTA AATTAGTGAA GGAACTCCGA TGCGTTGGAC 
AAAAATATGA AAAAATCTTC GAAATGCTTG AAGGAGTGCA AGGACCTACT GCAGTCAGGA' 
AACGATTTTT TGAATCCATC ATCAAGGAAG CAGCAAGATG TATGAGACGA GACTTTGTTA 
AGCACCTTAA GAAGAAACTG AAACGTATGA TTTGAGAATA CTTGTCCCTG GAGGATTATC 
ACACCCCAAA- TGCATAATCT CATTAATGAT TGAGGAGAGA AAAGGATCAG ATTGCTGTTT 
TCTACAATGG AGCAGGATAT TGCTGAAGTC TCCTGGCATA TGTTACCGAA TCAACTGGCC 
TTCCAGAGGC TAAGAAATTT CTGTTAGTAA - AAGATGTTCT TTTTCCCAAA GCGTTTTATT 
TGAAAGGATA ACTTGTGTTT TGGTTATTTT GTATTCCCAC CTGTGCTGGT AGATATTATT 
AACCCATTAG GTAAATACTA TTACAGTCGT GGTTTCTGCA 

Seq ID NO: 300- Protein" sequence: 
Protein Accession ft: Eos sequence 

: 21 



41 



51 



1 11 .21 31 

MTDKTBKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKAKKLMT 
GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQQ 
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 
LKKKLKRMI 

Seq ID NO: 301 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 247-812 



AGTGTTCGGC 
GGCCAAACGG 
CCTAGGGGGC 
GGGAGGCGCC 
GAAACAATGA 
CCCAGGGAAT 
AAACAAGGAG 
TGACAGGACA 
TCAGCAAAGA 
CCAGCAATTT 
AACAAGAAAT 
AATATGAAAA 
GATTTTTTGA 
ACCTTAAGAA 
CCCCAAATGC 
ACAATGGAGC 
CAGAGGCTAA 
AAGGATAACT 
CCATTAGGTA 



11 

I 

TGGGGCAGGC 
GATCGGTGCT- 
ACATTTCCCA 
ACAACTTCAC 
CCGATAAAAC 
GTGACAGTCC 
CAGGAGACAG 
TGCTATTCCA 
TGGGATGATG: 
CTCTGGAGAT 
TAATGCTGAT 
AATCTTCGAA 
ATCCATCATC 
GAAACTGAAA 
ATAATCTCAT 
AGGATATTGC 
GAAATTTCTG: 
TGTGTTTTGG 
AATACTATTA 



21 
I 

ACGCTGTGGC 
TCTGGTGAGA 
CAACTCCCAG. 
TGCCATTTTG. 
AGAGAAGGTG 
TTCGTATCAG 
CCTTATTGCA 
CCCAGCCAAT 
CAGAAACCTG 
GACCTAGAAT 
ATAAAATGTC 
ATGCTTGAAG 
AAGGAAGCAG 
CGTATGATTT 
TAATGATTGA 
TGAAGTCTCC 
TTAGTAAAAG 
TTATTTTGTA 
CAGTCGTGGT 



31 
I 

TGGCTACTTC 
CGCCTCCCCA 
AGGGCAGGTT- 
TGAGGTGCCG, 
GCTGTAGATC 
AAAAGGCAGA 
GGCTCTGCCA 
TGGATTCTCA 
GTAGCAATGC 
GCAGAGGAAT-. 
AAGTAGTGAA 
GAGTGCAAGG, 
CAAGATGTATv 
GAGAATACTT 
GGAGAGAAAA 
TGGCATATGT 
ATGTTCTTTT 
TTCCCACCTG 
TTCTGCA 



41 
I 

CCTTCCTCCC 
TGCACATCAC 
TCTAGAAAGT 
CCGTCTCTCC 
CTGAAACTGT. 
GGATGGCCCT 
TGTCCAAAGA 
GATTGATGAC- 
ACCTGTGGGA 
AGCCTCCTCT 
GGAAATCCGA 
ACCTACTGCA 
GAGACGAGAC 
GTCCCTGGAG 
GGATCAGATT 
TACCGAATCA 
TCCCAAAGCG 
TGCTGGTAGA 



51 
I 

ATCCCCCTTG 
TCCCAGGTGC 
GCCACCAGTG 
TCCAGCAAGG 
GTTTAAACGT 
GTTGGCAAGG. 
AAAGAGCTTA 
TTCACTGGTT 
GGAAATGTTA 
CCCAAAAGCC 
TGCCTTGGAC 
GTCAGGAAAC 
TTTGTTAAGC 
GATTATCACA 
GCTGTTTTCT 
ACTGGCCTTC 
TTTTATTTGA 
TATTATTAAC 



Seq ID NO: 302 Protein sequence: 
Protein Accession 8: Eos sequence 



11 



21 



31 41 51 

I I I 

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 
GHAIPPSQLD. SQIDDFTGFS KDGMMQKPGS NAPVGGNVTS NFSGDDLECR GIASSPKSQQ 
EINADIKCQV VKEIRCLGQY EKIFEMLEGV QGPTAVRKRF FESIIKEAAR CMRRDFVKHL 
KKKLKRM1 

Seq ID NO: 303 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 247-B15" 



AGTGTTCGGC 
GGCCAAACAG 
CCTAGGGGGC 
GGGAGGCGCC 
GAAACAATGA 
CCCAGGGAAT 
AAACAAGGAG 
TGACAGGACA 
TCAGCAAAGA 
CCAGCAGTTT 
AACAAGAAAT 
AAAAATATGA 
AACGATTTTT 
AGCACCTTAA 
ACACCCCAAA 
TCTACAATGG 
TTCCAGAGGC 
TGAAAGGATA 
AACCCATTAG 



11 
I 

TGGGACAGGC 
GATCGGTGCT 
ACATTTCCCA 
ACAACTTCAC 
CCGATAAAAC 
GTGACAGTCC 
CAGGAGACAG 
TGCTATTCCA 
TAGGATGATG 
CTCTGGAGAT 
TAATGCTGAT 
AAAAATCTTC 
TGAATCCATC 
GAAGAAACTG 
TGCATAATCT 
AGCAGGATAT 
TAAGAAATTT 
ACTTGTGTTT 
GTAAATACTA 



21 

I 

ACGCTGTGGC 
TCTGGTGAGA 
CAACTCCCAG 
TGCCATTTTG 
AGAGAAGGTG 
TTCGTATCAG 
CCTTATTGCA 
CCCAGCCAAT 
CAGAAACCTG 
GACCTAGAAT 
ATAAAACGTA 
GAAATGCTTG 
ATCAAGGAAG 
AAACGTATGA 
CGTTAATGAT 
TGCTGAAGTC 
CTGTTAGTAA 
TGGTTATTTT 
TTACAGTCGT 



31 
I 

TGGCTACTTC 
CGTCTCCCCA 
AGGGCAGGTT 
TGAGGTGCCG 
GCTGTAGATC 
AAAAGGCAGA 
GGCTCTGCCA 
TGGATTCTCA 
GTAGCAATGC 
GCAGAGAAAC 
AATTAGTGAA 
AAGGAGTGCA 
CAGCAAGATG 
TTTGAGAATA 
TGAGGAGAGA 
TCCTGGCATA 
AAGATGTTCT 
GTATTCCCAC 
GGTTTCTGCA 



41 
I 

CCTTCCTTCC 
TGCACATCAC 
TCTAGAAAGT 
CCGTCTCTCC 
CTGAAACTGT 
GGATGGCCCT 
TGTCCAAAGC 
GATTGATGAC 
ACCTGTGGGA 
AGCCTCCTCT 
GGAACTCCGA 
AGGACCTACT 
TATGAGACGA 
CTTGTCCCTG 
AAAGGATCAG 
TGTTACCGAA 
TTTTCCCAAA 
CTGTGCTGGT 



51 
I 

ATCCCCCTTG 
TCCCAGATGC 
GCCACCAGTG 
TCCAGCAAGG 
GTTTAAACGT 
GTTGGCAAGG 
AAAGAGCTTA 
TTCACTGGTT 
GGAAACGTTA 
CCCAAAAGCC 
TGCGTTGGAC 
GCAGTCAGGA 
GACTTTGTTA 
GAGGATTATC 
ATTGCTGTTT 
TCAACTGGCC 
GCGTTTTATT 
AGATATTATT 
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Seq ID NO: 304 Protein. sequence : 
Protein Accession ft: Eos sequence 
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WO 02/086443 

1 11 21 31 41 Si 

I - I I .1 I I 

MTDRTEKVAV DPETVFKRPR ■ ECDSPSYQKR QRMALIARKQ GAGDSLIAGS AMSKAKKLMT 
GHA1PPSQLD SQIDDPTGPS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR BTASSPKSQQ 
ED7ADIKHKL- VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 
LKKKLKRMI 

Seq ID NO: 305 DNA sequence 
nucleic Acid Accession ft: Eos sequence 
. Coding sequencei 87-689 

1 11 21 31 41 51 

I I I I I I 

CGTGGAGGCA GCTAGCGCGA GGCTGGGGAG CGCTGAGCCG OGCGTCGTGC CCTGCGCTGC 
CCAGACTAGC GAACAATACA" GTCAGGATGG CTAAAGGTGA CCCCAAGAAA CCAAAGGGCA 
AGATGTCCGC TTATGCCTTC TTTGTGCAGA CATGCAGAGA AGAACATAAG AAGAAAAACC 
CAGAGGTCCC TGTCAATTTT; GCGGAATTTT CCAAGAAGTG CTCTGAGAGG TGGAAGACGA • 
TGTCCGGGAA AGAGAAATCT. AAATTTGATG AAATGGCAAA' GGCAGATAAA GTGCGCTATG 
ATCGGGAAAT GAAGGATTAT GGACCAGCTA AGGGAGGCAA GAAGAAGAAG* GATCCTAATG 
CTCCCAAAAG GCCACCGTCT GGATTCTTCC TGTTCTGTTC AGAATTCCGC. CCCAAGATCA 
AATCCACAAA CCCCGGCATC TCTATTGGAG ACGTGGCAAfli AAAGCTGGGT GAGATGTGGA 
ATAATTTAAA TGACAGTGAA' AAGCAGCCTT. ACATCACTAA GGCGGCAAAG CTGAAGGAGA 
AGTATGAGAA GGATGTTGCT GACTATAAGT. CGAAAGGAAA GTTTGATGGT GCAAAGGGTC 
CTGCTAAAGT TGCCCGGAAA AAGGTGGAAG AGGAAGATGA AGAAGAGGAG GAGGAAGAAG 
AGGAGGAGGA GGAGGAGGAG GATGAATAAA- GAAACTGTTT ATCTGTCTCC TTGTGAATAC 
TTAGAGTAGG GGAGCGCCGT AATTGACACA TCTCTTATTT- GAGAAGTGTC • TGTTGCCCTC. 
ATTAGGTTTA ATTACAAAAT TTGATCACGA TCATATTGTA GTCTCTCAAA GTGCTCTAGA- 
AATTGTCAGT GGTTTACATG AAGTGGCCAT GGGTGTCTGG AGCACCCTGA AACTGTATCA 
AAGTTGTACA TATTTCCAAA CATTTTTAAA- ATGAAAAGGC ACTCTCGTGT TCTCCTCACT' 
CTGTGCACTT TGCTGTTGGT GTGACAAGGC ATTTAAAGAT . GTTTCTGGCA TTTTCTTTTT'" 
ATTTGTAAGG TGGTGGTAAC TATGGTTATT GGCTAGAAAT, CCTGAGTTTT CAACTGTATA- 
TATCTATAGT TTGTAAAAAG . AACAAAACAA CCGAGACAAA CCCTTGATGC TCCTTGCTCG 
GCGTTGAGGC TGTGGGGAAG ATGCCTTTTG;- GGAGAGGCTG' TAGCTCAGGG CGTGCACTGT 
GAGGCTGGAC CTGTTGACTC. TGCAGGGGGC ATCCATTTAG CTTCAGGTTG TCTTGTTTCT" 
GTATATAGTG ACATAGCATT CTGCTGCO\T- CTTAGCTGTG GACAAAGGGG GGTCAGCTGG 
CATGAGAATA TTTTTTTTTT TAAGTGCGGT AGTTTTTAAA CTGTTTGTTT-' TTAAACAAAC 
TATAGAACTC TTCATTGTCA GCAAAGCAAA' GAGTCACTGC ATCAATGAAA GTTCAAGAAC 
CTCCTGTACT TAAACACGAT TCGCAACGTT CTGTTATTTT TTTTGTATGT TTAGAATGCT 
GAAATGTTTT TGAAGTTAAA TAAACAGTAT TACATTTTTA AAACTCTTCT CTATTATAAC 
AGTCAATTTC TGACTCACAG CAGTGAACAA 1 ACCCCCACTC CATTGTATTT GGAGACTGGC 
CTCCCTATAA ATGTGGTAGC TTCTTTTATT ACTCAGTGGC CAGCTCACTT AGGGCTGAGA 
TGAAGGAGAG GGCTACTTGA AGCTACTGTG TGATTTTGTT TGTGTCTGAG TGGCATTCAG 
ATGAAGTCTG GAGGAGTTAG GAGAACGACA- TAGGCAAGGT TCAGCAGCCT TCCAAGGTAT 
AGGAAGGTGG. GTGATTAGGA CTGAGGCTAT- CTAGGTTTAA CTTTTGTCCC ACCTCCACCC 
CCTATTTTGT GGGGCCAAAT GCATTGCTAA ACAGCAATTT CAGAGTGTAT GGTGTGTCAA. 
AAATTAAGGC CTTATTGTTT TTCTCTTTCA- CCCCTACCCC CCGTGCTCCT GGCACATATC " 
ACATTATTTG TGGTGCCCAA CATTTGGGGT CTTGAGCCTG CTGCTGGTCT CCTGGATGCC 
AGTGAGGGTA TGTGGGATGG GGTGGTGGGG TAGGGGACGG TATCCTTTTT TTGCTCCTAC 
TTGGAAACAC CAAACACCCC AAGGAAGATG' ATAGGCTCCA TCTTGGGCCA CCTGAGCTAT - 
AGGGCAGGCT. AATGGAATCA ACCATTTCTG AGCACTAAAT GTATCATGAA AAGTTGAATG 
GCCTGCTCAT- AAGTTTAGCT CATTCACTGG AAATGTAGAT: TGATGTTCAA TGTTAAACTG 
GAAGGAGCTT GGTTTGTGTG TCAGTGGTTA- TATTAGTGGG: TAGTGTAACA TTTTATCCAG 
GTTGGGGTGA" GGGGAGATGG . CCACAGTAGC AAGTGGTGAC ACTAAATACC ATTTTGAAGG 
CTGATGTGTA TATACATCAT TACTGTCCGT. AGCAATGAAG GATACAGTAC TGTGTTGTGG 
GTGAGTGTTG CTATTGCCCA GCATTAATAT TTGGGTGTGT- ATGTTTGAGG CTATGAAACA 
CGCAGGAGTG TTTTTGTGCT ATTAATTTTA AGAGAAAGCA GCTTTTTCTT AAAATTCACT 
GTTGAGAAAC TTGCATGTCT GGAGGCGGTG TCCTCTCCGC CCTGTCGGGT CCTGGATGAG r 
TACGAGTTAT GGTCACGGTC ACAGCCTGAT CTCTTATGTG TTCATAGCCA' TTCGCTCTCC 
CATCAGAACT GTTTGTCCTG. AATGTGTTCC' TCTAGTTCTA GAAAATGACC ACTAATTTAA 
AAAACTCGGT TGTGAGGTTT GCCCAGAGGC ACTTGTTCCA GAATTTCCCC TCCTGCTTCA 
GCCATGTCCT TGTCACTTGG CATTCTAAGC TAAAGCTTTA GCTTCCCAAT TCGTGATGTG 
CTAGGCCAAG ATTCGGGAGC TGTTGCCAGC CTCGTCAAAT ATGGAAGAGA AACAACCTGC 
GGTCAAAAGG GAGTGATTTG TTAAGTGGTG . CGCGTCTATC TCATAACTAG ATGTACCAAC 
CAGGGAAGGG CCAAGGATGG AAAGGGGTAA CTTTTGTGCT TCCAAAGTAG CTAAGCAGAA 
GTGGGGGAGC AGTTTAGCCA GATGATCTTT GATTAGGCAA ACATTGAGTT TTAAAGAGGC 
TGTCAAGTTG AGGCCACTTG GTCCATTAGC TGGGGCAGCA AGATCACTAC TCAACGTTTT 
CACACTGTGG CAAGATTGCT CTTCTAGTGG AATAATGCCC TAGTTTCTCT GAGATGATGT 
AAGTGGCATG ATGTTACCTA- AGGCTTAGGC TTAGCTTGAT TTCTGGGCCC ACTGTCTGTG 
TTCTTAAGAT GCCAACCTGT TGCTTTTTTT TTTTTTTTCC CCCATTTAAA AGGATAGTAC 
CTACTCCCTC TAACCACCTC ACCCCATTCT! TGAATGACAT TTTATCCTTC GGAAAGAACA 
AGGCTGTGAT GTAGTGACTA TTGTCTGTGT CTCCTGTGTG TGTCTGTTCT TGTCACAAAT 
GTATTTGGGG ACGTTGGATG CATTCATTTT CTGTAATAAA G 

Seq ID NO: 306 Protein sequence: 
Protein Accession ft: NP_005333.1 . 

1 11 ... 21 31 41 51 

11-11 I I 

MAKGDPKKPK GKMSAYAFFV QTCREEHKKK NPEVPVNPAB FSKKCSEHWK TMSGKEKSKF 
DSMAKADKVR YDREMKDYGP AKGGKKKKDP NAPKRPPSGF FLFCSEFRPK IKSTKPGISI 
GDVAKKLGEM WNNLNDSEKQ PYITKAAKLK EKYEKDVADY KSKGKFDGAK GPAKVARKKV 



Seq ID NO: 307 DNA sequence 
Nucleic Acid Accession #: NM_022342 
Coding sequence: 1. .2178 
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60 
120 
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299 



WO 02/086443 
1 11 2i j x . j 1 -f 1 

ATGGGTACTA GGAAAAAAGT TCATGCATTT GTCCGTGTCA AACCCACCGA TGACTTTGCT 60 

CATGAAATGA TCAGATACGG AGATGACAAA- AGAAGCATTG ATATTCACTT AAAAAAAGAC 120 

ATTCGGAGAG GAGTTGTCAA TAACCAACAG ACAGACTGGT CGTTTAAGTT GGATGGAGTT 160 

TTCACGATG CCTCCCAGGA CTTGGTTTAT GAGACAGTTG CAAAGGATGT GGTTTCTCAG 240 

CCCTCGATG GCTATAATGG CACCATCATG. TGTTATGGGC AGACGGGAGC TGGCAAGACA 300 

ACACCATGA TGGGGGCAAC TGAGAATTAC AAGCACCGGG GGATCCTCCC TCGTGCCCTG 360 

AGCAGGTTT TTAGGATGAT CGAAGAACGC CCCACACATG CCATCACTGT GCGTGTTTCC 420 

ACTTGGAAA TCTATAATGA GAGCCTGTTT GATCTCCTGT; CCACTCTGCC CTATGTTGGA 480 

CCTCAGTCA CACCAATGAC CATCGTGGAA AACCCTCAAG GAGTCTTCAT TAAGGGCTTG 540 

CAGTTCACC TCACAAGTCA GGAGGAGGAT GCATTCAGCC TCCTTTTTGA GGGTGAGACC . 600 

ACAGGATTA TAGCCTCCCA CACTATGAAC AAAAACTCTT CCAGATCACA CTGCATTTTC 660 

CCATCTACT TAGAGGCCCA ttcccggacc TTATCAGAGG AAAAGTACAT CACTTCCAAA 720 

TTAACTTGG TGGATCTGGC AGGCTCAGAG AGGCTGGGGA AGTCTGGGTC TGAGGGCCAA 780 

TCCTGAAGG AAGCCACCTA CATCAACAAA TCGCTCTCAT' TCCTGGAGCA GGCCATCATT 840 

CCCTTGGGG ACCAGAAGCG GGACCACATC CCCTTTCGGC AGTGCAAGCT- CACCCACGGT 900 

TGAAGGACT- CGTTAGGGGG AAACTGCAAT; ATGGTCCTCG TGACAAACAT- CTATGGAGAA 960 

CTGCCCAGT TAGAAGAAAC GCTATCTTGA'' CTGAGATTTG: CCAGCAGGAT GAAGCTAGTC '* 1020 

CCACTGAGC CTGCCATCAA . TGAAAAGTAX GATGCTGAGA- GAATGGTCAA GAACCTGGAG 1080 

AGGAACTAG CACTACTCAA GCAGGAGCTG : GCTATCCATG ACAGCCTGAC CAACCGCACC 1140 

TTGTGACCT' ATGACCCCAT GGATGAAATC CAGATTGCTG -AGATCAACTC CCAGGTGCGG 1200 

GGTACCTGG AGGGGACACT GGACGAGATC GACATAATCA GCCTTAGACA GATCAAGGAG 1260 

TGTTCAACC AGTTCCGGGT GGTTCTGAGC CAACAGGAAC AGGAAGTGGA GTCCACTTTG 1320 

GCAGGAAGT ACACCCTCAT TGACAGGAAT GACTTTGCAG CCATTTCTGC TATCCAGAAG 1380 

CGGGGCTTG. TGGATGTTGA TGGCCACCTA GTGGGTGAGC CTGAAGGACA AAACTTTGGA 1440 

TCGGAGTCG CCCCTTTCTC TACGAAACCT GGGAAGAAAG CCAAGTCCAA' GAAGACATTC 1500 

AAGAGCCAC TCAGGCCCGA CACCCCACCC TCCAAACCAG TGGCCTTTGA GGAGTTTAAG 1560 

ATGAGCAAG GTAGTGAGAT. CAACCGAATT TTCAAAGAAA ACAAATCCAT CTTGAATGAA 1620 

GGAGGAAAA GGGCCAGCGA GACCACACAG CACATCAATG CCATCAAGCG GGAGATTGAT 1680 

TGACCAAGG AGGCCCTGAA ■ TTTCCAGAAG TCACTACGGG AGAAGCAAGG CAAGTACGAA 1740 

ACAAGGGGC TGATGATCAT OGATGAGGAA GAATTCCTGC TGATCCTCAA GCTCAAAGAC 1800 
TCAAGAAGC AGTACCGCAG CGAGTACCAG GACCTGCGTG ACCTCAGGGC TGAGATCCAG - 1860 

ATTGCCAGC ACCTAGTGGA TCAGTGTCGC CACCGCCTGC TCATGGAATT TGACATCTGG 1920 

ACAATGAGT CCTTTGTCAT CCCTGAGGAC ATGCAGATGG' CACTGAAGCC AGGCGGCAGC 1980 

TCCGGCCAG GCATGGTCCC TGTGAACAGG ATTGTGTCTC TGGGAGAAGA TGACCAGGAC: 2040 

AATTCAGCC AGCTGCAGCA GAGGGTGCTT CCTGAGGGCC CTGATTCCAT. CTCCTTCTAC 2100 

ATGCCAAAG TCAAGATAGA GCAGAAGCAT AATTACTTGA AAACCATGAT GGGCCTCCAG 2160 
AGGCACATA GAAAATAG . 

Seq ID NO: 308 Protein sequence: 
Protein Accession #: NP_071737 

1 11 21 31 41 51 

KGTRKKVHAP VRVKPTDDFA HEMIRYGDDK RSIDIHLKKD IRRGWNNQQ TDWSFKLDGV 
LHDASQDLVY ETVAKDWSQ ALDGYNGTIM CYGQTGAGKT YTMMGATENY- KHRGILPRAL 
QQVPRMIEER PTHAITVRVS- YLEIYNESU DLLSTLPYVG PSVTPMTIVE NPQGVFIKGL 
SVHLTSQEED APSLLFEGET NRI I ASHTMNr KNSSRSHCIF TIYLEAHSRT LSEEKYITSK 
INLVDLAGSE RLGKSGSEGQ VLKEATYINK SLSFLEQAII ALGDQKRDHI PFRQCKLTHA 300 
LKDSLGGNCN MVLVTNI YGE AAQLESTLSS LRFASRMKLV TTEPAINEKY DAERMVKNLE 360 
KSIALLKQSb AIHDSLTNRT- FVTYDPMDBX':- QIAEINSQVR : RYLEGTLDEI- DIISLRQIKE 420 
VFNQFRWLS QQEQEVESTL RRKYTLIDRN? DFAAISAIQK- AGLVDVDGHL VGEPEGQNFG 480 
LGVAPPSTKP GKKAKSKKTF. KEPLRPDTPP- SKPVAPEBFK. NBQGSEINRI FKENKSILNE 540 
RRKRASETTQ HINAIKREID VTKEALNFQK SLREKQGKYE NKGLMI IDEE EPLLILKLKD 600 
LKKQYRSEYQ DLRDLRAEIQ YCQHLVDQCR HRLLMEFDIW YNESFVIPED MQMALKPGGS 660 
IRPGMVPVNR IVSLGEDDQD KFSQLQQRVL PEGPDSISFY NAKVKIEQKH NYLKTMMGLQ 720 
QAHRK 

Seq ID K0: 309 DNA sequence. 

Nucleic Acid Accession #: CAT cluster 

l 11 21 31 41 51 

XTTT TTTTTT TTTTTTTTAA TGCCTGCTGT CATGCTCTGT CTACCAGGGT GAATTTCCAA 60 

AAATTTCTGC ATAGCAATTT TAGCCAAAAC TATATATGTT CTGGGGAGGA TAGGCATAGG 120 

CACATTGAAG ACCAAAGGAA AGAGTGAAGA" AGTGTAGTTG GGTCATTGTG AATGGATGTT 180 

TAGATTGTCA AGAAAAGTGG GCCAGAGGCC CCACCTCACA CTAGGACGGC AATTGCCTCT 240 

CATTAGTATC TCAGGCACCA TGGGTCTTAT TTGGTGTCAT- AAGAAACACC CTCAACAAAG 300 

TAATGAACCC TCAGCCTCCA GCTTCTCTTC TTCGGGATTC TTCTTAGGGC CTCCTTTTTC 360 

CTTTTATGTT TCCAGTACCC TGAATTTCTT ATTCCCATCC CCCATTAAAA TCTGCTTCAA 420 

AGAAAAAACA AGAAGGACAC ATTCACTTTA AGATCCAAAT GAATGATAAG AGCTTAAAAC 480 
ATTATACTTA TCAGTATTAT TTGCATTTTT ATAGAAACCA AAACCATATT TCAACAAC 

Seq ID NO: 310 DNA sequence 

Nucleic Acid Accession NM_018622.2 

Coding sequence: 1-1140 

! 11 . 21 31 41 51 

ATGGCGTGGC GAGGCTGGGC GCAGAGAGGC TGGGGCTGCG GCCAGGCGTG GGGTGCGTCG 60 

GTGGGCGGCC GCAGCTGCGA GGAGCTCACT GCGGTCCTAA CCCCGCCGCA GCTCCTCGGA 120 

CGCAGGTTTA ACTTCTTTAT TCAACAAAAA TGCGGATTCA GAAAAGCACC CAGGAAGGTT 180 

GAACCTCGAA GATCAGACCC AGGGACAAGT GGTGAAGCAT ACAAGAGAAG TGCTTTGATT 240 

CCTCCTGTGG AAGAAACAGT CTTTTATCCT TCTCCCTATC CTATAAGGAG TCJGATAAAA 300 

CCTTTATTTT TTACTGTTGG GTTTACAGGC TGTGCATTTG GATCAGCTGC TATTTGGCAA 360 



60 
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WO 02/086443 

TATGAATCAC TGAAATCCAG GGTCCAGAGT TATTTTGATG 
GATAGCATAA GACCACAAAA AGAAGGAGAC TTCAGAAAGG 
AACCTAAGTG ATGGCCAGCG GACTGTGACA GGTATTATAG 
TGTTTATGGA GAGTACCTTC TCTGCAGCGG ACAATGATCA 
GCCTCAAAGG TCCTTTGTTC TCCAATGTTG CTGTCAACAT 
CACATGGCAG CAAATATGTA TGTTTTGTGG AGCTTCTCTT 
GGTCAAGAGC AGTTCATGGC AGTGTACCTA . TCTGCAGGTG 
TACCTGGGT A AAGTTGCCAC AGGAAGATAT GGACCATCAC 
ATGACAGTCC TCGCAGCTGT CTGCACTAAG; ATCCCAGAAG 
CTTCCGATGT TCACGTTCAC AG CAGGGAAT GCCCTGAAAG 
GCAGGAATGA TCCTGGGATG GAAATTTTTT GATCATGCGG 
TTTGGAATAT: GGTATGTTAC TTACGGTCAT GAACTGATTT 
GTGAAAATCT GGCATGAAAT AAGGACTAAT GGCCCCAAAA 



PCT/US02/12476 



Seq ID NO: 311 Protein sequence: 
Protein Accession #: NP_061092 .2 



1 
I 

MAWRGWAQRG 
BPRRSDPGTS 
YESLKSRVQS 
CLWRVPSLQR 
GQBQFMAVYL 
LPMFTFTAGN 
VKIWHEIRTN 



11 
I 

WGCGQAWGAS 
GEAYKRSALI 
YFDGIKADWI* 
TMIRYFTSNP' 
SAGVISNFVS 
ALKAIIAMDT. 
GPKKGGGSK 



21 
I 

VGGRSCEEI/T 
PPVEETVFYP 
DSIRPQKEGD 
ASKVLCSPML 
YLGKVATGRY 
AGMILGWKFP 



31 
I 

AVLTPPQLLG 
SPYPIRSlilK 
FRKEINKWWN 
1»STFSHFSI»P 
GPSLGASGAI- 
DHAAHLGGAL 



GTATAAAAGC 
AGATTAACAA 
CTGCAAATGT 
GATATTTCAC 
• TCAGTCACTT 
CCAGCATAGT 
TTATTTCCAA 
TTGGTGCATC 
GGAGGCTTGC 
CCATTATCGC 
CACATCTTGG 
GGAAGAACAG 
AAGGAGGTGG 



41 
I 

RRFNFFIQQK 
PLFFTVGFTG 
NLSDGQRTVT 
HMAANMYVLW 
MTVLAAVCTK 
PGIWYVTYGH 



TGATTGGTTG 
GTGGTGGAAT 
CCTTGTATTC 
ATCGAATCCA 
CTCCTTATTT 
GAACATTCTG 
TTTTGTCAGT 
TGGTGCCATC 
CATTATTTTC 
CATGGATACA 
GGGAGCTCTT 
GGAGCCGCTA 
CTCTAAGTAA 



51 
I 

CGFRKAPRKV 
CAFGSAAIWQ 
GIIAANVLVF 
SFSSSIVNII, 
IPEGRLAIIP 
BLIWKNREPL 



420 
' 4B0 
S4 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



Seq ID NO: 312 DNA sequence 
Nucleic Acid Accession #: NM_000625 
Coding sequence: 195.. 3656 



CTCTCGGCCA 
CACAGGTCTC 
GCCCCACAGT 
GTAAAGCCAT- 
ATGCAATGAA 
CGAGTCCAGT 
CCCCGCAGCC 
CAACCCCATT- 
TCCAAGACAC 
TGGGGTCCAT 
CAGATGAGCT 
AGGCAAAAAT- 
CAGTAACCTA 
ATGCCCCACG 
GCTGTTCCAC 
ACAATGGCAA 
ACTTCCGGGT 
GCATCAGAGG 
AGCCCAAGTA 
CTGAGCTCTT 
ACGAGTGGTT 
TGCTTGAGGT 
CAGAGATCGG 
GCAGGAGAAT 
TTGAGATCAA 
ACCACTCGGC 
GCTGCCCGGC 
TTCACCAGGA 
GGAAAACCCA 
AAGTCTTGGT. 
GAGTCAGAGT. 
ACCTGGGGGC 
GGCTGAGCTG 
GAGACTGCCC 
ACAACAAATT 
CCTTTGCTCA 
TGGGAGAAGG 
CCTTCAAGGC 

- AGCTCTACAC 
AGCCTTTGGA 
GGCTCAAATC 
AACTCTCCTG 
GCCCAGGCAA 
CACCCCACCA 
ACAAGAGGCT 
CACCCCCAAC 

" GACAGAGGCT 
GCCCCACATT 
TGCTTTCCCA 
ACACGCCCAC 
AGGGTCCCCT 
CAGTGCCCTG 



11 
I 

CCTTTGATGA 
TTCCTGGTTT 
GAAGAACATC 
AGAGATGGCC 
TGGGGAAAAA- 
GACACAGGAT 
CCTGGTGGAG' 
GTCCTCCCCA 
ACTTCACCAT- 
TATGACTCCC 
TCTACCTCAA 
AGAGGAACAT 
CCAACTGACG 
CTGCATTGGG 
TGCCCGGGAA- 
CATCAGGTCG: 
GTGGAATGCT 
GGACOGTGCC 
CGGCCGCTTC 
CGAAATCCCA 
TCGGGAACTG 
GGGCGGCCTG 
AGTCCGGGAC 
GGGCCTGGAA 
CATTGCTGTG 
TGCAGAATCC 
AGACTGGATT. 
GATGCTGAAC 
TGTCTGGCAG 
CAAAGCTGTG 
CACCATCCTC 
CTTATTCAGC 
CCTGGAGGAG 
TGGCAATGGA 
CAGGTACGCT 
TGACATTGAT 
GGATGAGCTC 
AGCCTGTGAG 
CTCCAATGTG 
CCTCAGCAAA 
TCGGCAGAAT 
TGAGGATGGC 
CCAGCCGGCC 
GGCAGTGCGC 
GCCCCCCTGC 
CCAGCTGCTG 
GGAGGCCCTG 
CCTGGAGGTG 
GCTCCXCATT 
GGAGATCCAC 
GCACCACGGC 
CTTTGTGCGG 



21 

I 

GGGGACTGGG 
GACTGTCCTT 
TGAGCTCAAA 
TGTCCTTGGA 
GGCATCAACA 
GACCTTCAGT 
ACGGGAAAGA 
CGGCATGTGA 
AAGGCCAAAG 
AAAAGTTTGA 
GCTATCGAAT 
CTGGCCAGGG: 
GGAGATGAGC 
AGGATCCAGT 
ATGTTTGAAC 
GCCATCACCG 
CAGCTCATCC 
AAOGTGGAAT 
GATGTGGTCC 
CCTGACCTTG- 
GAGCTAAAGT 
GAGTTCCCAG 
TTCTGTGATG; 
ACGCACAAGC 
CTCCATAGTT 
TTCATGAAGT 
TGGCTGGTCC 
TACGTCCTGT 
GACGAGAAGC 
CTCTTTGCCT 
TTTGCGACAG 
TGTGCCTTCA 
GAACGGCTGC 
GAGAAACTGA 
GTGTTTGGCC 
CAGAAGCTGT 
AGTGGGCAGG- 
ACGTTTGATG 
ACCTGGGACC 
GCCCTCAGCA 
CTACAAAGTC 
CAAGGCCTGA 
CTGGTCCAAG 
CTGGAGGCCC 
TCACTCAGCC 
CTCCAAAAGC 
TGCCAGCCCT 
CTAGAGGAGT 
CTGAAGCCCA 
CTGACTGTGG 
GTCTGCAGCA 
AATGCCAGCG 



31 

I 

CAGTTCTAGA 
ACCCCGGGGA 
TCCAGATAAGr 
AATTTCTGTT 
ACAATGTGGA 
ATCACAACCT 
AGTCTCCAGA 
GGATCAAAAA 
GGATTTTAAC 
CCAGAGGACC 
TTGTCAACCA 
TGGAAGCGGT 
TCATCTTCGC 
GGTCCAACCT 
ACATCTGCAG 
TGTTCCCCCA 
GCTATGCTGG 
TCACTCAGCT 
CCCTGGTCCT/ 
TGCTTGAGGT; 
GGTACGCCCT. 
GGTGCCCCTT- 
TCCAGCGCTA- 
TGGCCTCGCT 
TCCAGAAGCA 
ACATGCAGAA 
CTCCCATGTC 
CCCCTTTCTA 
GGAGACCCAA 
GTATGCTGAT 
AGACAGGAAA 
ACCCCAAGGT 
TGTTGGTGGT 
AGAAATCGCT 
TCGGCTCCAG 
CCCACCTGGG 
AGGACGCCTT 
TCCGAGGCAA 
CGCACCACTA 
GCATGCATGC 
CGACATCCAG 
ACTACCTGCC 
GCATCCTGGA 
TGGATGAGAG 
AGGCCCTCAC 
TGGCCCAGGT 
CAGAGTACAG 
TCCCGTCCCT 
GGTTCTACTC 
CCGTGGTCAC 
CATGGCTCAA 
GCTTCCACCT 



41 

I 

CAGTCCCGAA 
GGCAGTGCAG 
TGACATAAGT 
CAAGACCAAA 
GAAAGCCCCC 
CAGCAAGCAG 
ATCTCTGGTC 
CTGGGGCAGC 
TTGCAGGTCC 
CAGGGACAAG 
ATATTACGGC 
AACAAAGGAG 
CACCAAGCAG 
GCAGGTCTTC 
ACACGTGCGT 
GCGGAGTGAT 
CTACCAGATG 
GTGCATCGAC 
GCAGGCCAAT 
GGCCATGGAA 
GCCTGCAGTG 
CAATGGCTGG 
CAACATCCTG 
CTGGAAAGAC 
GAATGTGACC 
TGAATACCGG 
TGGGAGCATC 
CTACTATCAG. 
GAGAAGAGAG 
GCGCAAGACA 
ATCAGAGGCG 
TGTCTGCATG 
GACCAGTACG 
CTTCATGCTG 
CATGTACCCT. 
GGCCTCTCAG 
CCGCAGCTGG 
ACAGCACATT 
CAGGCTCGTG 
CAAGAACGTG 
CCGTGCCACC 
GGGGGAGCAC 
GCGAGTGGTG, 
TGGCAGCTAC 
CTACTTCCTG 
GGCCACAGAA 
CAAGTGGAAG 
GCGGGTGTCT 
CATCAGCTCC 
CTACCACACC 
CAGCCTGAAG 
CCCCGAGGAT 



51 
1 

GTTCTCAAGG 

CCAGCTGCAA 

GACCTGCTTT 

TTCCACCAGT '- 

TGTGCCACCT. 

CAGAATGAGT 

AAGCTGGATG 

GGGATGACTT 

AAATCTTGCC 

CCTACCCCTC- 

TCCCTCAAAG 

ATAGAAACAA- 

GCCTGGCGCA 

GATGCCCGCA 

TACTCCACCA- 

GGCAAGCACG- 

CCAGATGGCAv 

CTGGGCTGGA 

GGCCGTGACC 

CATCCCAAAT 

GCCAACATGC • 

TACATGGGCA 

GAGGAAGTGG 

CAGGCTGTCG 

ATCATGGACC 

TCCCGTGGGG 

ACCCCCGTGT 

GTAGAGGCCT 

ATTCCATTGA 

ATGGCGTCCC 

CTGGCCTGGG 

GATAAGTACA 

TTTGGCAATG 

AAAGAGCTCA 

CGGTTCTGCG 

CTCACCCCGA 

GCCGTGCAAA 

CAGATCCCCA 

CAGGACTCAC 

TTCACCATGA 

ATCCTGGTGG 

CTTGGGGTTT 

GATGGCCCCA- 

TGGGTCAGTG 

GACATCACCA 

GAGCCTGAGA 

TTCACCAACA 

GCTGGCTTCC 

CCCCGGGATC 

CGAGATGGCC 

CCCCAAGACC 

CCCTCCCATC 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1690 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 



301 
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WO 02/086443 

CTTGCATCCT CATCGGGCCT GGCACAGGCA 
GGCTCCATGA * CTCCCAGCAC* AAGGGAGTGC 
GOCGCCGCCC AGATGAGGAC CACATCTACC 
GGGTGCTGCA- 'TGCGGTGCAC ACAGCCTATT 
TTCAGGACAT CCTGCGGCAG CAGCTGGCCA 
CAGGCCACCT CTATGTTTGC GGGGATGTGC 
" AGCAGCTGGT GGCTGCCAAG CTGAAATTGA 
AGCTCAAGAG CCAGAAGCGC TATCACGAAG 
CGAAGAAGGA CAGGGTGGCG GTGCAGCCCA 
CTACAGGAGG GGTTAAAGCT GCCGGCACAG 
CTGAGGTCAC AGGGCCTGGG GAGATGGAGG 
TTCCTCAACG TTGCTCCCCA' TCAAGCCCTT 
ATTGATCGGA GCCTC 

Seq ID NO: 313 Protein sequence: 
Protein Accession ft: NP_ 000616 



PCI7US02/12476 



TCGCGCCCTT 
GGGGAGGCCG 
AGGAGGAGAT 
COCGCCTGCC 
GCGAGGTGCT 
GCATGGCCCG 
ATGAGGAGCA 
ATATCTTTGG 
GCAGCCTGGA' 
AACTTAAGGA 
AAAGTGATAT 
TACTTGACCT 



COGCAGTTTC 
CATGACCTTG 
GCTGGAGATG 
TGGCAAGCCC 
CCGTGTGCTC 
GGACGTGGCC 
GGTCGAGGAC 
TGCTGTATTT 
GATGTCAGCG. 
TGGAGCCAGC 
CCCCCAGCCT 
CCTAACAAGT 



TGGCAGCAAC 

GTGTTTGGGT 

GCCCAGAAGG 

AAGGTCTATG 

CACAAGGAGC 

CACACCCTGA 

TATTTCTTTC 

CCTTACGAGG . 

CTCTGAGGGC 

TCTGCATTAT 

CAAGTCTTAT 

AGCACCCTGG 



1 
I 

MACPWKFLFK 
VETGKKSPES 
TPKSLTRGPRi 
LTGDELIFAT. 
RSAITVFPQR 
RFDWPLVLQ' 
GLEFPGCPPN 
AVLHSFQKQN: 
LNYVLSPFYY." 
IliFATETGKS 
NGEKLKKSLF 
ELSGQEDAFR 
SKALSSMHAK 
PALVQGILER 
LZiLQKLAQVA 
PILKPRFYSI' 
VRNASGFHLP 
EDHIYQEEML 
VCGDVRMARD 
VAVQPSSLEM 



11 
I 

tkfhqyamng 
lvkldatpls; 
dkptppdell: 

KQAWRNAPRC 
SDGKHDFRVW 
ANGRDPELFE. 
GWYMGTEIGV 
VTIMDHHSAA 
YQVEAWKTHV 
EALAWDLGAIi 
MLKELNNKFR 
SWAVQTFKAA 
NVFTMRLKSR 
WDGPTPHQA 
TEEPERQRLE 
SSPRDHTPTE 
EDPSHPCILI 
EMAQKGVLHA 
VAHTLKQLVA 
SAL 



21 
I 

EKGINNNVEX 
SPRHVRIKNWr 
PQAIEFVNQY- 
IGRIQWSNLQ 
KAQLIRYAGY- 
IPPDLVIiEVA- 
RDFCDVQRYN 
ESFMKYMQNE 
WQDEKRRPKR 
FSCAFNPKW 
YAVFGLGSSM 
CETFDVRGKQ 
QNLQSPTSSR 
VRLEALDESG 
ALOQPSEYSK 
IHLTVAWTY- 
GPGTGIAPFR 
VHTAYSRLPG 
AKIiKLNEEQV 



31 
I 

APCATSSPVT 
GSGMTFQPTL. 
YGSLKEAKIE 
VFDARSCSTA 
QMPDGSIRGD 
MEHPKYEWFR 
ILEEVGRRMG 
YRSRGGCPAD 
REIPLKVLVK 
CMDKYRLSCL 
YPRFCAFAHD 
HIQIPKLYTS 
ATILVELSCE 
SYWVSDKRLP 
WKFTNSPTFL 
HTRDGQGPLK 
SFWQQRLHDS 
KPKVYVQDII* 
EDYFFQLKSQ 



41 
I 

QDDLQYHNLS 
,. HHKAKGILTC 
EHLARVEAVT 
. REMFEHICRH. 
PANVEFTQLC 
ELELKWYALP 
i LETHKLASLW 
WIWLVPPMSG 
• AVLFACMLMR 
, EEERLLLWT. 
i. IDQKLSHLGA 
. NVTWDPHHYR 
!. DGOGLNYLPG 
PCSLSQALTY 
EVLEEFPSLR- 
HGVCSTWLNS 
QHKGVRGGRM 
RQQIiASEVLR 
KRYHEDIFGA 



51 
I 

KQQNESPQPIi 

RSKSCLGSIM 

KEIETTVTYQ . 

VRYSTNNGNI 

IDLGWKPKYG 

AVANMLLEVG 

KDQAWEINI 

SITPVFHQEM 

KTMASRVRVT 

STFGNGDCPG 

SQLTPKGEGD 

LVQDSQPLDL 

EHLGVCPGNQ 

FLDITTPPTQ 

VSAGFLLSQL 

LKPQDPVPCF 

TLVFGCRRPD. 

VLHKEPGHLY 

VFPYEAKKDR 



Seq ID NO: 314 DNA sequence 
Nucleic Acid Accession 8: XM_08?254 
Coding sequence: 47.. 2332 



AGAGTACGTG 
ATGTTCAATT-; 
AACACCAGAC 
CTTATCCCAT? 

aattaaagaa: 
caatgttcaa- 
gttggagtac 

TGGTATTGTG 
GGAACGGTAC. 
AATTGTTCAG 
TCTCCCTAAA- 
TTTGAAAGGG 
GGAAATAGAT 
GGCAGCTGTT 
AGACAGACTA 
AGTATGGGTA 
CCATTTTCAT 
TGCTGAACAA 
GCTGGTAGTG 
GGAAGTTTGC 
AAAAGTAATA 
TGGTGCTAAT 
AGAAGGAAGA 
CAAATTGCTT 
TTTTTTTTAT. 
GTTTTCTCAG 
tTCCCTACCT: 
AAATAAGCCC 
TCTTTATTGG 
ACTAATAGGG 
TGGCACTTTG 
TCATTTTTGG 
ATTTTCCTTG 
TGTGTTTATT . 
TACATGTCTA 
TACTGAAAAG 
CTGTTTCCCG 
AGG AAGATGT 
CAACGACAGG 



11 

I . 

TTTACAGATA 
AATGGCATGA 
TCTTCAGAAG 
CTTACAACCA 
CATGATCTCT 
ACTGACTGCA 
TATGCATCTT 
TTTATTGGCA- 
AAACTGCTTC 
GCACCTTCAG 
TGTATAGGTG' 
CTAAGAACTC 
AAACGCATAT- 
TTCCAGTTCA 
CAAGATAAAG. 
CTTACTGGGG 
AGAACCATGA 
TTGAGGCAGC 
GATGGGACCA 
AGAAATTGTT 
AGACTAATAA 
GACGTAAGCA 
CAGGCTGCAA 
TTTGTTCATG 
AAGAATGTGT 
CAAACATTGT 
ATTCTGATAT 
ACCCTTTATC 
ACCATCCTGG 
AAAGATACAT 
GTCTTCACAG 
ACTTGGATCA 
TTTTATGGAG 
CAGCTCCTGT' 
TTTCTTGATA 
GCACAGCTTA 
GAAGGAGAAG 
AGTCCAACCC 
AGCATCTTGA 



21 
I 

AAACTGGTAC 
AATACCAAGA" 
GAAACTTATC 
GTTCCTCTTT 
TCTTTAAAGC 
CTGGTGATGG 
CACCAGATGA 
ATTCTGAAGA 
AT ATTCTGGA 
GTGAGAAGTT 
GAGAAATAGA 
TGTGTATAGC 
TTGAAGCCAG 
TAGAGAAAGA 
TTCGAGAAAC 
ATAAACATGA 
ACATCCTTGA 
TTGCCAGAAG 
GCCTATCTCT 
CAGCTGTATT 
AAATATCACC 
TGATACAAGA 
GAAACAGTGA 
GTCATTTTTA 
GCTTTATCAC 
ATGACAGCGT- 
ATAGTCTTTT 
GAGACATTAG 
GCTTCAGTCA 
CTCTGCTTGG 
TCATGGTTAT 
ACCATCTCGT 
GGATTCTCTG 
CAAGTGGTTC 
TCATAAAGAA 
CTGAAACAAA 
CAGCGTGTGC 
ACATCAGCAG 
CTCTCTCCAC 



31 

1 

ACTGACAGAA 
AATTAATGGT 
TTATCTTAGT 
CAGAACCAGT 
AGTCAGTCTC 
TCCCTGGCAA 
AAAGGCTCTA 
AACTATGGAG 
ATTTGATTCA- 
ATTATTTGCT. 
AAAAACCAGA 
ATATAGAAAA 
GACTGCCTTG 
CCTGATATTA 
TATTGAAGCA 
AACAGCTGTT 
ACTTATAAAC 
AATTACAGAG 
TGCACTCAGG 
ATGCTGTCGT 
TGAGAAACCT 
AGCCCATGTT 
CTATGCAATA 
TTATATTAGA 
ACCCCAGTTT 
GTACCTGACT. 
GGAACAGCAT- 
TAAAAACCGC 
TGCCTTTATT 
AAATGGCCAG 
TACAGTCACA 
TACCTGGGGA 
GCCATTTTTG 
TGCTTGGTTT 
GGTCTTTGAC 
TGCAGGTATC 
ATCTGTTGGA 
ATCATGGAGT 
AATGGACTCA 



41 

! 

AATGAGATGC 
AGACTTGTAC 
AGTTTATCCC 
CCTGAAAATG 
TGTCACACTG 
TCCAACCTGG 
GTAGAAGCTG 
GTTAAAACTC 
GATCGTAGGA 
AAAGGAGCTG 
ATTCATGTAG 
TTTACATCAA 
CAGCAGCGGG 
CTTGGAGCCA 
TTGAGAATGG 
AGTGTGAGTT 
CAGAAATCAG 
GATCATGTGA 
GAGCATGAAA 
ATGGCTCCAC 
ATAACATTGG 
GGCATAGGAA 
GCCAGATTTA 
ATAGCTACCC 
TTATATCAGT 
TTATACAATA 
GTAGACCCTC 
CTCTTAAGTA 
TTCTTTTTTG 
ATGTTTGGAA 
GTAAAGATGG 
TCTATTATAT 
GGCTCCCAGA 
GCCATAATCC 
CGACACCTCC 
AAGTGCTTGG 
AGAATGCTGG 
GCATCGGATC 
TCTACTTGTT 
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I 

AGTTTCGGGA 

CCGAAGGACC 

ATCTTAACAA- 

AAACTGAACT 

TACAGATTAG 

CACCATCGCA 

CTGCAAGGAT 

TTGGAAAACT 

GAATGAGTGT • 

AGTCATCAAT 

ATGAATTTGC 

AAGAGTATGA 

AAGAGAAATT 

CAGCAGTAGA 

CTGGTATCAA 

TATCATGTGG 

ACAGCGAGTG 

TTCAGCATGG 

AACTATTTAT 

TGCAGAAAGC 

CTGTTGGTGA 

TCATGGGTAA 

AGTTCCTCTC 

TTGTACAGTA 

TCTACTGTTT 

TTTGTTTTAC 

ATGTGTTACA 

TTAAAACATT 

GATCCTATTT 

ACTGGACATT 

CTCTGGAAAC 

TTTATTTTGT 

ATATGTATTT 

TCATGGTTGT 

ACCCTACAAG 

ACTCCATGTG 

AACGAGTTAT 

CTTTCTATAC 

AAAGGGGCAG 



3180 
3240. 
3300 
.3360 
3420 
3480 
354Q . 
3600 
■3660 
3720 
3780 
3840 



60 
120 
180 
240 
300 
360, 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
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■ : TAgSotSI T^GCCAG TTCACCTCCT TTCCTAAAAT TCAGTGTGAT CACCCTGTTA , 2400 

ATGGCCACAC TAGCTCTGAA- ATTAATTTCC AAAATCTTTG TAGTAGTTCA TACCCACTCA 24 SO 

GAGTTATAAT GGCAAACAAA CAGAAAGCAT, TAGTACAAGC CCCTCCCAAC ACCCTTAATT 2520 

TGAATCTGAA - CATGTTAAAA . TTTGAGAATA* AAGAGACATT TTTCATCTCT TTGTCTGGTT 2580 

5 . TGTCCCTTGT GCTTATGGGA CTCCTAATGG ^CATTTCAGTC TGTTG CTGAG GCCATTATAT 2640 

TTTAATATAA ATGTAGAAAA 'AAGAGAGAAA- TCTTAGTAAA GAGTATTTTT TAGTATTAGC 2700 

■ TTGATTATTG ACTCTTCTAT TTAAATCTGC TTCTGTAAAT. TATGCTGAAA GTTTGCCTTG 2760 

AGAACTCTAT TTTTTTATTA - GAGTTATATT TAAAGCTTTT. CATGGGAAAA GTTAATGTGA 2820 

ATACTGAGGA' ATTTTGGTCC CTCAGTGACC TGTGTTGTTA- ATTCATTAAT GCATTCTGAG 2880 

10 TTCACAGAGC AAATTAGGAG AATCATTTCC . AACCATTATT. TACTGCAGTA TGGGGAGTAA 2940 

ATTTATACCA ATTCCTCTAA CTGTACTGTA ACACAGCCTG TAAAGTTAGC CATATAAATG 3000 ' 

CAAGGGTATA TCATATATAC AAATCAGGAA- TCAGGTCCGT TCACCGAACT TCAAATTGAT 3060 

GTTTACTAAT ATTTTTGTGA- CAGAGTATAA AGACCCTATA GTGGGTAAAT TAGATACTAT 3120 

TAGCATATTA TTAATTTAAT, GTCTTTATCA- TTGGATCTTT TGCATGCTTT AATCTGGTTA 3180 

1 5 ACATATTTAA ATTTGCTTTT, TTTCTCTTTA CCTGAAGGCT. CTGTGTATAG TATTTCATGA 3240 

CATCGTTGTA CAGTTTAACT ATATCAATAA AAAGTTTGGA' CAGTATTTAA ATATTGCAAA 3300 

TATGTTTAAT TATACAAATC AGAATAGTAT GGGTAATTAA ATGAATACAA- AAAGAAGAGC . 3360 

CTCTTTCTGC AGCCGACTTA GACATGCTCT TCCCTTTCTA TAAGCTAGAT TTTAGAATAA 3420 

AGGGTTTCAG TTAATAATCT TATTTTCAGG TTATGTCATC TAACTTATAG CAAACTACCA 3480 

20 CAATACAGTG AGTTCTGCCA GTGTCCCAGT ACAAGGCATA TTTCAGGTGT GGCTGTGGAA 3540 

TGTAAAAATG CTCAACTTGT ATCAGGTAAT GTTAGCAATA AATTAAATGC TAAGAATGAT 3600 

TAATCGGGTA CATGTTACTG TAATTAACTC ATTGCACTTC AAAACCTAAC TTCCATCCTG 3660 

. AATTTATCAA' GTAGTTCAGT ATTGTCATTT. GTTTTTGTTT : . TATTGAAAAG TAATGTTGTC 3720 

TTAAGATTTA GAAGTGATTA • TTAGCTTGAG; AACTATTACC. CAG CTCTA AG CAAATAATGA 3780 

25 TTGTATACAT ATTAAGATAA TGGTTAAATG: ' CGGTTTTACC AAGTTTTCCC TTGAAAATGT 3840 

AATTCCTTTA TGGAGATTTA- TTGTGCAGCC CTAAGCTTCC TTCCCATTTC" ATGAATATAA 3900. 

GGCTTCTAGA- ATTGGACTGG- CAGGGGAAAG AATGGTAGAG ACAGAAATTA AGACTTTATC 3960. 

CTTGTTTGCT TGTAAACTAT TATTTTCTTG. CTAATGTAAC ATTTGTCTGT TCCAGTGATG 4020 

TAAGGATATT AAGTTATTAA • GCTAAATATT AATTTTCAAA AATAGTCCTT CTTTAACTTA 4080 

30 GATATTTCAT AGCTGGATTT AGGAAGATCT: , GTTATTCTGG AAGTACTAAA AAGAATAATA ... 4140 

CAAOGTACAA TGTCTGCATT CACTAATTCA TGTTCCAGAA- GAGGAAATAA TGAAGATATA 4200 

CTCAGTAGAG T ACT AGGTGG '- GAGGATATGG; AAATTTGCTC ATAAAATCTC TTATAAAACG 4260 

TGCATATAAC AAAATGACAC CCAGTAGGCC TGCATTACAT TTACATGACC GTGTTTATTT 4320 

GCCATCAAAT AAACTGAGTA- CTGACACCAG - ACAAAGACTC CAAAGTCATA AAATAGCCTA 4380 

35 TGACCAACTG CAGCAAGACA GGAGGTCAGC TCGCCTATAA- TGGTGCTTAA AgTGTGATTG 4440 

ATGTAATTTT CTGTACTCAC CATTTGAAGT TAGTTAAGGA GAACTTTATT TTTTTAAAAA 4500 

AAGTAAATGG CAACCACTAG TGTGCTCATC CTGAACTGTT- ACTCCAAATC CACTCCGTTT 4560 

TTAAAGCAAA ATTATCTTGT GATTTTAAGA- AAAGAGTTTT CTATTTATTT AAGAAAGTAA 4620 

CAATGCAGTC TGCAAGCTTT CAGTAGTTTTr CTAGTGCTAT ATTCATCCTG TAAAACTCTT 4680 

40 ACTACGTAAC CAGTAATCAC*. AAGGAAAGTG TCCCCTTTGC ATATTTCTTT AAAATTCTTT 4740 

CTTTGGAAAG TATGATGTTG ATAATTAACT. TACCCTTATC TGCCAAAACC AGAGCAAAAT 4B00 

GCTAAATACG TTATTGCTAA TCAGTGGTCT CAAATCGATT TGCCTCCCTT TGCCTCGTCT 4B60 

GAGGGCTGTA AGCCTGAAGA TAGTGGCAAG: CACCAAGTCA GTTTCCAAAA TTGCCCCTCA 4920 

GCTGCTTTAA GTGACTCAGC ACCCTGCCTC AGCTTCAGCA GGCGTAGGCT CACCCTGGGC 4980 

45 GGAGCAAAGT ATGGGCCAGG GAGAACTACA : GCTACGAAGA CCTGCTGTCG AGTTGAGAAA 5040 

AGGGGAGAAT TTATGGTCTG: AATTTTCTAA CTGTCCTCTT TCTTGGGTCT AAAGCTCATA 5100 

ATACACAAAG GCTTCCAGAC CTGAGCCACA CCCAGGCCCT ATCCTGAACA GGAGACTAAA 5160 

CAGAGGCAAA TCAACCCTAG GAAATACTTG CATTCTGCCC TACGGTTAGT ACCAGGACTG 5220 

AGGTCATTTC TACTGGAAAA' GATTGTGAGA . TTGAACTTAT CTGATCGCTT GAGACTCCTA 5280 

50 ATAGGCAGGA GTCAAGGCCA CTAGAAAATT GACAGTTAAG AGCCAAAAGT TTTTAAAATA 5340 

TGCTACTCTG AAAAATCTCG. TGAAGGCTGT AGGAAAAGGG- AGAATCTTCC ATGTTGGTGT 5400 

TTTTCCTGTA AAGATCAGTT. TGGGGTATGA TATAAGCAGG. TATTAATAAA AATAACACAC S460 

CAAAGAGTTA CGTAAAACAT GTTTTATTAA TTTTGGTCCC CACGTACAGA CATTTTATTT . 5520 , 

CTATTTTGAA ATGAGTTATC TATTTTCATA AAAGTAAAAC ACTATTAAAG TGCTGTTTTA 5580 

5 5 TGTGAAATAA r CTTGAATGTT . GTTCCTATAA 1 - AAAATAGATC ATAACTCATG ATATGTTTGT 5640 

AATCATGGTA' ATTTAGATTT TTATGAGGAA TGAGTATCTG GAAATA TTGT AGCAATACTT 5700 

GGTTTAAAAT TTTGGACCTG AGACACTGTG. GCTGTCTAAT GTAATCCTTT AAAAATTCTC 5760 

TGCATTGTCA GTAAATGTAG TATATTATTG • TACAGCTACT CATAATTTTT TAAAGTTTAT 5820 
GAAGTTATAT TTATCAAATA AAAACTTTCC' TATAT 



60 



80 



Seq ID NO: 315 Protein sequences 
Protein Accession #: XP_087254 



65 1 - 11 . 21 31 41 51 

ilQPRECSING MKYQEINGRL VPEGPTPDSS EGNLSYLSSIi SHLNNLSHLT TSSSFRTSPB 60 

NETELIKSHD LFFKAVSLCH ■ TVQISNVQTD CTGDGPWQSN LAPSQLEYYA SSPDEKALVE 120 

AAARIGIVP1 GNSEETMBVK. TLGKLERYKL LHILEFDSDR RRMSVIVQAP SGBKLLFAKG 180 

70 AESSILPKCI GGEIEKTRIH VDEFALKGLR TLCIAYRKFT SKEYEEIDKR IFEARTALQQ 240 

REEKLAAVFQ FIEKDLILLG ATAVEDRLQD KVRETIEALR MAGIKVWVLT GDKHETAVSV 300 

SLSCGHFHRT MNILELINQK SDSECAEQLR QLARRITEDH VIQHGLWDG TSLSLALREH 360 

EKLFMEVCRN- CSAVIiCCRMA PLQKAKVIRL IKISPEKPIT LAVGDGANDV SMIQEAHVGI 420 

GIMGKEGRQA ARNSDYAIAR FKFLSKLLFV HGHFYYIRIA TLVQYFFYKN VCFITPQFLY 480 

75 QFYCLFSQQT LYDSVYLTLY NICFTSLPIL IYSLLEQHVD PHVLQNKPTI* YRDISKNRLL 540 

SIKTPLYWTI LGFSHAFIFF FGSYJoLIGKD TSbLGNGQMF GNMTFGTLVF TVMVITVTVK 600 

MALETHFWTW INHfcVTWGSI IFYFVFSLFY GGILWPFLGS QNMYFVFIQL LSSGSAWFAI 660 

- ILMWTCIiFL DIIKKVFDRH LHPTSTEKAQ LTETNAGIKC LDSMCCFPEG EAACASVGRM 720 
LERVIGRCSP THISRSWSAS DPFYTNDRSI LTLSTMDSST C 



Seq ID NO: 316 DNA sequence " 
Nucleic Acid^ Accession #: NM_004473 
Coding sequence : 661 . . 1791 
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CTCGCCAGCG GTCCGCGGGG CTGGAGACCC ACGCCGT6GA GAGGACCAGC CTCAGGTCGC ' 60 
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CCCGCCTGGG CCOGOGCCCC 6ACCTCGCTG CCCCCGCCTC GCCTCTCTGC CCGTGGCGCT 120 

■ TACCGCCACC TTGGCCTCGG. GGGCAGGGCA TGGGOGGCCC CCGCCAGATC GCCCAGCGCC 180 
AGTACTAACT GCCCTCGCTC TGGCCTTCGA GCCCGAAGCC TCTTCTGCGC GCACAACCTA 240 

■ GGCAGTAATC CTAAACTAGC GGGCACCACA GAGCAGCTGC AGCCACCCCA* ACCCAGGGAT 300 
CACTTCCGGA CCCCTCGACC - GCCCGGCACC AGCGCGCAAG GGACCCTTCA GCCGGAGACC 3 60 
AGAGTCCAGT CCOGGTOSCG AGGCCACCGC CGCTGCCCGC CTCGAGAAGC ACAACGCGGG 420 
CTGAGCCGTC GGCTAGCGGG TCACTCCOGA" GCCTCTGTCT GCACCGCGCC AGCCCCAGAC 480 
CACGGACGCT GAGCCTCCAG CGCGCGCCAG CCTGGGCCGC TGGGCTCTCC GGGCCAGCCC 540 
GCGACGATCC CCTGAGCTCT CCGCAGAAGG GCCGAGCGTC CGTTCCGGGG ' ACGCCAGGCC 600 
CGCCCCCGCC- CCCCGACAGC CGCGGGGATC CAGAGCCCGG GGGTGCGGGA OGCCCGCGCC 660 
ATGACTGCCG AGAGCGGGCC GCCGCCGCCG CAGCCGGAGG TGCTGGCTAC CGTGAAGGAA 720 

" GAGCGCGGCG AGAOGGCAGC AGGGGCCGGG GTCCCAGGGG, AGGCCACGGG. CCGCGGGGCG 780 

GGCGGGCGGC GCCGCAAGCG CCCCCTGCAG CGCGGGAAGC CGCCCTACAG CTACATCGCG 840 

CTCATCGCCA-- TGGCCATCGC GCACGCGCCC GAGCGCCGCC TCACGCTGGG: CGGCATCTAC - 900 

AAGTTCATCA- CCGAGCGCTT CCCCTTCTAC CGCGACAACC CCAAAAAGTG GCAGAACAGC 960- 

ATCCGCCACA ACCTCACACT CAACGACTGC TTCCTCAAGA TCCCGCGOGA GGCCGGCCGC 1020 

CCGGGTAAGG GCAACTACTG. GGCGCTCGAC. CCCAACGCGG AGGACATGTT CGAGAGCGGC 1080 

AGCTTCCTGC GCCGCCGCAA' GCGCTTCAAG " CGCTCGGACC TCTCCACCTA CCCGGCTTAC 1140 

ATGCACGACG 7 CGGOGGCTGC CGCAGCCGCC GCTGCCGCAG; CCGCCGCCGC CGCCGCCGCC 1200 

GCCGCCATCT TCCCAGGCGC GGTGCCCGCC GCGCGCCCCC CCTACCCGGG CGCCGTCTAT 1260 

GCAGGCTACG CGCCGCCGTC GCTGGCCGCG CCGCCtCCAG TCTACTACCC CGCGGCGTCG 1320 

CCCGGCCCTT GCCGCGTCTT CGGCCTGGTT CCTGAGCGGC CGCTCAGCCC AGAGCTGGGG 1380 

CCCGCACCGT CGGGGCCCGG CGGCTCTTGC GCCTTTGCCT- CCGCCGGCGC CCCCGCTACC 1440 

ACCACCGGCT. ACCAGCCCGC AGGCTGCACC GGGGCCCGGC CGGCCAACCC CTCTGC CTAT 1500 

GCGGCTGCCT ACGCGGGCCC CGACGGCGCG TACCCGCAGG GCGCCGGCAG TGCGATCTTT 1560 

GCCGCTGCTG, GCCGCCTGGC GGGACCCGCT TCGCCCCCAG CGGGCGGCAG CAGTGGCGGC 1620 

GTGGAGACCA- CGGTGGACTT CTACGGGCGC ACGTCGCCCG GCCAGTTCGG AGCGCTGGGA 1680 

GCCTGCTACA ACCCTGGCGG- GCAGCTGGGA GGGGCCAGTG CAGGCGCCTA CCATGCTCGC 1740 

CATGCTGCCG. CTTATCCCGG TGGGATAGAT CGGTTCGTGT CCGCCATGTG AGCCAGCGTA- 1800 

GGGACGAAAA CTCATAGACA CATCGGCTGT TCACACGTTC CCCGCAACCT. GAGAACGAAC 1860 

AGGAATGGAG AGAGGACTCA> ACTGGGACCC ACGTGGAAAA GACCGAGCAG GCCACAGAGG 1920 

CTCGGTCTCC CCGCGCACAG CGTAGGCACC CTGTGTACTC TGTAAACGGG AGGAGGTGGG 1980 

GCGAGGCAGC CAGAGCCCTT GGACTGGCAG AGGGACCCTC GATGGAGCGA AGCCCTCAAA 2040 

CGGGATGCTT. TCTGGCATTC TATCGGGGAG GGTCCTTGGC GGTAACCAGA GGGCAGCGTA 2100 

GTGTCAACAC CAGAGACCAG GATCCAAATT GTGGGGAATC AGTTTCAGCC TTCCATGTGC 2160 

TGCCGGAACT CGGGCCTTTT TACGCGGTTC GTCCTCTAGT GCCTTTAACT GCGTTACTAC 2220 

AATAAAAGGC TGCGGCAGCG CCTTTCTTCT T AAAGTGAGG . AGGACAAATT TGCAAAAGAA 2280 

ATAGGCTTTT CTTCTTTTTT AAATTGGAGA AATCTCTGCT CTGGTTGACC TGGGCTGGTT. 2340. 

TTCCCTGTCT CTGAGAACTT GAGACCTAGC TCCGAGTTGA ACTGTGCGTC AGCACTCCAG , 2400; 

TCCCATCACC TGAACCTTCA GTCTCCCCCA TCTGTTACAC TAGAGGGCTG CAGGACTCTA ' 2460 

TCCACCGCCC CCGGGTTATC ATTCAGGGCC CCATCATCTT GGATGCTGCC CTGCGTATTT 2520 

GGCAGCAATG GTGGGCCACC CAGGGCCTCT GAGTAGCCAC CCAAAGCCTA GCCGCTGTTC 2580 

TAGGGAACGG AAAAGAGTTC ATGGCCAAGC GTCTAACCTA AAGTCCCAGG, ArTGGCTCCA 2640 

GGCAGCAATT ATATCATAAC TTATTGAACT TTTGAGCAGG ACGTGCTGGT AATTTCATGG 2700 

CTGTTACTGC CCAGTCATAA ATCTGCTTTT CCATTATAAG GCAGAGAGAA GTACATTCGT 2760 

TCATTTGTCC ACTGTTTCTT GTCATCACGC AGCCCTGGAC CCAAAGGGTG AACTAAAGTT " 2820 

TAAGGAGATG AGAGGATTCA AGGAGCCCGT TGGTGACGCC TTTCAGTAGC TGGGGAGGGC 2880 

TCTTCCATCC CCAGCACCCC CTGCTACACC TCAGCAGCCT CCCCCATGCA AAAAGGAAAG 2940 

AGAAAAATTA AGTTAGGGCA GTCAGTAAAG TGAGCTTTAG/ AAAGAAACTG GAATTTTAAC 3000 

TTCATTTTGT ATCTTGCTTA- AGTAGCAGGC TCACTAAAAT TAGAGAAAGT CCAATAACTC 3060 

TCCCCCTTTC CCTTGAGAAA" TCTTTAAGTT- TCGATTCTGG AGCAAAAACT TTCAGCATTA 3120 

AATATTTCAG. AGGCTCCATT CACAGCTTTC AGATAAACTG GAGTGTTCAG ATGGACTGTT 3180 

TTAATAAAAA TCTTTGAGCA AGTGAGTTAT GGCAAGAGAA ACTCAGCCTC TTTCTGTATA - 3240 

AACTTAACAG GGAAGGGCTG GGGTGTGAAA AAGAAGATTG TATGAAAACC ATTGGTAATT 3300 

TTTATTTTTT ATTTTTGGGA CTGCACTATC CTGTTCACGA AGACATGTGA ACTTGGTTCA * 3360 

GTCCAAATGG GGATTTGTAT AAACCAGTGC TCTCCATTAG AAATATGGTG CAAGCCACAT 3420 

ATGTAATTTT AAATATTCTA GTAGCCACAT TAATAAAGTK AAAAGAAACA AAAAAAAAAA 3480 
AA 

Seq ID NO: 317 Protein sequence: 
Protein Accession #: NP_004464 
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FKHLTHYROI DTRANSCRIP TIONFACTOR TTFMTAESGP PPPQPEVLAT VKEERGETAA 60 

GAGVPGEATG RGAGGRRRKR PLQRGKPPYS YIALIAMAIA HAPERRLTLG GIYKFITERP 120 

PFYRDNPKKW QNSIRHNLTL NDCFLKIPRE AGRPGKGNYW ALDPNAEDMF BSGSFLRRRK 180 

RFKRSDLSTY PAYMHDAAAA- AAAAAAAAAA- AAAAAIFPGA VPAARPPYPG AVYAGYAPPS 240 

LAAPPPVYYP AASPGPCRVF GLVPERPbSP ELGPAPSGPG GSCAFASAGA PATTTGYQPA 300 

GCTGARPANP SAYAAAYAGP DGAYPOGAGS AIFAAAGRLA GPASPPAGGS SGGVETTVDF 360 
YGRTSPGQFG ALGACYNPGG QLGGASAGAY HARHAAAYPG GIDRFVSAM 

Seq ID NO: 31B DNA sequence 
Nucleic Acid Accession Hi NM_005688 
Coding sequence: 126.. 4439 ; 
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CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 

AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120 

AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180 

GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 
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CCOGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 60O 

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA OGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA" ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 760 

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA" ACCATGGCAT 900 

TTAAGAAGAT. CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA : .960 

TTTGCTCCAA CGATGGGCAG. AGAATGTTTG. AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 
GAGGACCCGT. TGTTG CCATC . - TT AGGCATG A TTTATAATGT AATTATTCTG GGACCAACAG .- 1080 

GCTTCCTGGG ATCAGCTGTT. TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA"- 1200 

ATGAAGTTCT. TACTTACATT' AAATTTATCA' AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT. GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT- TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT, GACCCTGGGC TTCGATCTGA CAGCAGCACA 1 GGCTTTCACA GTGGTGACAG: 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG. TAACACCGTT TTCAGTAAAG TCCCTCTCAGv 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA* 1560 

TAAAGAACAA- ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT: CGCCCAAGCT* GACCCCCAAA ATGAAAAAAGr 1680. 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC Z 1740 

AGGCGGTGCT* GGCAGAGCAG . AAAGGCCACC. TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT- GCGCTTACAG AGGACACTGC 1860 

ACAGCATCGA TCTGGAGATC CAAGAGGGTA- AACTGGTTGG AATCTGCGGC AGTGTGGGAA ■ 1920 

GTGGAAAAAC CTCTCTCATT TCAGCGATTTV TAGGCCAGAT GACGCTTCTA GAGGGCAGCA • 1980 

TTGCAATCAG TGGAACCTTC GCTTATGTGC; CCCAGCAGGC CTGGATCCTC AATGCTACTC 7 2040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG v 2160 

GAGAGCGAGG AGCCAACCTG. AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220 

TGTATAGTGA CAGGAGCATC .TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG * 2280 

TGGGCAACCA' CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAGAGGC ACCCATGAGG'. AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG. CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520 

AAACCAGTGG TTCACAGAAG . AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

rCCTGGTTAT- TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGGCTTC AGCACCTGGT/ 2760 

GGTTGAGTTA CTGGATCAAG, CAAGGAAGOG GGAACACCAC TGTGACTCGA GGGAACGAGA- 2820 

CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG . 2880 

CCCTCTCCAT- GGCAGTCATG. CTGATCCTGA AAGCCATTCG- AGGAGTTGTC TTTGTCAAGG - 2940 

GCACGCTGCG; AGCTTCCTCC CGGCTGCATG ACGAGCTTTT . CCGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTTGACACG- ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060 

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG. 3180 

GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

AGGGCCTTGC. CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

AGCTGCTGGA- TGACAACCAA : GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATCv AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 

TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

T AAGGGGGCT - GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 

CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660 

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 

CTAAAGAGAA GATTGGCATT GTGGGGCGGA' CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 

CCCTCTTCCG TCTGGTGGAG TTATCTGGAG" GCTGCATCAA' GATTGATGGA GTGAGAATCA 3900 

GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 

TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 

CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCOGAT AGGATTATGG 4320 

TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTT CT TT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

CCCCTCATCG CGTCCTCCTA. CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 

ATTCCATATT CATGTAAACA AAATTTAGTT ' TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTA GCTATA 4740 

TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860 
TTGCTGTACT. AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT '4920 

CTCTAGCTGG rGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 

ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 

CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160. 

TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220* 

TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280. 

TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 

GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400 

ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTT TTCC TTT 5460 

CTCACCGCAG TCGTCGCACA . GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 

CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 

ACCTCAGGTT GCTGGTTGCT . GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 

GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 
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ATGTCGTGAC CAACTAGACA JTTCTGTCGCC TTAGCATGTT TGCTGAACAC CITGTGGAAG .5760 
CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5B20 
AAAAAAAAAA AAAAAAAA 



PCT/US02/12476 



Seq ID NO: 319 Protein sequence: 
Protein Accession #: NPJ)05679 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55- 
60 
65 
70 
.75 
80 
85 



MKDIDIGKEY 

ldasmhsqlr- 
vahkkgelsm 
lsivclmitq; 
alnyrtgvrl- 

PWAILGMIY 
VLTYIKFIKM- 
HMTLGFDIiTA 
NKPASPHIKI; 
VLAEQKGHLL 
KTSLISAILG 
CCLRPDLAIL' 
NHIFNSAIRK: 
FNNLLLGETP 
VPWSVYGVYI 
VSDSMKDNPH 
KFFDTTPTGR 
LVILPSVLHI 
IiDDNQAPPFL 
GLFQFTVRLA 
RYRENLPLVI, 
IGliADLRSKL: 
SEVMENGDNF 
MLTIAHRLHT 



11 . - 
I 

IIPSPGYRSV. 
ILDEEHPKGK 
EDVWSLSKHE 
IiAGFSGPAFM 
RGAILTMAFK 
NVIILGPTGF 
YAWVKAFSQS^ 
AQAFTWTVF 
EMKNATLAWD- 
LDSDERPSPE- 
QMTIiIiEGSIA : . 
PSSDLTEIGE 
HUCSKTVLFV 
PVEINSKKET' 
QAAGGPLAFL 
MQYYASIYAL 
ILNRFSKDMD 
VSRVLZRELK 
FTCAMRWLAV 
SETEARFTSV; 
KKVSFTIKPK< 
SIIPOEPVLF" 
SVGERQLLCI" 
VLGSDRIMVI* 



21 
I 

RERTSTSGTH 
YHHGLSALKP 
SSDVNCRRLB: 
VKHLIiEYTQA- 
KILKLKNIKE 
W3SAVFILFY- 

vqkireeerr- 
nsmtfalkvt: 
sshssiqns? 

EEEGKHIHLG- 
ISGTFAYVAQ 
RGANLSGGQR- 
THQLQYLVDC 
SGSQKKSQDK- 
VIMALFMLNV 
SMAVMLILKA" 
EVDVRLPFQA 
RLDNITQSPF: 
RLDLISIALI'- 
ERINHYIKTL 
EKIGIVGRTG 

sgtvrsnldp: 

ARALLRHCK* 
AQGQWEFDT* 



31 
I 

RDREDSKFRR 
IRTTSKHQHP. 
RLWQEEIiNEV'- 
TESNLQYSLL 
KSLGELINIC 
PAMMFASRLT 
ILEKAGYFQG 
PFSVKSLSEA' 
KLTPKMKKDK 
HLRLQRTLHS 
QAWIUIATIiR* 
QRI SliARALY 
DEVIFMKEGC 
GPKTGSVKKE 
GSTAFSTWWL 
IRGWFVKGT 
EMFIQNVILV 
LSHITSSIQG 
TTTGLMIVLM 
SLSAPARIKN 
SGKSSLGMAL 
FNQYTEDQIW 
LILDEATAAM 
PSVLLSNDSS 



41 
I 

TRPIiECQDAI* 
VDNAGLFSCM 
GPDAASLRRV' 
LVLGLLLTEI 



AYFRRKCVAA- 
ITVGVAPIW 
SVAVDRFKSIi 
RASRGKKEKV 
IDLEIQEGKL 
DNIIiFGKEYD 
SDRSIYILDD 
ITERGTHEEL 
KAVKPEEGQL 
SYWIKQGSGN 
LRASSRLHDE 
FFCVGMIAGV: 
LATIHAYNKG 
HGQIPPAYAG 
KAPSPDWPQE 
FRLVELSGGC 
DAliERTHMKE 
DTETDLLIQE 
RFYAMFAAAE 



51 
I 

ETAARAEGliS 

TFSWLSSLAR- 

VWIFCRTRLI 

VRSWSLALTW 

AAVGSLLAGG 

TDERVQKMNE- 

VIASWTFSV:: 

FLMEEVHMIK- 

RQLQRTEHQA- 

VGICGSVGSG- 

EERYNSVLNS' 

PLSALDAHVG 

MNLNGDYATI- 

VQLEEKGQGS 

TTVTRGNETS; 

LFRRIIiRSPM* 

FPWFLVAVGP ■ 

QEFLHRYQEL: 

IiAISYAVQLT 

GEVTFENAEM* 

IKIDGVRISD" 

CIAQLPLKLE 

TIREAFADCT 

NKVAVKG 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
. 660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



Seq ID NO: 320 DNA sequence 

Nucleic Acid Accession #: AK022089.1 

Coding sequence: 181-1488 



1 

I 

AGCAGTTGCA 
ACATCCGAGG- 
CCCCACCCCC. 
ATGGCTCCCT 

gacatggatt 
tgtgggctga" 
gaggctacgt; 
gagaagtgga 
aaagcgtggg 

CTTCCAGTTC 
TGGGAGCTCA- 
GTCAGGAAAA" 
GATAATATGG' 
GTCAAGAGAA 
GATCGAGTAG: 
AGTGAAGTTG 
AGCGAAAGTG: 
GATAAGCTCT 
GATGCGGAAG 
• GATTTGGAGA 
AAAGAGATTA 
GCCAAGGAAT 
AGAGCGAAGG 
GTATTTAGCA 
CAGGACTCCG 
CTTTCTGACC 
ATATAACACT 
GAAATGTGTT 



11 
I 

CAACTTCCAG 
AGAAGAGAAA 
ACCCTCAGAT 
TTGGAAGAAA 
CAGAAGAGAA 
CTAAACGCAC 
TTGGAGAGAA 
GAGGCTCCGA 
GAGATGAGCA 
CTTTGTGGCG 
GCCCAGCAAA 
CTTTCCGGAA 
AGACATTAGT 
TGAAAGAGCT 
AAAATGATGG 
AGCAAAATCT 
ATGGAATTGA 
CTGCTGAAAT. 
GGGAAGCTGC 
AAAGCATGAA 
AATACAGTGA 
TCAATTCACT 
AATCTGAGGT 
ATTACACAAA 
AAACAACAGT 
TGCTTTCATG 
CAAAAAAATG 
GTTTCTG 



21 
I 

CAACTTTCTC 
GCTTCCAGCC 
CACTTTAAGA 
CTTGCTAAAG' 
GGAAATTGTG 
CACCTCTGCT, 
ACGATTTCTT 
AAGGGTTCTT' 
GCCCAATATG 
GACAGCTGAA 
CTACATGAAG 
ACTGGCTAAA 
TCATCTGATC ■ 
GGATCTGGAA 
AGAAAACTAT 
AGACTTGCAG" 
ACAGCTGGAA 
AGAAAAAGAG 
AAGTGAACTG 
AGCTGGTTTG 
CTCATTGCTT- 
TCACATTAGC 
TCCCAGTAGC 
TGACACAGAC 
AGGAGATGTG 
TTTTAATGTT 
TAAATCATAT 



31 
I 

AGCCGGCTAC 
CTCCTCCCTT 
TAATTTCTTT 
ACTCGGCATA 
GTTTGGGTTT 
GATGTCATCC 
CTGGGGAAGC 
CCTGCACTAA 
CAATTTGTTT 
GCCAAATTAG. 
ACTTTACCAC 
ATTAAGCAGG 
ATTTCCCAGG 
ATTGAAAAGT 
GTTCAGGATG 
TATGAGGAAA 
GAACGACTGA 
GTAAAAAGTG 
GAAAGCTCTA 
AAAATTCACT 
CAGATGAAAG 
AACAAAGATG 
AATGGGGAGA 
TCGGACACTG 
GTGCTGTTGT 
TGTTTAATTT 
TGTAGTATTC 



Seq ID NO: 321 Protein sequence: 
Protein Accession #: NP_005438,1 



I 

MAPFGRNLLK 
EATFGEKRFL 
LPVPLWRTAE 
DNMETLVHLI 
SEVEQNLDLQ 
DAEGEAASEL 
AKEFNSLHIS 
QDSETTVGDV 



11 
I 

TRHKNRSPTK 
LGKPSDYCII 
AKLVQNTEKL 
ISQDHTIHQQ 
YEENQTLEDL 
ESSNLESVKC 
NKDGCQLKEN 
VLLST 



21 i • 
I 

DMDSEEKEIV 
EKWRGSERVL 
WELSPANYMK 
VKRMKELDLE 
SESDGIEQLE 
DLEKSMKAGL 
RAKES EVP SS 



31 
I 

VWVCQEEKLV 
PPLTRILKLW 
TLPPDKQKRI 
IEKCEAKFHL 
ERLKYYRILI 
KIHSHLSGIQ 
NGEIPPFTQR 



41 ■ 
I 

TAATGAGCTG 
CACCCTGGAA 
ATTCGTTTGC 
AAAACAGATC 
GCCAAGAAGA 
AGGCTTTGCT 
CCAGTGATTA 
CTAGAATCCT 
TGGTTAAAGC 
TGCAAAACAC. 
CAGATAAACA 
ACACAGTTTC 
ACCATACTAT 
GTGAAGCTAA 
CATATTTAAT 
ACCAGACTCT 
AATATTACCG 
TTTGCATTGA 
ATTTAGAGAG 
CTCATTTGAG 
CAAAAGAATA 
GGTGCCAGTT 
TTCCTCCCTT 
GTATCAGTTC 
CAACATAGTT 
AATAGGAAAC 
AATAGTTAAT 



41 
I 

CGLTKRTTSA 
KAWGDEQPNM 
VRKTFRKIiAK 
DRVENDGENY 
DKLSAEIEKE 
KEIKYSDSLL 
VFSNYTNDTD 



51 
I 

AAAGCCAGGA 

ATCCAGACAC 

CCGACAGACC 

TCCAACTAAA 

GAAGCTTGTC 

TGAGGAACAT 

CTGCATCATA 

GAAGCTTTGG 

AGATGCTTTT 

AGAAAAATTG 

AAAAAGAATA- 

TCATGATCGA- 

TCATCAGCAA-. 

GTTCCATCTT- 

GCCCAGTTTC' 

GGAGGACCTG. 

AATACTCATT. 

TATAAATGAA^ 

TGTTAAGTGT 

TGGCATCCAG 

TGAACTCCTG 

AAAGGAAAAC 

TACTCAAAGA 

TAACCACAGT 

CCAATGGCTC 

CTCATTTTAA 

AAAAACTCGA 



51 
I 

DVIQALLEEH 
QFVLVKADAF 
IKQDTVSHDR 
VQDAYltMPSF 
VKSVCIDINE 
QMKAKEYELL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13B0 
1440 
1500 
1560 
1620 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 322 DNA sequence 

Nucleic Acid Accession 8: NM_030920.1 
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. Coding sequence: 017-1123*1 



AAGAGGAGGA 900 

GAGAAGAGGA 960 

ATGATGACTA 1020 

GGGAGAAGAG 1080' 



1 11 21 31 41 51 

I I I I.I I 

AGCATTGAAG GGGAAGGAAC TGCGGGTGTG GTGTGTGTAT GTGTGTGTGT ATGTGTGTGC. 60 

GGCGCGTGCG TGCGTGTGTG . TGCGCGCGCT AGTGTGTGGA CAAGGAGGTG . GGGGCAGCTG 120 

AGTTAGAGTC CCAACTCTTG GACTCCATTT GCTATTCTCT TCTTTCTCCC CCACACCTAT 180 

CTGGTGGTGG TAGTGGGCGT* TTATATTTGC GTTCCTTTTC ATTCATTTCT AAATCTCTTA 240 

AAAATTTTGG- GTTGGGGGTA*- TTGGGGAAGG- CAGGAAAGGG AAAAGGAGAG TAGTAGCTGA 300 

AGAGCAAGAG : GAGGACATGG AGATGAAGAA GAAGATTAAC CTGGAGTTAA GGAACAGATC 360 

CCCGGAGGAG GTGACAGAGT TAGTCCTTGA' TAATTGCCTG TGTGTCAATG GGGAAATTGA, 420 

AGGCCTGAAT GATACTTTCA- AAGAACTAGA ATTTCTGAGT ATGGCTAATG . TGGAACTAAGV 480 

TTCGCTGGCC CGGCTTCCCA GCTTAAATAA ACTTCGAAAA TTGGAGCTTA GTGATAATAT 540 

AATTTCTGGA GGCTTGGAAG TCCTGGCAGA GAAATGTCCA AATCTTACCT ACCTCAATCT 600 

GAGTGGAAAC AAAATAAAAG ATCTCAGTAC AGTAGAAGCT CTGCAAAATC TTAAAAATTT 660 

GAAAAGTCTT GACCTGTTTA ACTGTGAGAT CACAAACCTG GAAGATTATA GAGAAAGTAT 720 

XTTTGAACTA- CTGCAGCAAA. TCACATACTT - AGATGGATTT GATCAGGAGG- ATAATGAAGC " 780 

GCCGGACTCT- GAAGAGGAGG ATGATGAGGA- TGGAGATGAA GATGATGAAG. AGGAAGAGGA 840 
AAATG AAGCT- GGTCCACCGG ". AAGGATATGA' GGAAGAGGAG GAGGAAGAGG 
TGAGGATGAG GATGAAGATG- AAGATGAAGC' AGGTTCAGAG TTGGGAGAGG 
AGTGGGCCTC TCATACTTAA TGAAAGAAGA AATTCAGGAT GAAGAAGATG 
TGTTGAAGAA GGGGAAGAAG AGGAAGAAGA- GGAAGAAGGA GGTCTTCGAG 

GAAACGAGAT GCTGAAGACG ATGGAGAGGA "AGAAGATGAC . TAGATCATTC TAAGACCAGA 1140 

TTCTCTAATG TTTCTGGGTG TGCAATAGAG TGATCACATC TTTGTTTCTT CATGTACGAT 1200 

AGCTATCCCT ACAGAAGATA' ATGTGTAACT TTTTATAGGA AAAGTGTGGT TTTACTATTT 1260 
TTGCCTTATC ATTCCAAATA" AGAACTAGTC: TGTTAATGAT- CATATTGTAT.. GTAGAGAAAA 1320 

ATTTTCATTG ACTCCCATTG TGGAATTCCC TAGCAATTTA - TTTAGACTTA ATTTTTTAAA 1380 

TTCAAGCTTA CTGTATTAGT' CATTTTTAGC CCATAATTAA AACATGATCA^ CTTTTAAACA 1440 

GGTGTAGTAT- GGTGCATTTC ATTCCTTATT. TAT AGATTAA , CT GAAATT AC AGTTTGCTAT 1500 

AATATAAAAT GACAATAGTC TCTTGAGTGG .TAAGTTGGTT ' ATTTTTTTAG AGGTGATCCA 1560 

GGAATCTTTA GTTTGAAGGC AGTTACCTTT TTTTTTTTTT- TTTTTTTTTG ACTAAGAGTG 1620 

TTTGGTTGCT- TTTTTGTCAC AAGTAACTTG GAAAATAGAA GCAGAATAGT AAAGGTTCTA 1680 

TTCAGCAACA- TAGTTCATGG ATTTTGTGGA GGTTCTATTC AGTAATATGG TTCATGGATT 1740 

TAGTGGTGAC TGATAAGATT TTATTTTTGA AGGAAAAATT . GCTTATACTA AGTCCAGAGA 1BO0 

CATGCAGGTG AGCCCTTTTG TCAGGCTGCA AATCATGACA' TGCCGATGGT TGTTTATTTT 1860 

GTTTTTAGGT- GTGCATTCTT TTTCTTCTTA GCAATTCCTT TATGATCACC TTCCCTTCTT 1920 

GTTTCACTCC CTCCCGCTCT CTCAAAAGGA ACTTGGGAAA CTTGTGAAAC CCAGGAAAAC 1980 

CTTTAGTCTT ATACCTCAAC TACGTTTCAG' TCCTGTCTGG GTTTTAAATA AGTGAAGTAG 2040 

AAGAAATTGA GTATTTTCTG ACATAAGAAT ATATTATCAA TACAGTTTTA TGCAGTAAGC 2100 

TCTCCTTACC ATAAATGTTT CTTGGTTGAC AACATCTAAG ACAATATTAG. TGGGATGAAG 2160 

AAAGAAAAGC AGGGGTGGTT. TTGGAAGCAG TGTTAGTGTT CCTCAAAAGT CGGAACAATT 2220 

GCCTGTTGAT ATATTAATAA* GACATTAAAG TCAAATTTTA ATGTTGGCCT CTCAAATGAT 2280 

TTGGATACCA CTCTGCAAAG' TATTTCTAAC CTTTAATTCC CAGTTTTAAA ACAGATATAA 2340 

TAATAGCATT TAATTGGAAT ATACTAGGCA- GCTGGAAAAG . TATTTGAAAC TAAATTGACA 2400' 

TTAAAATTAA GATTTGTTTT CAAGTGGATG : TCCATTAAAA GTAGAAAAAT: ATTTGGGATA" 2460 

AGTGAGTGTG TGTTTCCTTA- CATGGCTACT AAATAAAATA TAATGAGTAT-- ACAAGTATAT" 2520 

CTCCTCTTTT GCTATGGAGG CTCCATGTTC AAGGCAATGG-- CTTTTTAAAT CTTGGCTATC 2580 

TAAAATTTTT' TCCCTTTGTT- TTGAATATTT GTAAGTTTTT AAGAAGTTAG TGTCAGCAAA 2640 

TTAATTGAAG TTATGCTTCT ATACTGGGAC ATATTTAAAT ACTGAGTATA GTACTGCTGC 2700 

TACTGCTTCT- ACAATGTAAA ATGTATGACT TGGTGTTTTA AAGTAAAAAT TATGATGTTA 2760 

CTTGTGGAGA* AACTAAAAAT GTTGTACAAC TGACCGAAAG AAAACCCTTG GGGATAAGTT 2820 

TAGTGAGGGG ATTGGAATCC CCAAAAAGAT; AACATTTTTC TTCTGCTTTT- AAAAACTGAA 2880 

ATTCCCTGTT- CTAGTTCCTA- ACAATTCTCA TTACATACTA TGCCAGATTA CAAAATACTT 2 940 

ATTTTTAAAA TGAAATCTAT ATATTGACTT TCTTATCAAT CATCTTACTG TGCAATCAAA 3000 

ATTAGAGTAC TTTGGTTTGA- AAACAACACT TAGAGCCTCC AGATAACTTT. TAAGACTTAT 3060 

TTAGCTTTGT GGGTGGTATT TTCATGCAAA" TAAGTAAGGG TGGGTTTTAT ATTTTGTAGA 3120 

AGTTTTCGGT. CCTATTTTAA TGCTCTTTGT ATGGCAGTAT GTATATATTG TGTTAAGTTC 31B0 

CTCAAGAATC TCCTTAAAAA CTTTGAAGTT AATACTTTTG TGCAACTGTG TTTTGAATAA 3240 
AGCCATGACA . GTGTTAAAAA CAAAC 

Seq ID NO: 323 Protein sequence: 
Protein Accession #: NP_1 12 182.1 ■ 

1 11 21 31 41 51 

MEMKKiaNLE LRNRSPEEVT ELVLDNCLCV NGBIEGLNDT FKELEPLSMA NVELSSLARL 60 

PSLNKLRKLE LSDNIISGGL EVLAEKCPNL TYLNLSGNKI KDLSTVEALQ NLKNLKSLDL 120 

PNCEITNLED YRESIFELLQ QITYLDGFDQ" EDNEAPDSEB EDDEDGDEDD EEEEENEAGP 180 

PEGYEEEEEE EEEEDEDBDE DEDEAGSELG ' EGEEEVGLSY LMKEEIQDEE DDDDYVEEGE 240 
EEEEEEEGGL RGEKRKRDAE DDGEEEDD 

Seq ID NO: 324 DNA sequence 
Nucleic Acid Accession fc: NM_003812 
Coding sequence: 224.. 2722 

1 . 11 21 31 41 51 

TCCTCTGCGT CCCGCCCCGG GAGTGGCTGC GAGGCTAGGC GAGCCGGGAA AGGGGGCGCC 60 

GCCCAGCCCC GAGCCCCGCG CCCCGTGCCC CGAGCCCGGA GCCCCCTGCC CGCGGCGGCA 120 

CCATGCGCGC CGAGCCGGCG TGACCGGCTC CGCCCGCGGC CGCCCCGCAG CTAGCCCGGC 180 

GCTCTCGCCG GCCACACGGA GCGGCGCCCG GGAGCTATGA GCCATGAAGC CGCCCGGCAG 240 

CAGCTCGCGG CAGCCGCCCC TGGCGGGCTG CAGCCTTGCC. GGCGCTTCCT GCGGCCCCCA 300 

ACGCGGCCCC GCCGGCTCGG TGCCTGCCAG CGCCCCGGCC CGCACGCCGC CCTGCCGCCT 360 

GCTTCTCGTC CTTCTCCTGC TGCCTCCGCT CGCCGCCTCG TCCCGGCCCC GCGCCTGGGG 420 

GGCTGCTGCG CCCAGCGCTC CGCATTGGAA TGAAACTGCA GAAAAAAATT TGGGAGTCCT 480 

GGCAGATGAA GACAATACAT TGCAACAGAA TAGCAGCAGT AATATCAGTT ACAGCAATGC 540 

AATGCAGAAA GAAATCACAC TGCCTTCAAG ACTCATATAT TACATCAACC AAGACTCGGA 600 



307 
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AAGCCCTTAT CACGTTCTTG . ACACAAAGGC AAGACACCAG CAAAAACATA ATAAGGCTGT 660. 
CCATCTGGCC CAGGCAAGCT TCCAGATTGA AGCCTTCGGC TCCAAATTCA TTCTTGACCT 720 
CATACTGAAC AATGGTTTGT TGTCTTCTGA TTATGTGGAG ATTCACTACG AAAATGGGAA 780 
ACCACAGTAC TCTAAGGGTG GAGAGCACTG TTACTACCAT GGAAGCATCA GAGGCGTCAA 840 
5 ' AGACTCCAAG GTGGCTCTGT CAACCTGCAA TGGACTTCAT GGCATGTTTG AAGATGATAC 900. 
CTTCGTGTAT ATGATAGAGC CACTAGAGCT GGTTCATGAT GAGAAAAGCA CAGGTCGACC 960 

• ■ ACATATAATC CAGAAAACCT. TGGCAGGACA GTATTCTAAG CAAATGAAGA ATCTCACTAT 1020 

• GGAAAGAGGT GACCAGTGGC " CCTTTCTCTC TGAATTACAG TGGTTGAAAA. GAAGGAAGAG. 1080- 

AGCAGTGAAT CCATCACGTG GTATATTTGA AGAAATGAAA TATTTGGAAC TTATGATTGT 1140- 

10 TAATGATCAC AAAACGTATA AGAAGCATCG CTCTTCTCAT. GCACATACCA ACAACTTTGC 1200 

AAAGTCCGTG " GTCAACCTTG . TGGATTCT^ CAGCTCAACA. CCAGGGTTGT 1260 

CCTGGTGGCT. GTAGAGACCT GGACTGAGAA GGATCAGATT GACATCACCA CCAACCCTGT 1320 

GCAGATGCTC CATGAGTTCT. CAAAATACCG GCAGCGCATT AAGCAGCATG CTGATGCTGT 1380 

GCACCTCATC TCGCGGGTGA; CATTTCACTA TAAGAGAAGC AGTCTGAGTT ACTTTGGAGG 1440: 

15 ' TGTCTGTTCT,. CGCACAAGAG GAGTTGGTGT GAATGAGTAT GGTCTTCCAA. TGGCAGTGGC " 1500 

ACAAGTATTA TCGCAGAGCC TGGCTCAAAA CCTTGGAATC CAATGGGAAC CTTCTAGCAG 1560 

AAAGCCAAAA TCTGACTGCA CAGAATCCTG GGGTGGCTGC ATCATGGAGG- AAACA GGGGT ■ 1620 

GTCCCATTCT CGAAAATTTT CAAAGTGCAG CATTTTGGAG TATAG AGACT TTTTACAGAG 1680 

AGGAGGTGGA GCCTGCCTTT . TCAACAGGCC AACAAAGCTA TTTGAGCCCA' CGGAATGTGG 1740 

• 20 AAATGGAT AC GTGGAAGCTG GGGAGGAGTG TGATTGTGGT TTTCATGTGG AATGCTATGG 1800 

ATTATGCTGT AAGAAATGTT CCCTCTCCAA CGGGGCTCAC TGCAGCGACG GGCCCTGCTG 1860 

TAACAATACC TCATGTCTTT TTCAGCCACG AGGGTATGAA TGCCGGGATG CTGTGAACGA 1920 

GTGTGATATT" ACTGAATATT "GTACTGGAGA CTCTGGTCAG TGCCCACCAA ATCTTCATAA 1980 

GCAAGACGGA TATGCATGCA ATCAAAATCA GGGCCGCTGC TACAATGGCG AGTGCAAGAC 2040 

25 CAGAGACAAC' CAGTGTCAGT ACATCTGGGG AACAAAGGCT GCAGGGTCTG ACAAGTTCTG 2100 ' 

CTATGAAAAG CTGAATACAG AAGGCACTGA GAAGGGAAAC TGCGGGAAGG ATGGAGACCG. 2160 

• GTGGATTCAG TGCAGCAAAC ATGATGTGTT CTGTGGATTC TTACTCTGTA" CCAATCTTAC " 2220 

TCGAGCTCCA CGTATTGGTC AACTTCAGGG TGAGATCATT CCAACTTCCT TCTACCATCA 2280 

AGGCOGGGTG ATTGACTGCA GTGGTGCCCA TGTAGTTTTA GATGATGATA CGGATGTGGG 2340 

30 CTATGTAGAA GATGGAACGC .CATGTGGCCC GTCTATGATG TGTTTAGATC GGAAGTGCCT 2400 

ACAAATTCAA GCCCTAAATA TGAGCAGCTG TCCACTCGAT TCCAAGGGTA AAGTCTGTTC 2460 

GGGCCATGGG: GTGTGTAGTA. ATGAAGCCAC CTGCATTTGT GATTTCACCT GGGCAGGGAC 2520 

AGATTGCAGT ATCCGGGATC CAGTTAGGAA CCTTCACCCC CCCAAGGATG AAGGACCCAA 2580 

GGGTCCTAGT GCCACCAATC TCATAATAGG - CTCCATCGCT GGTGCCATCC TGGTAGCAGC 2640 

35 TATTGTCCTT GGGGGCACAG GCTGGGGATT TAAAAATGTC AAGAAGAGAA GGTTCGATCC 2700 

TACTCAGCAA GGCCCCATCT GAATCAGCTG CGCTGGATGG' ACACCGCCTT GCACTGTTGG 2760 

ATTCTGGGTA TGACATACTC. GCAGCAGTGT TACTGGAACT ATTAAGTTTG TAAACAAAAC 2820 

CTTTGGGTGG. TAATGACTAC GGAGCTAAAG TTGGGGTGAC AAGGATGGGG TAAAAGAAAA 2880 

CTGTCTCTTT TGGAAATAAT GTCAAAGAAC ACCTTTCACC ACCTGTCAGT AAACGGGGGA 2940 

40 GGGGGCAAAA GACCATGCTA TAAAAAGAAC TGTTCCAGAA TCTTTTTTTT TCCCTAATGG 3000 
ACGAAGGAAC AACACACACA CAAAAATTAA ATGCAATAAA GGAATCATTA AAAA 
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Seq ID NO: 325 Protein sequence: 
Protein Accession 8: NP_003803 



I 11 . 21 31 41 51 

II I I I I 

MKPPGSSSRQ PPLAGCSLAG ASCGPQRGPA GSVPASAPAR TPPCRLLLVL LLLPPLAASS 60 

RPRAWGAAAP SAPHWNETAB KNLGVLADED NTW3QNSSSN ■ ISYSNAMQKE; ITLPSRtlYY 120 

50 INQDSESPYH' VLDTKARHQQ KHNKAVHLAQ. ASFQIEAFGS KFILDLILNN GLLSSDYVEI 180 

HYENGKPQVS KGGEHCYYHG SIRGVKDSKV ALSTCNGLHG MFEDDTPVYM IEPLELVHDE 240 

KSTGRPHIIQ KTLAGQYSKQ MKNLTMERGD QWPFLSELQW. LKRRKRAVNP SRGIFEEMKY 300 

LELMIVNDHK. TYKKHRSSHA HTNNFAKSW- NLVDSIYKEQ LNTRWLVAV ETWTEKDQID 360 

ITTNPVQMLH- EPSKYRQRIK QHADAVHLIS RVTFHYKRSS LSYFGGVCSR TRGVGVNEYG 420 

55 LPMAVAQVLS QSLAQNLGZQ. WEPSSRKPKC DCTESWGGCI MEETGVSHSR KFSKCSILEY 480 

RDPLQRGGGA 1 CLPNRPTKLF EPTECGNGYV EAGEECDCGP HVECYGLCCK KCSLSNGAHC 540 

SDGPCCNNTS CLFQPRGYEC RD AVNE CD IT EYCTGDSGQC PPNLHKQDGY ACNQNQGRCY 600 

NGECKTRDNQ CQYIWGTKAA GSDKFCYEKL NTEGTEKGNC GKDGDRWIQC SKHDVFCGFI* 660 

LCTNLTRAPR IGQLQGEIIP TSFYHQGRVI DCSGAHWLD DDTDVGYVED GTPCGPSMMC 720 

60 LDRKCLQIQA LNMSSCPLDS KGKVCSGHGV CSNEATCICD FTWAGTDCSI RDPVRNLHPP 780 
KDEGPKGPSA TNLI IGSIAG AILVAAIVIX3 GTGWGFKNVK KRRFDPTQQG PI 

Seq ID NO: 326 DNA sequence 
Nucleic Acid Accession 8: AK074418.1 
65 Coding sequence: 244-1515 

1 11 '. 21 J 31 41 51 ' .: 

r ' ' |. " .. :| I I -I 

CTTTCTCCAA GACGGCCGGC CATGCTCTCC TCCTCTGCCA GTCTCCXCCA CCACTCTCTA 60 

70 ACCTGAGAGC CTGTGGAACC TGCCCGTCTC CCCTCCTCCA TCAGACACAC CTGCCTAGGA 120 

AACAGATGGA AAAAGTGAGG GACCGGTGAG TGACTTGCTG CTAAAGTTTA TACCAGATGC 180 

AAATGACAGA GCTGGAGTTC TGCTGTGCCT GGAAAGGACC TCGGAAGTCT TCTAAGGAGA 240 

GTCATGGCGT ATTACCAGGA GCCTTCAGTG GAGACCTCCA TCATCAAGTT CAAAGACCAG 300 

GACTTTACCA CCTTGCGGGA TCACTGCCTG AGCATGGGCC GGACGTTTAA GGATGAGACA 360 

75 TTCCCCGCAG CAGATTCTTC CATAGGCCAG AAGCTGCTCC AGGAAAAACG CCTCTCCAAT 420 

GTGATATGGA AGCGGCCACA GGATCTACCA GGGGGTCCTC CTCACTTCAT CCTGGATGAT 480 

ATAAGCAGAT TTGACATCCA ACAAGGAGGC GCAGCTGACT GCTGGTTCCT GGC AGCA CTG . 540 

GGATCCTTGA CTCAGAACCC ACAGTACAGG CAGAAGATCC TGATGGTCCA AAGCTTTTCA 600 

* ' " CACCAGTATG CTGGCATTTT CGGTTTCOGG TTCTGGCAAT GTGGCCAGTG GGTGGAAGTG 660 

80 GTGATTGATG ' ACCGCCTACC TGTCCAGGGA GATAAATGCC TCTTTGTGCG TCCTCGCCAC 720 

CAAAACCAAG . AGTTCTGGCC CTGCCTGCTG GAGAAGGCCT ATGCCAAGCT GCTCGGATCC 780 

TATTCCGATC TGCACTATGG CTTCCTCGAG GATGCCCTGG TGGACCTCAC AGGAGGCGTG 840 

ATCACCAACA TCCATCTGCA CTCTTCCCCT GTGGACCTGG TGAAGGCAGT GAAGACAGCG. 900 

ACCAAGGCAG GCTCCCTGAT AACCTGTGCC ACTCCAAGTG GGCCAACAGA TACAGCACAG 960 

85 GCGATGGAGA ' ATGGGCTGGT GAGTCTCCAT GCCTACACTG TGACTGGGGC TGAGCAGATT 1020 

' CAATACCGAA GGGGCTGGGA AGAAATTATC TCCCTGTGGA ACCCCTGGGG CTGGGGCGAG 1080 

ACOGAATGGA GAGGGOGCTG ' GAGTGATGGG TCTCAGGAGT GGGAGGAAAC CTGTGATCCG. 1140 
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WO 02/086443: .. . . . 

CGGAAAAGCC AGCTACATAA GAAACGGGAA GATGGCGAGT 7TTGGATGTC GTGTCAAGAT 1200 

. TTCCAACAGA AATTCATCGC CATGTTTATA TGTAGCGAAA TTCCAATTAC CCTGGACCAT 1260. 

GGAAACACAC TCCACGAAGG ATGGTCCCAA. ATAATGTTTA GGAAGCAAGT GATTCTAGGA 1320 

AACACTGCAG GAGGACCTCG " GAATGATGCT CAATTCAACT TCTCTGTGCA AGAGCCAATG 1360 

GAAGGCACCA ATGTTG TCGT. . GTGCGTCACA GTTGCTGTCA CACCATCAAA TTTGAAAGCA 1440 

GAAGATGCAA AATTTCCACT CGATTTCCAA GTGATTCTGG CTGGCTCACA GAAACACTGT 1500 

CCAAAGCTCA- AATAATAAAT TCCGCCGCAA CTTCACCATG ACTTACCATC TGAGCCCTGG , 1560 

GAACTATGTT GTGGTTGCAC AGACACGGAG AAAATCAGCG GAGTTCTTGC TCCGAATCTT 1620 
CCTGAAAATG CCAGACAGTG ACAGGCACCT GAGCAGCCAT- TTCAACCTCA GAATGAAGGG • 1680^ 

AAGCCCTTCA GAACATGGCT CCCAACAAAG CATTTTCAAC AGATATGCTC AGCAGGTATG 1740 

GTACCTAGCA* CCCAGGGGCC. TTACGTGGGA TTGGAGAAAG GGGACCTGAG GGAGGGACAG 1800 

CCCTCACAGG CCCTTACTGG : GATGCAGAGA GGAGAAGTGA CTTGATGGAC . TATTTTACCT I860 

GCCTCTCTTC CTGGATCGTC TCCAGAACTG CTGTGGCTGC CAAGCTCGGT AGAGACGTGG 1920 
CGCCCCACCC AGTCTCATCC GGGGGACTTC: AAGCTGGAAT GCAGAGCTTA GAAAGGGAGG" 1980 

GGATAATTAT GGGGTGTGAG GTGCATTGCC CTCTAAATCT" TTAAACAAGC AATTGGCAGT 2040 

ACCCCGTGAA - ACCTTTCCTT CTCCTACTCG GCCACCTCCC ACCAACCTGG CATCGTTCCT 2100 

CCCGGGAGCT. AGCCAGCTTC- AGAAAGCACA TACAGCATCC TTGCTGCCAA ACCACCTATG 2160 

TGCACACAGG ATTTCCTT AA TGGCTTAATA' AACTGTTATA AAGAACTCCT TGACTTGTCA 2220' 

. GAATAAAATA- GCTGCCAGGG GCTCTGCACA ATGAGCCTCT TACCGTTAAA AAAAAAAAAA 2280 
AAAAAAAAAA- AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 327 Protein sequence: 

Protein Accession #: BAB85075.1 . . 

1 ii 21 31 41 51 

I I I \ 1 I 

MAYYQEPSVB TSIIKFKDQD FTTLRDHCLS MGRTFKDETF PAADSSIGQK bLQEKRLSNV 60 

IWKRPQDLPG GPPHPIU3DI SRFDIQQGGA ADCWFIAALG SLTQNPQYRQ KILMVQSFSH 120 

QVAGIFRFRF. WQCGQWVEW IDDRLPVQGD KCLFVRPRHQ NQEFWPCLLE KAYAKLLGSY 180 

SDLHYGFLED ALVDLTGGVI- TNIHUiSSPV DLVKAVKTAT- KAGSliITCAT- PSGPTDTAQA 240 

MENGLVSLHA" YTVTGAEQIQ: YRRGWEEIIS LWNPWGWGET EWRGRWSDGS- QEWEETCDPR 300 

KSQLHKKRED GBFWMSGQDP QQKFIAMPIC SEIPITLDHG NTLHEGWSQI MFRKQVILGN 360 

TAGGPRNDAQ -FKFSVQEPME GTNWVCVTV AVTPSNLKAE DAKFPLDFQV ILAGSQKHCP 420 
KI»K 



Seq ID NO: 328 DNA Bfiqueiice 

Nucleic Acid Accession #t BC017490.1 

Coding sequence: 74-27B8 

1 11 21 31 41 51 . 

GTGGGTCACG TGAACCACTT TTCGCGCGAA ACCTGGTTGT TGCTGTAGTG GCGGAGAGGA 60 

TCGTGGTACT GCTATGGCGG AATCATCGGA ATCCTTCACC ATGGCATCGA GCCCGGCCCA 120 

GCGTCGGCGA GGCAATGATC CTCTCACCTC CAGCCCTGGC CGAAGCTCCC GGCGTACTGA 180 

TGCCCTCACC TCCAGCCCTG GCCGTGACCT TCCACCATTT GAGGATGAGT CCGAGGGGCT 240 

CCTAGGCACA GAGGGGCCCC TGGAGGAAGA AGAGGATGGA GAGGAGCTCA TTGGAGATGG 300 

CATGGAAAGG GACTACCGCG CCATCCCAGA GCTGGACGCC TATGAGGCCG AGGGACTGGC 360 

TCTGGATGAT GAGGACGTAG: AGGAGCTGAC- GGCCAGTCAG AGGGAGGCAG CAGAGCGGGC 420 

CATGCGGCAG . CGTGACCGGG AGGCTGGCCG GGGCCTGGGC CGCATGCGCC GTGGGCTCCT " 480 

GTATGACAGC GATGAGGAGG ACGAGGAGCG CCCTGCCCGC AAGCGCCGCC AGGTGGAGCG 540 

GGCCACGGAG- GACGGCGAGG AGGACGAGGA GATGATCGAG AGCATCGAGA ACCTGGAGGA 600 

TCTCAAAGGC CACTCTGTGC GCGAGTGGGT GAGCATGGCG GGCCCCCGGC TGGAGATCCA 660 

CCACOGCTTC AAGAACTTCC TGCGCACTCA CGTCGACAGC CACGGCCACA ACGTCTTCAA 720 

GGAGCGCATC AGCGACATGT GCAAAGAGAA- CCGTGAGAGC CTGGTGGTGA ACTATGAGGA ' 780 

CTTGGCAGCC AGGGAGCACG TGCTGGCCTA CTTCCTGCCT GAGGCACCGG CGG AGCTGCT 840 

GCAGATCTTT GATGAGGCTG CCCTGGAGGT GGTACTGGCC ATGTACCCCA AGTACX3ACCG 900 

CATCACCAAC CACATCCATG TCCGCATCTC CCACCTGCCT. CTGGTGGAGG AGCTGCGCTC 960 

GCTGAGGCAG CTGCATCTGA ACCAGCTGAT CCGCACCAGT GGGGTGGTGA CCAGCTGCAC 1020 

TGGCGTCCTG CCCCAGCTCA GCATGGTCAA GTACAACTGC AACAAGTGCA ATTTCGTCCT 1080 

GGGTCCTTTC -TGCCAGTCCC AGAACCAGGA GGTGAAACCA GGCTCCTGTC CTGAGTGCCA 1140 

GTCGGCCGGC CCCTTTGAGG TCAACATGGA GGAGACCATC TATCAGAACT ACCAGCGTAT 1200 

CCGAATCCAG GAGAGTCCAG GCAAAGTGGC GGCTGGCCGG CTGCCCCGCT CCAAGGACGC 1260 

CATTCTCCTC GCAGATCTGG TGGACAGCTG CAAGCCAGGA GACGAGATAG AGCTGAGTGG 1320 

CATCTATCAC AACAACTATG ATGGCTCCCT CAACACTGCC AATGGCTTCC CTGTCTTTGC 1380 

CACTGTCATC CTAGCCAACC ACGTGGCCAA GAAGGACAAC AAGGTTGCTG TAGGGGAACT 1440 

GACCGATGAA GATGTGAAGA TGATCACTAG CCTCTCCAAG GATCAGCAGA TCGGAGAGAA 1500 

GATCTTTGCC AGCATTGCTC CTTCCATCTA TGGTCATGAA GACATCAAGA GAGGCCTGGC 1560 

TCTGGCCCTG TTCGGAGGGG AGCCCAAAAA CCCAGGTGGC AAGCACAAGG TACGTGGTGA 1620 

TATCAACGTG CTCTTGTGCG GAGACCCTGG CACAGCGAAG TCGCAGTTTC TCAAGTATAT • 1680 

TGAGAAAGTG TCCAGCCGAG CCATCTTCAC CACTGGCCAG GGGGCGTCGG CTGTGGGCCT 1740 

CACGGCGTAT GTCGAGCGGC ACCGTGTCAG' CAGGGAGTGG ACCTTGGAGG CTGGGGCCCT 1800 

GGTTCTGGCT GACCGAGGAG TGTGTCTCAT TGATGAATTT GACAAGATGA ATGACCAGGA 1860 

CAGAACGAGC ATCCATGAGG CCATGGAGCA ACAGAGCATC TCCATCTCGA AGGCTGGCAT 1920 

CGTCACCTCC CTGCAGGCTC GCTGCACGGT CATTGCTGCC GCCAACCCCA TAGGAGGGCG . 1980 

tCTACGACCCC TCGCTGACTT TCTCTGAGAA CGTGGACCTC ACAGAGCCCA TCATCTCACG 2040 

CTTTGACATC CTGTGTGTGG TGAGGGACAC CGTGGACCCA GTCCAGGACG AGATGCTGGC 2100 

CCGCTTCGTG GTGGGCAGCC ACGTCAGACA CCACCCCAGC AACAAGGAGG AGGAGGGGCT 2160 

GGCCAATGGC AGCGCTGCTG AGCCCGCCAT. GCCCAACACG TATGGCGTGG AGCCCCTGCC 2220 

CCAGGAGGTC CTGAAGAAGT ACATCATCTA CGCCAAGGAG AGGGTCCACC CGAAGCTCAA 2280 

CCAGATGGAC CAGGACAAGG TGGCCAAGAT GTACAGTGAC CTGAGGAAAG AATCTATGGC 2340 

GACAGGCAGC ATCCCCATTA CGGTGCGGCA CATCGAGTCC ATGATCCGCA TGGCGGAGGC 2400 

CCACGCGCGC ATCCATCTGC GGGACTATGT- GATCGAAGAC GACGTCAACA TGGCCATCCG 2450 

CGTGATGCTG GAGAGCTTCA TAGACACACA GAAGTTCAGC GTCATGCGCA GCATGCGCAA 2520 

GACTTTTGCC CGCTACCTTT CATTCCGGCG TGACAACAAT GAGCTGTTGC TCTTCATACT 2580 

GAAGCAGTTA GTGGCAGAGC AGGTGACATA TCAGCGCAAC CGCTTTGGGG CCCAGCAGGA 2640 

CACTATTGAG GTCCCTGAGA AGGACTTGGT GGATAAGGCT CGTCAGATCA ACATCCACAA 2700 
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CCTCTCTGCA TTTTATGACA GTGAGCTCTT 
AAGGAAAATG ATCCTGCAGC AGTTCTGAGG 
TTCTGGTTTG GGGTGGTCAG TGCCCTCTGT 
TGAACTCGGG . GTACTAGGGT . CAGGGCTTAT • 

. TGTTTGTTTC TCCAAGCCTG CTTTGTGCTT- 
TGTCTTACTT GGTTGCTGAA CATCTTGCCA 
TTGGATCAGA GCTGCTGAGT TCAGGATGCC 
ATGGATGTCA GGAGAGCTGC ' TGCCCTCTTG 

. TGCCT TTGG C CAGAGAGCTG GTTGAAGATG. 
TCTGTGCCCC. TGTGGTGGAA- GAGGGCACGA 
GTCGCAGGGG TGGGATGTGA' GTCATGCGGA 
TTGCTCCCTG . TCTGTTTCCC. CACTCTCTTA 
TAATTTTTAA TAAAGTTGAA TAAAATATAA 

Seq ID NO:- 329 Protein sequence: 
Protein Accession 8 : AAH17490 . 1 



CAGGATGAAC 
CCCTATGCCA 
GCTTTATGGA. 
AGCAGGATGT 
CTCACCTTTG 
CCTCCGAGTG 
TGCGTGTGGT 
GCGTGAGTTG 
TTTGTAATCG 
CAGTGCCAGC 
TTATCCACTC 
TTTGTGCATT 
AAAAAAAAAA 



AAGTTCAGCC 
TCCATAAGGA 
CACAAAACCA 
CTGGCTGCAC 
GGTGGGATGC 
CTTTGTCTCC 
TTAGGTGTTA 
OGTATTCAGG 
TTTTCAGTCT 
GCAGCGTTCT 
GCCACAGTTA 
CGGTTTGGTT 
AAAAAA 



1 

i 

MAESSESFTM. 
GPLEEEEDGE 
DREAGRGLGR 
SVREWVSMAG 
EHVLAYFLPE 
HLNQIiIRTSG 
FEVNMEETIY-- 
NYDGSLNTAN 
IAPSIYGHED 
SRAIFTTGQG 
HEAMEQQSIS 
CWRDTVDPV 
KKYIIYAKER 
HLRDYVIEDD 
AEQVTYQRNR 
LQQF 



11 

I 

ASSPAQRRRG 
ELIGDGMERD 
MRRGLLYDSD 
PRIiEIHHRFK- 
APAELLQIFD- 
WTSCTGVLP 
QNYQRIRIQE 
GFPVPATVIL 

ikrglalalf 
asavgi/tayv 
iskagivtsl 
qdemlarfw 
vhpklnqmdq;. 

VNMAIRVMLE" 
PGAQQDTIEV, 



NDPLTSSPGR 
YRAIPELDAY 



NFLRTHVDSH 
EAALEWLAM 
QLSMVKYNCN 
SPGKVAAGRL 
ANHVAKKDNK 
GGEPKNPGGK 
QRHPVSREWT 
QARCTVIAAA 
GSHVRHHPSN 
DKVAKMYSDL 
SFIDTQKFSV 
PEKDLVDKAR 



31 
I 

SSRRTDALTS 
EAEGLALDDE 
RRQVERATED 
GHNVFKERIS 
YPKYDRITNH 
KCNFVLGPFC 
PRSKDAILLA 
VAVGELTDED 
HKVRGDINVL 
LEAGALVLAD 
NPIGGRYDPS. 
KEEEGLANGS 
RKESMATGS* 
MRSMRKTFAR 
QINIHNLSAF 



41 
I 

SPGRDLPPFE 
DVEELTASQ.R 
GEEDEEMIES 
DMCKENRESL 
IHVRISHLPL 
QSQNQEVKPG 
DLVDSCKPGD 
VKMITSLSKD 
LCGDPGTAXS 
RGVCLIDEFD; 
LTFSENVDIjT: 
AAEPAMPNTY- 
PITVRHIESM' 
YLSFRRDNNE 
YDSELFRMNK 



ACGACCTGAA 
TTCCTTGGGA 
GAGCACTTGA 
CTGGCATGAC 
CTTGCCAGTG 
ACTCAGTACC 
GCCTTCTTAC 
CTGCTTTTGC 
CCTGCAGGTT 
GGGCTCCTCA 
TCAGCTGCCA 
TCTGTAGTTT 



51 
I 

DESEGLLGTE 
EAAERAMRQR: 
IENLEDLKGH, 
WNYEDLAAR" 
VEELRSLRQL- 
SCPECQSAGP 
EIELTGIYHNi- 
QQ I GEKI FAS* 
QFLKYIEKVS : 
KMNDQDRTSI" 
EPIISRFDIL: 
GVEPLPQEVI*; 
IRMAEAHARI- 
LLLFILKQLV 
FSHDLKRKMI 



2760 
•2820. 
2860 
2940 
3000 
3060 . 
3120 
3180 
3240 
3300 
3360 
3420 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660. 
720 
780 
840 
900 



Seq ID NO: 330 DNA sequence 
Nucleic Acid Accession ft: M17254 
Coding sequence: 257-1645 

1 11 21 31 41 51 

I I I I i I 

GTCCGCGCGT GTCCGCGCCC GCGTGTGCCA GCGCGCGTGC CTTGGCCGTG CGCGCCGAGC . 60 
CGGGTCGCAC TAACTCCCTC~ GGCGCCGACG GCGGCGCTAA CCTCTCGGTT ATTCCAGGAT. 120 
CTTTGGAGAC CCGAGGAAAG; CCGTGTTGAC CAAAAGCAAG ACAAATGACT CACAGAGAAAr 180 
AAAGATGGCA GAACCAAGGG CAACTAAAGC CGTCAGGTTC TGAACAGCTG GTAGATGGGC 240 
TGGCTTACTG: AAGGACATGA^ TTCAGACTGT CCCGGACCCA GCAGCTCATA TCAAGGAAGC 300 
CTTATCAGTT GTGAGTGAGG' ACCAGTCGTT GTTTGAGTGT GCCTACGGAA CGCCACACCT 360 
GGCTAAGACA GAGATGACCG CGTCCTCCTC CAGCGACTAT GGACAGACTT CCAAGATGAG 420 
CCCACGCGTC CCTCAGCAGG ATTGGCTGTC TCAACCCCCA GCCAGGGTCA CCATCAAAAT 480 
GGAATGTAAC CCTAGCCAGG^ TGAATGGCTC AAGGAACTCT CCTGATGAAT GCAGTGTGGC 540 
CAAAGGCGGG AAGATGGTGG GCAGCCCAGA CACCGTTGGG- ATGAACTACG GCAGCTACAT. 600 
GGAGGAGAAG- CACATGCCAC CCCCAAACAT GACCACGAAC GAGCGCAGAG TTATCGTGCC 660 
AGCAGATCCT ACGCTATGGA GTACAGACCA- TGTGCGGCAG TGGCTGGAGT GGGCGGTGAA ■ 720 
AGAATATGGC CTTCCAGACG. TCAACATCTT GTTATTCCAG AACATCGATG GGAAGGAACT:" 780 
GTGCAAGATG ACCAAGGACG- ACTTCCAGAG GCTCACCCCC AGCTACAACG CCGACATCCT 840 
TCTCTCACAT- CTCCACTACC TCAGAGAGAC TCCTCTTCCA CATTTGACTT CAGATGATGT- 900 
TGATAAAGCC- TTACAAAACT, CTCCACGGTT: AATGCATGCT AGAAACACAG ATTTACCATA-. 960 

TGAGCCCCCC AGGAGATCAG CCTGGACCGG TCACGGCCAC CCCACGCCCC AGTCGAAAGC 1020 

TGCTCAAGCA TCTCCTTCCA 'CAGTGCCCAA AACTGAAGAC CAGCGTCCTC AGTTAGATCC 1080 

TTATCAGATT CTTGGACCAA CAAGTAGCCG- CCTTGCAAAT CCAGGCAGTG GCCAGATCCA 1140 

GCTTTGGCAG TTCCTCCTGG AGCTCCTGTC GGACAGCTCC AACTCCAGCT GCATCACCTG 120.0 

GGAAGGCACC AACGGGGAGT- TCAAGATGAC GGATCCCGAC GAGGTGGCCC GGCGCTGGGG. 1260 

AGAGCGGAAG AGCAAACCCA ACATGAACTA CGATAAGCTC AGCCGCGCCC TCCGTTACTA 1320 

CTATGACAAG AACATCATGA CCAAGGTCCA TGGGAAGCGC TACGCCTACA AGTTCGACTT- 1380 

CCACGGGATC GCCCAGGCCC TCCAGCCCCA* CCCCCCGGAG TCATCTCTGT ACAAGTACCC 1440 

CTCAGACCTC CCGTACATGG GCTCCTATCA CGCCCACCCA CAGAAGATGA ACTTTGTGGC 1500 

. GCCCCACCCT CCAGCCCTCC CCGTGACATC TTCCAGTTTT TTTGCTGCCC CAAACCCATA 1560 

CTGGAATTCA CCAACTGGGG .GTATATACCC CAACACTAGG CTCCCCACCA GCCATATGCC 1620 

TTCTCATCTG GGCACTT ACT ACTAAAGACC TGGCGGAGGC TTTTCCCATC AGCGTGCATT 1680 

CACCAGCCCA TCGCCAGAAA CTCTATCGGA GAACATGAAT CAAAAGTGCC TCAAGAGGAA 1740 

TG AAAAAAGC . TTTACTGGGG - CTGGGGAAGG AAGCCGGGGA AGAGATCCAA AGACTCTTGG 1800 ■ • 

GAGGGAGTTA CTGAAGTCTT ACTACAGAAA TGAGGAGGAT GCTAAAAATG TCACGAATAT IB 60 

GGACATATCA TCTGTGGACT- GACCTTGTAA AAGACAGTGT ATGTAGAAGC ATGAAGTCTT - 1920 

AAGGACAAAG TGCCAAAGAA AGTGGTCTTA AGAAATGTAT AAACTTTAGA" ' GTAGAGTTTG 1980 

AATCCCACTA ATGCAAACTG GGATGAAACT AAAGCAATAG AAACAACACA GTTTTGACCT 2040 ' 

AACATACCGT TTATAATGCC ATTTTAAGGA AAACTACCTG TATTTAAAAA' TAGTTTCATA 2100 

TCAAAAACAA GAGAAAAGAC ACGAGAGAGA CTGTGGCCCA TCAACAGACG TTGATATGCA- 2160 

ACTGCATGGC ATGTGCTGTT TTGGTTGAAA TCAAATACAT TCCGTTTGAT GGACAGCTGTr 2220 

CAGCTTTCTC AAACTGTGAA GATGACCCAA AGTTTCCAAC TCCTTTACAG TATTACCGGG'* 2280 . 

ACTATGAACT' "AAAAGGTGGG ACTGAGGATG TGTATAGAGT GAGCGTGTGA TTGTAGACAG 2340 

AGGGGTGAAG AAGGAGGAGG AAGAGGCAGA GAAGGAGGAG ACCAGGCTGG GAAAGAAACT- 2400 

TCTCAAGCAA -TGAAGACTGG' ACTCAGGACA TTTGGGGACT GTGTACAATG AGTTATGGAG: 2460 

ACTCGAGGGT TCATGCAGTC AGTGTTATAC CAAACCCAGT GTTAGGAGAA AGGACACAGC 2520 

GTAATGGAGA AAGGGAAGTA GTAGAATTCA GAAACAAAAA TGCGCATCTC TTTCTTTGTT 2580 

TGTCAAATGA AAATTTTAAC TGGAATTGTC TGATATTTAA GAGAAACATT CAGGACCTCA 2640 

TCATTATGTG GGGGCTTTGT TCTCCACAGG GTCAGGTAAG AGATGGCCTT CTTGGCTGCC 2700 
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ACAATCAGAA ATCAGGCAGG CATTTTGGGT AGGCGGCCTC CAGTTTTCCT TTGAG TCGCG 2760 

AACGCTGTGC GTTTGTCAGA ATGAAGTATA CAAGTCAATG TTTTTCOCCC TTTTTATATA 2820 

ATAATTATAT AACTTATGCA TTTATACACT ACGAGTTGAT CTCGGCCAGC CAAAGACACA 2880 

CGACAAAAGA GACAATCGAT ATAATGTGGC CTTGAATTTT AACTCTGTAT GCTTAATGTT ■ 2940 

TACAATATGA" AGTTATTAGT TCTTAGAATQ CAGAATGTAT. GTAATAAAAT AAGCTTGGCC 3000 

TAGCATGGCA AATCAGATTT ATACAGGAGT CTGCATTTGC ACTTTTTTTA G TGAC TAAAG 3060 

TTGCTTAATG AAAACATGTG CTGAATGTTG TGGATTTTGT GTTATAATTT ACTTTGTCCA 3120 
GGAACTTGTG CAAGGGAGAG CCAAGGAAAT AGGATGTTTG GCACCC 

Seq ID HO: 331 Protein sequence 
Protein Accession #: AAA52398 

1 11 21 31 41 51 

I |il I I 

MIQTVPDPAA HIKEALSWS EDQSLFECAY GTPHLAKTEM TASSSSDYGQ TSKMSPRVPQ 60 

QDWLSQPPAR VTIKMECNPS QVNGSRNSPD ECSVAKGGKM VGSPDTVGMN YGSYMKEKEM 120 

PPPNMTTNER ' RVIVPADPTL WSTDHVRQWL EWAVKEYGLP DVNILLFQNI- DGKELCKMTK 180 
DDFQRLTPSY: NADILLSHLH YLRETPLPHL TSDDVDKALQ; NSPRLMHARtt TDLPYEPPRR" 240 

SAWTGHGHPT; PQSKAAQPSP STVPKTEDQR PQLDPYQ I LG PTSSRLANPG SGQIQLWQFIi 300 

LELLSDSSNS* SCITWEGTNG' EFKMTDPDEV- ARRWGERKSK PNMNYDKLSR ALRYYYDKNI 360 

MTKVHGKRYA* YKFDFHGIAQ ALQPHPPESS LYKYPSDLPY " MGSYHAHPQK MKFVAPHPPA 420 

LPVTSSSFFA- APNPYWNSPT GGIYPNTRLP TSHMPSHLGT YY 462 

Seq ID NOr 332 DNA .sequence 
Nucleic Acid Accession #: NM_000020 
Coding- sequence : 283-1794* 

1 11 21 . . 31 41 " 51 

AGGAAACGG7 TTATTAGGAG- GGAGTGGTGG .AGCTGGGCCA GGCAGGAAGA CGCTGGAATA 60 

AGAAACATTT TTGCTCCAGC CCCCATCCCA GTCCCGGGAG - GCTGCCGCGC CAGCTGCGCC 120 

GAGCGAGCCC CTCCCCGGCT CCAGCCCGGT CCGGGGCCGC. GCCGGACCCC AGCCCGCCGT. 180 

CCAGCGCTGG. CGGTGCAACT GCGGCCGCGC GGTGGAGGGG' AGGTGGCCCC GGTCCGCCGA- 240 

AGGCTAGCGC CCCGCCACCC GCAGAGCGGG CCCAGAGGGA CCATGACCTT GGGCTCCCCC-' 300 

AGGAAAGGCC TTCTGATGCT GCTGATGGCC TTGGTGACCC AGGGAGACCC TGTGAAGCCG 360 

TCTCGGGGCC CGCTGGTGAC CTGCACGTGT GAGAGCCCAC ATTGCAAGGG GCCTACCTGC 420 

CGGGGGGCCT; GGTGCACAGT AGTGCTGGTG CGGGAGGAGG GGAGGCACCC CCAGGAACAT ■ 430 

CGGGGCTGCG GGAACTTGCA CAGGGAGCTC TGCAGGGGGC GCCCCACCGA GTTCGTCAAC. 540 

CACTACTGCT GCGACAGCCA CCTCTGCAAC CACAACGTGT CCCTGGTGCT. GGAGGCCACC 600 

CAACCTCCTT - CGGAGCAGCC GGGAACAGAT GGCCAGCTGG CCCTGATCCT GGGCCCCGTG 660 

CTGGCCTTGC TGGCCCTGGT. GGCCCTGGGT GTCCTGGGCC TGTGGCATGT CCGACGGAGG 720 

CAGGAGAAGC AGCGTGGCCT GCACAGCGAG: CTGGGAGAGT CCAGTCTCAT CCTGAAAGCA 780 

TCTGAGCAGG; GCGACACGAT . GTTGGGGGAC CTCCTGGACA GTGACTGCAC CACAGGGAGT 840 

GGCTCAGGGC TCCCCTTCCT GGTGCAGAGG" ACAGTGGCAC GGCAGGTTGC CTTGGTGGAG 900 

TGTGTGGGAA AAGGCCGCTA TGGCGAAGTG: TGGCGGGGCT TGTGGCACGG TGAGAGTGTG 9 SO 

GCCGTCAAGA TCTTCTCCTC GAGGGATGAA CAGTCCTGGT TCCGGGAGAC TGAGATCTAT 1020 

AACACAGTAT TGCTCAGACA CGACAACATC CTAGGCTTCA; TCGCCTCAGA CATGACCTCC ' 1080 

CGCAACTCGA GCACGCAGCT GTGGCTCATC ACGCACTACC ACGAGCACGG CTCCCTCTAC 1140 

GACTTTCTGC AGAGACAGAC GCTGGAGCCC CATCTGGCTC TGAGGCTAGC TGTGTCCGCG - 1200 

GCATGCGGCC TGGCGCACCT GCACGTGGAG 1 ATCTTCGGTA ' CACAGGGCAA ACCAGCCATT 1260 

GCCCACCGCG ACTTCAAGAG CCGCAATGTG CTGGTCAAGA GCAACCTGCA* GTGTTGCATC 1320 

GCCGACCTGG GCCTGGCTGT GATGCACTCA . CAGGGCAGCG ATTACCTGGA CATCGGCAAC 13 B 0 

AACCCGAGAG. TGGGCACCAA GCGGTACATG GCACCCGAGG TGCTGGACGA GCAGATCCGC 1440 

ACGGACTGCT TTGAGTCCTA CAAGTGGACT GACATCTGGG CCTTTGGCCT - GGTGCTGTGG 1500 

GAGATTGCCC GCCGGACCAT .CGTGAATGGC ATCGTGGAGG ACTATAGACC. ACCCTTCTAT 1560 

GATGTGGTGC CCAATGACCC CAGCTTTGAG GACATGAAGA. AGGTGGTGTG TGTGGATCAG 1620 

CAGACCCCCA CCATCCCTAA CCGGCTGGCT. GCAGACCCGG TCCTCTCAGG CCTAGCTCAG 1680 

ATGATGCGGG AGTGCTGGTA CCCAAACCCC TCTGCCCGAC TCACCGCGCT. GCGGATCAAG 1740 

AAGACACTAC AAAAAATTAG CAACAGTCCA" GAGAAGCCTA AAGTGATTCA ATAGCCCAGG 1800 

AGCACCTGAT TCCTTTCTGC CTGCAGGGGG. CTGGGGGGGT GGGGGGCAGT GGATGGTGCC 1860 

CTATCTGGGT AGAGGTAGTG , TGAGTGTGGT GTGTGCTGGG GATGGGCAGC TGCGCCTGCC 1920 

TGCTCGGCCC CCAGCCCACC CAGCCAAAAA TACAGCTGGG CTGAAACCTG ATCCCCTGCT 19 80 

GTCTGGCCTG CTCAAAGCGG CAGGCTCCCT GACGCCTGGC TCTCTCCCCA CCCCTATGGC 2040 

CAGCATGGTG CACCCCCTAC CACTCCCGGG ACAGGATGCA AAAGAGGCTC CAGAGTCAGA 2100 

GTGCCAAGCC AGGGAATCCC AGTCCCAGAC TCAGAGCCCG GGCCTGCACT TTGCCCCCTG 2160 

CCCTTGATCA ACCCCACTGC CCCACCAGAG CTGCCAGGGT GGCACAGGGC CCTGTCCAGC 2220 

CCCTGGCACA CACTTCCCTG CCAGGCCTCA GCCTCTAGCA TAAGCTCCAG AGAGCCAGGG 2280 

CCCATCAGTT TCTCTCTGTG GATTTGTATC TCAGCTCCAT GATGCCTTGG GCTTTCTGTC 2340 

TCCTCAACAA GAGTGCAGCT TGCTGAATGT CAGCTGCCTG AGAGAGCTGG GGCCTGACTT 2400 

ACTAGGGCAT TAAATCCTAA GAGGTCCTAC TGAGGTGTGG CAGGATCACA GGCCAGTGGA 2460 

AAAAGGGCAG GTCAGATGGG CAAGGCCCAG GACTTTCAGA TTAACTGAGA GGATATCGAG 2520 

GCCAAGCATG GCAGGGGGAA GGTCAGTGGG TGTCAAGAGA CCCAGGTCTG ACCCCGGATG 2580 

TTTGCTCCAT GTGACAAAAG CAGGCCTGTC TCAGGACCTT TTCTTTTCTT TTTTCCTTCT 2640 

TTTTTTTTTT GACACGGAGT TTCGCTCTTG TTGTCCAGGC TAGAGTGCAA TGGCATGATC 2700 

CCAGCTCACC GCAACGTCTA CCTCCCAGGT TCAAATCATT CtCTTGCCTC AGACTCCCGA 2760 

GTAGCTGGGA TTACAGGCAC ATGCCACCAT GCCTGGCTAA TTTTGTATAT TTAGTAGAAA 2820 

CAGGGTTTCA CCATGCTGGC CATGCTGGTT- CTCGAACTCC TGACCTCAGG TGTTCCACCT 2880 

ACCTCAGCCT CCCAAAGTGC TGGGGTTACA GGTGTGAGCC ATCGCGCCTG GCCAGGACCT 2940 

TTGTTTCTTA TCTACATATT GGAAGATTTG GTCCTGATGT CCTTTGAGGC TTCTTTAGCT 3000 

CTAGTTCTCT. GACACTTCAG CCTATATCAC AGCTAACTTC YTCAGTCTCA TCTATTCCTT 3060 

ATGCTCCAGC CCCTGGCAAT TTGCCTCAAG ATGGGGGTTT GAAAATAACT TTACCTGACT 3120 

CAAGGAGTGT CTGGAGCACC TCCTAGTCTA AGTCTGCAAG CTCCAGTTCT TGCCTAAAAC 3180 

CATGCCAGTG GCCACCCTTG GGCTCAGACA GCTCTGGGCC TTTTGACCAC AAGCCAGCCC 3240 

CTCGCCCTCT CTGTGGCATA GTCTTCTCTG CCCCAGGACT GCAGGGOGGC TTCCTCCAAG 3300 

GCTTCCAAGG CTCAAAAGAA ATTTGGCTCC ATCCAAGAAG GCTCCAGCTC CCCTACTGGC 3360 

CCCTGGCTTC AGGCCCACAC CCCTGGGCCA GGSCCAGAGA GTGTGTCTCA GGAGAATTCA 3420 

ATGGGCTCTA GAGAGACACA CAGAAAGTTT GGGCATTTGG GAAATTTTCA AGGRTGTATG 3480 
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10 



15 



TATGGYTCAC 
GAAGTGGATT 
GACAAGGACA 
GCAGTGAAGA" 
GCATCTTATG 
CATTGTGCAA. 
TGGATGGGCT 
AATCCACCCA~ 
GGAACAAACT 
TGGAAAATCC 
GAGAAGGGGG 
ATGCTTCTGT. 
AGACGCTGTT: 
ATGGTTAAAT. 



GTATGGWGCA 
GGAGGGGAGC 
GCCCCAAGGT 
AAGCTCTCCC 
TGTGTCTTCC 
GGCTCGGAAG 
AGGTTCCCAG. 
GCCCACGAAT 
CCTGCTGAGA 
CTAAGAGAAG, 
CCCAATGGCC 
CTGAGTGCAG 
TGTGGGAGCA 
CCTGAAAAAA 



GGTTGTCCTG 
TTGAGGAATA 
TGGGAAGACC 
CGCTCCTGCT 
ACCATCCTCA 
'AGAACCAGGA 
ATCATTAGGG 
CATCTCCCTC 
CCCCACAGCC 
GCCTGGGGGA 
AGGGAGTGAA 
GAAGGTGTTC 
CTGGGCTCAT 
AAAAAAAAA 



GTCCYKGGGT 
TAAGGAGCGG 
TGGCCTTAGT 
GTAATGACCC 
TGGTGGCACT 
AGTGAAACTG 
CAGAGTTTGC 
TTTGAAGGAT 
AGAAACTGAA 
MAGGAAKTGG 
GGAGGTGGCG 
CAGGGTCGAA 
GCCTGGCACA 



GCAGGGAAGT 
GGGTGGAGAC 
GGTCCTCAGC 
AGAG7AGCCT 
TTTCTAGGCC 
GGTGAAAACA 
ACGTCCTCTG 
TTTWATTTCT 
AGCAGCAGCT 
AGTGACAGGG 
TTGCTGAGAG 
ATTACACTTC 
CAATAGGTCT 



GGGCTGCAGG 
TCAGGCTATG 
.CTAGGGCAGG 
CCCCAGGCCG 
TGTCTCCCAG 
GAAAGCTCAA 
GTTCACTGGG 
ACTGGGTTTT 
CCCCAAAGCC 
GACAGGTAGA 
CAGTCTGCAC 
TCGTACCTGG 
GCAATAAACC 



3540 
3600 
3660 
3720- 
3780 
'3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 



Seq ID NO:/ 333 Protein sequence 
Protein Accession #: NP_000011 



20 

25 

30 

35 . 

40 

45 

50 

55 

60 

65 

70 

75 

80 

85 



1 
1 

MTLGSPRKGL 
RHPQEHRGCG 
LILGPVLALL 
DCTTGSGSGL 
RETEIYNTVL 
RLAVSAACGL 
YLDIGNNPRV 
YRPPFYDWP 
TALRIKKTLQ 



11 
I 

LMLLMALVTQ 
NIjHRELCRGR 

alvalgvlgi, 
pflvqrtvar 
lrhdnilgfi 
ahlhveifgt- 
gtkrymapev 
ndpsfedmkk 
kisnspekpk 



21 
I 

GDPVKPSRGP 
PTEFVNHYCC 
WHVRRRQEKQ 
QVALVECVGK 
ASDMTSRNSS 
QGKPAIAHRD 
LDEQIRTDCF 
WCVDQQTPT 
VIQ 



31 
I 

LVTCTCESPH 
DSHLCNHNVS" 
RGLHSEIGES 
GRYGEVWRGL 
TQLWLITHYH 
FKSRNVLVKS 
ESYKWTDIWA 
I PNRLAADPV 



Seq ID NO:, 334 DNA sequence 

Nucleic Acid Accession #; NM_004126.1 

Coding sequence: 10B-329 



1 
I 

GGCACGAGCT- 
AGCGGCTCCG 
ACATCGAAG A 
AAGTGAAGTT 
AAGAACGTTC: 
AAGAAAAAGG. 
AGAACTAGTT 
TGAAATTAAA" 
GTGCTACTCA 
ACAATTATGG 
GCTTCAAATA 



11 
I 

CGTGCCGGCC 
CTGCCAGAGC. 
TTTGCCAGAG 
GCAGAGACAA; 
TGGAGAGGAT 
CAGCTGTGTT 
TGTTTTAGTT- 
AGGAGACTTT 
TCTTTGCTCA" 
AAATAAGAAC. 
AAGTTTTGTC 



21 
I 

TTCAGTTGTT 
TAGCCCGAGC 
AAGGAAAAAC 
CAAGTGTCTA 
CCTCTAGTAA 
ATTTCATAAA- 
TTCCCAGATA 
CTTAAGCACC 
CTATGCAGTC 
ATTACTTGAG 
TT 



31 

I 

TCGGGACGCG- 
CCGGTTCTGG, 
TGAAAATGGA : 
AATGTTCTGA 
AGGGAATTCC 
TAACTTGGGA 
AAACCAACAT 
ATATAGATAG 
TTTTTTAAGA 
CATGACACTT 



41 
I 

CKGPTCRGAW 
LVLEATQPPS 
SLILKASEQG 
WHGESVAVKt 
EHGSLYDFLQ 
NLQCCIADLG 
FGLVLWEIAR 
LSGLAQMMRE 



41 

I 

CCGAGCTTCG 
GGCGAAAATG 
AGTTGAGCAG 
AGAAATAAAG 
AGAAGACAAa 
GAAACTGCAT. 
GCTTTTTAAG 
GGTTATGTAT 
GAGCAGAGAG 
CTTTCAGTAT 



51 
I 

CTWLVREEG 
EQPGTDGQIA- 
DTMLGDItLDS 
FSSRDEQSWF 
RQTLEPHLAI* 
IiAVMHSQGSD. 
RTIVNGIVED 
CWYPNPSARL 



51 
I 

CCGCTCTTCC 
CCTGCCCTTC 
CTTCGCAAAG. 
AACTATATTG 
AACCCCTTTA 
CCTAAGTGGA 
GAAGGAAGAA 
AAAAGCATAT 
TATCAGATGT 
ATTGCTTGAT 



Seq ID NO: 335 Protein. sequence 
Protein Accession 8: NP_004117.1 



11 



21 31 41 51 

MPALHIEDLP EKEKLKMEVE QLRKEVKLQR QQVSKCSEEI KNYIEERSGE DPLVKGIPED 
KNPFKEKGSC VIS 



Seq ID NO: 336 DNA sequence 
Nucleic Acid Accession &: NM_005795 
Coding sequence: 555-1940 



I 

GCACGAGGGA 
CAAGCTCTGC 
TTCCCACCTT 
TGAGAATATT 
AAGAAATTCT 
GACAATTGTG 
GAATAATAAA 
AAAGAAAACT 
ACAAGGTTGC 
ATTTGGGCTT 
TTATGATTCT 
TTACTAGAAA 
CCATTCAACA 
ACGATGTTGC 
ATCCATCAGA 
CAAGCAACAG 
AGACTGCACT 
TGCTTATCTC 
TACACAAAAA 
. CTGCAGTGGC 
AGTTCATTCA 
ACCTACACAC 
ATTTTCTTGG 



11 
I 

ACAACCTCTC 
TAACTGAATC 
GCTTGTGGGT 
TCACAAAGAA 
TAAAGACAAT 
CATATCGTCT 
AACCCATACT 
ACTACAACTT 
TATAAAACAA 
AATGATGGAG 
TGTTACAGCA 
TAAAATCATG 
AGCAGAAGGC 
AGCAGGAACT 
AAAAGTTACA 
AACATGGACA 
AAATTTGTTT 
GCTTGGCATA 
TCTGTTCTTC 
CAACAACCAG 
TCTTTACCTG 
ACTCATTGTG 
CTGGGGATTT 



21 
I 

TCTCTSCAGC 
TCAT CCTAAT 
AAATCTCTTC 
TTTCCTTAAG 
GTCAAATATG 
AATAATAAAA 
AGCCTATAGA' 
GACAAGACTG 
GATTGCTACA- 
AAAAAGTGTA 
GAATTAGAAG 
ACAGCTCAAT 
GTTTACTGCA' 
GAATCAATGC 
AAGATCTGTG 
AATTATACCC 
TACCTGACCA 
TTCTTTTATT 
TCATTTGTTT 
GCCTTAGTAG 
ATGGGCTGTA 
GTGGCCGTGT 
CCACTGATTC 



31 

I 

AGAGAGTGTC 
TGCAGGATCA 
TGCGGAATCT 
AGCTGGACTG 
ATCCAAGAGA 
ACCCATACTA 
AAACAATATT' 
CTGCAAACTT 
ACTTCTAGTT 
CCCTGTATTT 
AGAGTCCTGA 
ATGAATGTTA 
ACAGAACCTG 
AGCTCTGCCC 
ACCAAGATGG 
AGTGTAATGT 
TAATTGGACA 
TCAAGAGCCT 
GTAACTCTGT 
CCACAAATCC 
ATTACTTTTG 
TTGCAGAGAA 
CTGCTTGTAT 



41 
I 

ACCTCCTGCT 
CATTGCAAAG 
CAGAAAGTAA 
GGTCTTGACC 
AAATGTGATT 
GCCTATAGAA 
TGAAAGATTG 
CAATTGGTCA 
TATGTTATAC 
TCTGGTTCTC 
GGACTCAATT 
CCAAAAGATT 
GGATGGATGG 
TGATTACTTT 
AAACTGGTTT 
TAACACCCAC 
CGGATTGTCT 
AAGTTGCCAA 
TGTAACAATC 
TGTTAGTTGC 
GATGCTCTGT 
GCAACATTTA 
ACATGCCATT 



51 
I 

TTAGGACCAT 
CTTTCACTCT 
AGTTCCATCC 
CCTGGAATTT 
TGAGTCTGGA 
AACAATATTT 
CTACCACTAA 
CCACAACTTG 
AGCATATTTC 
TTGCCTTTTT 
CAGTTGGGAG 
ATGCAAGACC 
CTCTGCTGGA. 
CAGGACTTTG 
AGACATCCAG 
GAGAAAGTGA 
ATTGCATCAC 
AGGATTACCT 
ATTCACCTCA 
AAAGTGTCCC 
GAAGGCATTT 
ATGTGGTATT 
GCTAGAAGCT 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
'660 
720 
780 
840 
900 
960 
1020 . 
1080 
1140 
1200 
1260 
1320 
1380 
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WO 02/086443 < < 

TATATTACAA TGACAATTGC TGGATCAGTT CTGATACCCA TCTCCTCTAC ATTATCCATG 1440 

GCCCAATTTG TGCTGCTTTA CTGGTGAATC TTTTTTTCTT GTTAAATATT GTAOGOGTTC 1500 

TCATCACCAA GTTAAAAGTT ACACACCAAG CGGAATCCAA TCTGTACATG AAAGCTGTGA IS SO 

GAGCTACTCT TATCTTGGTG CCATTGCTTG GCATTGAATT TGTGCTGATT CCATGGCGAC 1620 

CTGAAGGAAA GATTGCAGAG GAGGTATATG ACTACATCAT GCACATCCTT ATGCACTTCC 1680 

AGGGTCTTTT GGTCTCTACC ATTTTCTGCT TCTTTAATGG AGAGGTTCAA GCAATTCTGA 1740 

GAAGAAACTG GAATCAATAC AAAATCCAAT TTGGAAACAG CTTTTCCAAC TCAGAAGCTC 1800 

TTCGTAGTGC GTCTTACACA GTGTCAACAA TCAGTGATGG TCCAGGTTAT AGTCATGACT 1860. 

GTCCTAGTGA ACACTTAAAT GGAAAAAGCA TCCATGATAT TGAAAATGTT CTCTTAAAAC 1920 

CAGAAAATTT ATATAATTGA AAATAGAAGG ATGGTTGTCT CACTGTTTGG TGCTTCTCCT I960 

AACTCAAGGA CTTGGACCCA TGACTCTGTA GCCAGAAGAC TTCAATATTA AATGACTTTG 2040 

GGGAATGTCA TAAAGAAGAG CCTTCACATG AAATTAGTAG TGTGTTGATA AGAGTGTAAC 2100 

ATCCAGCTCT ATGTGGGAAA AAAGAAATCC TGGTTTGTAA TGTTTGTCAG TAAATACTCC 2160 

CACTATGCCT. GATGTGACGC TACTAACCTG ACATCACCAA GTGTGGAATT GGAGAAAAGC 2220 

ACAATCAACT TTTCTGAGCT GGTGTAAGCC AGTTCCAGCA CACXATTGAT GAATTCAAAC 2280 

AAATGGCTGT AAAACTAAAC ATACATGTTG GGCATGATTC TACCCTTATT. CSC CCCAAGA 2340 

GACCTAGCTA AGGTCTATAA - ACATGAAGGG. AAAATTAGCT TTTAGTTTTA'< AAACTCTTTA 2400. 

TCCCATCTTG ATTGGGGCAG TTGACTTTTT TTTTTTCCCA GAGTGCCGTA GTCCTTTTTG 2460 

TAACTACCCT CTCAAATGGA CAATACCAGA AGTGAATTAT CCCTGCTGGC TTTCTTTTCT 2520 

CTATGAAAAG CAACTGAGTA CAATTGTTAT GATCTACTCA TTTGCTGACA CATCAGTTAT 2580 

ATCTTGTGGC ATATCCATTG. TGGAAACTGG ATGAACAGGA TGTATAATAT GCAATCTTAC 2640 

TTCTATATCA TTAGGAAAAC ATCTTAGTTG ATGCTACAAA ACAC CTTGT C AACCTCTTCC 2700 

TGTCTTACCA ; AACAGTGGGA GGGAATTCCT AGCTGTAAAT ATAAATTTTG CCCTTCCATT 2760 

TCTACTGTAT AAACAAATTA" GCAATCATTT TATATAAAGA AAATCAATGA- AGGATTTCTT 2820 

ATTTTCTTGG AATTTTGTAA AAAGAAATTG TGAAAAATGA GCTTGTAAAT ACTCCATTAT 2 880. 

TTTATTTTAT AGTCTCAAAT CAAATACATA CAACCTATGT AATTTTTAAA GCAAAT ATAT .: 2 940 

AATGCAACAA' TGTGTGTATG TTAATATCTG ATACTGTATC TGGGCTGATT TTTTAAATAA 3000 
AATAGAGTCT* GGAATGCT 



Seq ID NO; 337 protein sequence 
Protein Accession #: NPJ) 05786.1 



Seq ID NOs 338 DNA sequence 
Nucleic Acid Accession #: NM_001795 
Coding sequence: 25-2379 



60 
120 
180 
240 
300 
360 
420 



1 11 21 31 41 51 

I I I I I I 

MEKKCTLYFL VLLPFFMILV TAELEESPED SIQLGVTRNK IMTAQYECYQ KIMQDPIQQA 
EGVYCNRTWD. GWLCWNDVAA GTESMQLCPD YFQDFDPSEK VTKICDQDGN WFRHPASNRT 
WTNYTQCNVN- THEKVKTALN LFYLTIIGHG LSIASLLISL GIFFYFKSLS CQRITLHKNL 
FPSFVCNSW-- TIIHLTAVAN NQALVATNPV SCKVSQFIHL YLMGCNYFWM LCEGIYLHTL 
IWAVFAEKQ HLMWYYFLGW GFPLIPACIH AIARSLYYND. NCWISSDTHL bYlIHGPICA 
ALLVNLFFLL NIVRVLITKL- KVTHQAESNIi YMKAVRATLI LVPLLGIEFV LIPWRPEGKI 
AEEVYDYIMH ILMHFQGLLV STIFCFFNGE VQAILRRNWN QYKIQFGNSF SNSEALRSAS 
YTVSTISDGP GYSHDCPSEH LNGKSIHDIE NVLLKPENLY N 



1 11 21 31 41 51 

I ■ I I I I I 

GCACGATCTG TTCCTCCTGG GAAGATGCAG AGGCTCATGA TGCTCCTCGC CACATCGGGC 60 

GCCTGCCTGG GCCTGCTGGC AGTGGCAGCA GTGGCAGCAG CAGGTGCTAA CCCTGCCCAA 120 

CGGGACACCC ACAGCCTGCT GCCCACCCAC CGGCGCCAAA AGAGAGATTG GATTTGGAAC 180 

CAGATG CACA ■ TTGATGAAGA GAAAAACACC TCACTTCCCC ATCATGTAGG CAAGATCAAG 240 

TCAAGCGTGA GTCGCAAGAA TGCCAAGTAC CTGCTCAAAG GAGAATATGT GGGCAAGGTC 300 

TTCCGGGTCG ATGGAGAGAC AGGAGACGTG TTCGCCATTG AGAGGCTGGA CCGGGAGAAT 3 60 

ATCTCAGAGT ACCACCTCAC TGCTGTCATT GTGGACAAGG ACACTGGTGA AAACCTGGAG 420 

ACTCCTTCCA GCTTCACCAT CAAAGTTCAT GACGTGAACG ACAACTGGCC TGTGTTCACG 480 

CATCGGTTGT TCAATGCGTC CGTGCCTGAG TCGTCGGCTG TGGGGACCTC AGTCATCTCT 540 

GTGACAGCAG TGGATGCAGA CGACCCCACT GTGGGAGACC ACGCCTCTGT CATGTACCAA 600 

ATCCTGAAGG GGAAAGAGTA TTTTGCCATC GATAATTCTG GACGTATTAT CACAATAACG 660 

AAAAGCTTGG ACCGAGAGAA GCAGGCCAGG TATGAGATCG TGGTGGAAGC GGGAGATGCC 720 

CAGGGCCTCC GGGGGGACTC GGGCACGGCC. ACCGTGCTGG TCACTCTGCA AGACATCAAT 780 

GACAACTTCC CCTTCTTCAC CCAGACCAAG TACACATTTG TCGTGCCTGA AGACACCCGT 840 

GTGGGCACCT CTGTGGGCTC TCTGTTTGTT GAGGACCCAG ATGAGCCCCA GAACCGGATG 900 

ACCAAGTACA GCATCTTGCG GGGCGACTAC CAGGACGCTT TCACCATTGA GACAAACCCC 960 

GCCCACAACG AGGGCATCAT GAAGCCCATG AAGCCTCTGG ATTATGAATA CATCCAGCAA 1020 

TACAGCTTCA TCGTCGAGGC CACAGACCCC ACCATCGACC TCCGATACAT GAGCCCTCCC 1080 

GCGGGAAACA GAGCCCAGGT CATTATCAAC ATCACAGATG TGGACGAGCC CCCCATTTTC 1140 

CAGCAGCCTT. TCTACCACTT CCAGCTGAAG GAAAACCAGA AGAAGCCTCT GATTGGCACA 1200 

GTGCTGGCCA TGGACCCTGA TGCGGCTAGG CATAGCATTG GATACTCCAT CCGCAGGACC 1260 

AGTGACAAGG GCCAGTTCTT CCGAGTCACA AAAAAGGGGG ACATTTACAA TGAGAAAGAA 1320 

CTGGACAGAG AAGTCTACCC CTGGTATAAC CTGACTGTGG AGGCCAAAGA • ACTGGATTCC 1380 

ACTGGAACCC CCACAGGAAA AGAATCCATT GTGCAAGTCC ACATTGAAGT TTTGGATGAG 1440 

AATGACAATG CCCCGGAGTT TGCCAAGCCC TACCAGCCCA AAGTGTGTGA ' GAACGCTGTC 1500 

CATGGCCAGC TGGTCCTGCA GATCTCCGCA ATAGACAAGG ACATAACACC ACGAAACGTG 1560 

AAGTTCAAAT^ TCACCTTGAA TACTGAGAAC AACTTTACCC TCACGGATAA TCACGATAAC 1620 

ACGGCCAACA TCACAGTCAA GTATGGGCAG TTTGACCGGG AGCATACCAA GGTCCACTTC 1680 

CTACCCGTGG TCATCTCAGA CAATGGGATG CCAAGTCGCA OGGGCACCAG CACGCTGACC 1740 

GTGGCCGTGT GCAAGTGCAA CGAGCAGGGC GAGTTCACCT TCTGCGAGGA TATGGCCGCC 1800 

CAGGTGGGCG TGAGCATCCA GGCAGTGGTA GCCATCTTAC TCTGCATCCT CACCATCACA 1860 

GTGATCACCC TGCTCATCTT CCTGCGGCGG CGGCTCCGGA AGCAGGCCCG CGCGCACGGC 1920 

AAGAGCGTGC CGGAGATCCA CGAGCAGCTG GTCACCTACG ACGAGGAGGG CGGCGGCGAG 1980 

ATGGACACCA CCAGCTACGA TGTGTCGGTG CTCAACTCGG TGCGCGGCGG CGGGGCCAAG 2040 

CCCCCGCGGC CCGCGCTGGA CGCCCGGCCT TCCCTCTATG CGCAGGTGCA GAAGCCACCG 2100 

AGGCACGCGC CTGGGGCACA CGGAGGGCCC GGGGAGATGG CAGCCATGAT CGAGGTGAAG 2160" 

AAGGAGGAGG CGGACCACGA CGGCGACGGC CCCCCCTACG ACACGCTGCA CATCTACGGC 2220 

TACGAGGGCT CCGAGTCCAT AGCCGAGTCC CTCAGCTCCC TGGGCACCGA CTCATCCGAC 2280 
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TCTGACGTGG ATTACGACTT .'CCTTAACGAC TGGGGACCCA 
CTGTACGGCT CGGACCCCCG GGAGGAGCTG CTGTATTAGG 
CTGGGGACCC AAACCCCCTG CAGCCCAGGC CAGTCAGACT 
AAATQGCAGT GACTCCCCAG CCCAGCACCC CTTCCTCGTG 
CCTTGGGATA • GCAAACTCCA GGTTCCTGAA ATATCCAGGA 
TTCTCAAATG CTGGCAAATC CAGGCTGGTG TTCTGTCTGG 
CTGTCACCCA CAGACCGCCG. TCTAACTCAA AGACTTCCTC 
GCAAAACAGA CTGTGTTTAA CTGCTGCAGG. GTCTTTTTCT 
TAAGGCTGGT, GAGGTCCTGG TGCCTATCTG CCTGGAGGCA 
TGTGGGGCAG GATTCTCTGC AGCCCATTCC CAAGGGAGAC 
GGGAGCCCTA GCCCTGCTCC AACTCCATAC TCCACTCCAA 
CCCTCTCCAG GCCTGTCAAG AGGGAGGAAG GGGCCCCATG: 
CTGAAGTGAC CTCACTGGCC TGCCATGCCA GTAACTGTGC. 
ATTCAGGGAA ATGCTTATTA AACCTTGAAG. CAACTGTGAA 
GAGATCAGGA GTGACAGATC ACAGGGTGAG GGCCACCTCC 
AGGCCTGGAA GAGCTGAGAC" CTTGCTTTGA GACTCCTCAG 
AGAAGGGGCA GATGTTCCCG GAGATCAGAA - GACGTCTCCC 
GCCAATCCAT- GCTCTCTTTC TTTTCTCTGT CTACTCCTTA 
CAAGATGTGG CCTTTAGCAA AACTGACAAT GTCCAAACCC 
GCCGAGCATG TGTCTTTACA CCTCGCTGTT GTCACATCTC 
CACCTTGCAG; AAGGAAGGCC CTGCCCTGCC CAACCTCTGT 
ACTGGAACGT TTCACTGCAA ACACACCTTG * GAGAAGTGGC 
AGGGAAGGAG. ACACCAAGCT CACCCTTCGT CATGGACCGA 
CCCTCACACT GCAAGGGATT GTAGATAACA CTGACTTGTT 
TTCTTATAAT GATTTTTTTA* CTAATGATAC TTACAAGTTT. 
GAATAAGGGT TTTTGCATAA TAAGCAGGTT. GTTATTTAGG 
TTTTTAGTTG GAAAAACAAT TCCTGTAACC TTCTATTTTC 
TACAGATAAT GTCTATATAT TGGCCAAACT GGTGCATGAC 
CCTAAATAAA GAAAAATCTT TAGCCTGGGC AACAAAAAAA 

Seq ID NO: 339 Protein sequence 
Protein Accession #: NP_001786 



PCTAJS02/12476 



GGTTTAAGAT 
CGGCCGAGGT 
CCAGGCACCA : 
GGTCCCAGAG 
ATATATGTCA 
GCTCAGACAT. 
TGGCTCCCCA 
AGGGTCCCTG' 
AAGGCCTGGA 
TGACCATCAT 
GTGCCCCACC 
GCAGCTCCTG 
TGTACTGAGC 
TTCATTCTGG 
ACACCCACCC 
CACCCCTCCA 
CTTCTCTGCC 
TCCCTTGGTT. - 
ACTCATGACT 
AGGGAACTGA 
GGTCACCCAT 
ATCAGTCAAC 
GGTTCCCACT 
TGTTTTAACC 
CTAGCTCTCA 
TTAACAAT AT 
TATAATTGTA 
AAGTACTGTA 



GCTGGCTGAG 

CACTCTGGGC 

CAGCCTCCAA 

ACCTCATCAG 

GTGATGACTA 

CCACATAACC 

AGGCTGCAAA 

AACGCCCTGG 

CAGCTTGACT 

GCCCTCTCTC 

ACTCCCCAAC ' 

ACCTTGGGTC 

ACTGAACCAC 

AGGGGCAGTG 

CCTCTGGAGA 

GTTTTGCCTG 

TCACCTGGTC 

TAGAGGAACC 

GCATGACGGA 

CCCTCAGGCA 

GCATCATTCC " 

AGAGAGGGGC 

CTGGCAAAGC 

AATAACTAGC 

CAGACATATA 

TAATTCAGGT 

GTAATTGCTC 

TTTTTTTATA 



MQRLMMLLAT 
NTSLPHHVGK 
VIVDKDTGEN 
PTVGDHASVM 
TATVLVTLQD 
DYQDAFTIET 
INITDVDEPP ; 
VTKKGDIYNE 
KPYQPKVCEN- 
GQFDREHTKV: 
WAILLCILT 
SVLNSVRRGG 
DGPPYDTLHI 
ELLY 



11 
I 

SGACLGLLAV 
IKSSVSRKNA 
LEtPSSFTIK 
YQILKGKEYF- 
INDNFPFFTQ 
NPAHNEGXIK 
IFQQPFYHFQ 
KELDREVYPW 
AVHGQLVLQI 
HFLPWISDN 
ITVITLLIFL 
AKPPRPALDA 
YGYEGSESIA 



21 
I 

AAVAAAGANP 
KYLLKGEYVG 
VHDVNDNWPV 
AIDNSGRIIT 
TKYTFWPED 
PMKPLDYEYI 
LKENQKKPLI 
YNLTVEAKEL 
SAIDKDITPR 
GMPSRTGTST 
RRRLRKQARA- 
RPSLYAQVQK 
ESLSSLGTDS 



31 
I 

AQRDTHSLLP 
KVFRVDAETG 
FTHRLFNASV 
ITKSLDREKQ. 
TRVGTSVGSL 
QQYSFIVEAT 
GTVLAMDPDA 
DSTGTPTGKE 
NVKFKFTLNT 
LTVAVCKCNE 
HGKSVPEIHE 
PPRHAPGAHG 
SDSDVDYDFL 



41 
1 

THRRQKRDWI 
DVFAIERLDR 
PESSAVGTSV 
ARYEIWEAR 
FVEDPDEPQN/ 
DPTIDLRYMS 
ARHSIGYSIR- 
SIVQVHIEVL 
ENNFTLTDNH 
QGEFTFCEDM 
QLVTYDEEGG 
GPGEMAAMIE 
NDWGPRFXML 



51 

1 - 

WNQMHIDEEK 
ENISEYHLTA • 
ISVTAVDADD 
DAQGLRGDSG. 
RMTKYSILRG 
PPAGNRAQVI 
RTSDKGQFFR 
DENDNAPEFA 
DNTANITVKY 
AAQVGVSIQA 
GEMDTTSYDV 
.VKKDEADHDG 
AELYGSDPRE 



Seq ID NO: 340 DNA sequence 
Nucleic Acid Accession #$ NM_003088 
Coding sequence: 112-1593 



GCGGAGGGTG 
CCCGCCACCC 
AACGGCACAG 
CTGACGGCCG 
CAGATCTGGA 
AGCCACCTGG 
GTGCCCGGTC 
CAGTCCGAGG 
CAGACGGTGT 
ATCTACAGTG 
GCCGTGGACC 
CAGCGCTACA 
GCGCGCCCCG 
CGCGACTGCG' 
AAGGCCACCA 
GTGCTGCAGG 
AATCAGGACG 
AAGTGTGCCT 
CAGTCCACCG 
CGCATCACAC 
GCCGCCTCGG 
CCCATCATCG 
CTGGACGCCA 
AACATCAAAG 
AGCGGCGACA 
AAGGTGGGCG 
ACCGTGGACC 
CCACATGGCG 
GGCGGGAGGC 
CCTGTCGCCC 



11 
I 

CGTGCGGGCC 
ACCTCCCGGG 
CCGAGGCGGT. 
AGGCGTTCGG 
CGCTGGAGCA 
GCCGCTACCT 
CCGACTGCCG 
CGCACCGGCG 
CCCCCGCCGA 
TCACCCGTAA 
GCGACGTGCC 
GCGTGCAGAC 
AGCCGGCCAC 
AGGGCCGTTA 
AGGTGGGCAA- 
CGGCCAACGA 
AGGAGACCGA 
TCCGTACCCA 
CCTCCAGCAA 
TGAGGGCGTC 
TGGAGACAGC 
TGTTCCGCGG 
ACCGCTCCAG 
ACTCCACAGG 
CTCCTGTGGA 
GGCGCTACCT 
CCGCCTCGCT 
GCTCCTGCCA 
AAGCCCCCTT 
CTATGGACTC 



21 
I 

GCGGCAGCCG 
GCCGCGCAGC 
GCAGATCCAG 
GTTCAAGGTG 
GCCCCCTGAC 
GGCGGCGGAC 
TTTCCTCATC 
CTACTTCGGC 
GAAGTGGAGC 
GCGCVACGCG 
GTGGGGCGTC 
CGCCGACCAC 
TGGCTACACG 
CCTGGCGCCG 
GGACGAGCTC 
GAGGAACGTG 
CCAGGAGACC 
CACGGGCAAG 
GAATGCCAGC 
CAATGGCAAG 
AGGGGACTCA 
GGAGCATGGC 
GTATGACGTC 
GAAATACTGG 
CTTCTTCTTC 
GAAGGGCGAC 
GTGGGAGTAC 
ACCCTCCCTG. 
GCCTTTCAAA 
CCCACTCTCC 



31 
I 

AACAAAGGAG 
GGCCTCTCGT. 
TTCGGCCTCA 
AACGCGTCCG 
GAGGCGGGCA 
AAGGACGGCA 
GTGGCGCACG 
GGCACCGAGG 
GTGCACATCG 
CACCTGAGCG 
GACTCGCTCA 
CGCTTCCTGC 
CTGGAGTTCC 
TCGGGGCCCA 
TTTGCTCTGG 
TCCACGCGCC 
TTCCAGCTGG 
TACTGGACGC 
TGCTACTTTG 
TTTGTGACCT 
GAGCTCTTCC 
TTCATCGGCT 
TTCCAGCTGG 
ACGGTGGGCA 
GAGTTCTGCG 
CACGCAGGCG 
TAGGGCCGCC 
CTAACCCCTT 
CTGGAAACCC 
CCTCCGCCCG 



41 

I 

CAGGGGCGCC 
CTACTGCCAC 
TCAACTGCGG 
CCAGCAGCCT 
GCGCGGCCGT 
ACGTGACCTG 
ACGACGGTCG 
ACCGCCTGTC 
CCATGCACCC 
CGCGGCCGGC 
TCACCCTCGC 
GCCACGACGG 
GCTCCGGCAA 
GCGGCACGCT 
AGCAGAGCTG- 
AGGGTATGGA 
AGATCGACCG- 
TGACGGCCAC 
ACATCGAGTG • 
CCAAGAAGAA 
TCATGAAGCT 
GCCGCAAGGT 
AGTTCAACGA- 
GTGACTCCGC 
ACTATAACAA 
TCCTGAAGGC 
CCGTCCTTCC 
CTCCGCCAGG 
CAGAGAAAAC 
GGTTCCCTAC 



51 
I 

GCCGCAGGGA 
CATGACCGCC 
CAACAAGTAC 
GAAGAAGAAG 
GTGCCTGCGC 
CGAGCGCGAG 
CTGGTCGCTG 
CTGCTTCGCG 
TCAGGTCAAC 
CGACGAGATC 
CTTCCAGGAC 
GCGCCTGGTG 
GGTGGCCTTC 
CAAGGCGGGC 
CGCCCAGGTC 
CCTGTCTGCC 
CGACACCAAA 
CGGGGGCGTG 
GCGTGACCGG 
TGGGCAGCTG 
CATCAACCGC 
CACGGGCACC 
TGGCGCCTAC 
GGTCACCAGC 
GGTGGCCATC 
CTCGGCGGAA- 
CCGCCCCTGC 
TGGGCTCCAG 
GGTGCCCCCA 
TCCCCTCGGG 



2340 
2400 
2460 
2520 
2580 
2640. 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 



60 
120 
. 180 
.240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
■ 540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 ' 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



314 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

TCAGCGGCTG CGGCCTGGCC. 
CGGGGCGAGT CTGGCACCTC 
GAAGCGGCTA AGGGACGGTT 
TTTGCCTCTC CCAGCCACCT 
CTGTCAGTGG CCCTCCCTGG 
CGGGAGGGCT AGGACTGACC 
CTCCCAGGTG GGAGAGGCTC 
ACAGGGTCTG CCOGCTGCAC 
GGGCCGTCTT CCTCCTGTCT; 
CAAATCAGTA TTTTTTTTAA- 
GTAGTAGCGA - GTGATCTGGC: 
CCCCCTCTTT CCGTCCTTCC 
GCCAGAGCCC CTGCTGTGAT 
CGCCCCCTCC GGGAGCCCTG 
TCCCCAACAT GCATCTCACT; 
TATAACTCTA AACGCCCATG 
AGTCTGC 



PCT/US02/12476 



CTGGGAGGGA 
TTTCTTCTGA 
GGGGGCTGGG 
CCTCCCAGCC- 
TGCACTGTCC 
CTTG TGGTGT 
AGCCTGGCTC 
GTTCTGCCAA 
CTTTCCTTTC' 
TGAAATATTA 
GGGGGGCGTC 
CGTCCAGCCC 
TGGTGCTCCC 

gggtgagccg; 
ctgggtgtct 

ATAGTAGCTT 



TTTCAGATGC 
CCTCAGAOGG 
AGCCCTGGGC 
CCCCAGGAGA 
CCGAAACCCC 

CCTTCCCTGG 
GGTGGTGGTG 
ACCCTAGCCT 
TTGCTGGAGG 
TCAGCACOCT. 
CAGCCCT6GG. 
TGGGCCTCCC 
CCGGGGCCCC 
TGGTCTTTTA 
CAAACTGGAA 



CCCTGCCCTC 
CTCTGAGCCT 
GTGTAGTGTA 
GCTGGGCACA 
TGCTTGGGAA 
GGTGGCTGGA 
AGCGGCAGGG 
GCGGGCGGGT 
GACTGGAAGC 
CGTCCCAGGC 
CCCCAGGGGG. 
CCTGGGCTGC 
GGGTGGATGA 
CCTGCTGCCA 
TTTTTTGTAA 
ATAGCGAAAT 



TTGTCTGCCA 
TATTTCTCTG 
ACTGGAATCT 
TGTCCCAAGC- 
GGGAAGCTGT 
AACAGCCCCT. 
CGTGACGGCC 
AGGGGTGTGG 
AGAAAATGAC 
AAGCCTGGCT 
TGCATCTCAG 
OGACACCTGG 
AGCCAGGCGT 
GCCTCCCCCG 
GTGTCATTTG 
AAAATAACTC 



I860 
1920 
1980 
2040 
2100 
2160- 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



Seq ID NO: 341 Protein sequence 
Protein Accession 8: NP_003079 



MTANGTAEAV 
CLRSHLGRYL 
CFAQTVSPAE: 
PQDQRYSVQT 
KAGKATKVGK 
DTKKCAFRTH 
GQLAASVETA" 
GAYNIKDSTG 
SABTVDPASL 



11 
I 

QIQFGLINCG 
AADKDGNVTC 
- KWSVHIAMHP 
ADHRFLRHDG- 
DELPALEQSC' 
TGKYWTLTAT 
GDSSLFLMKL 
KYWTVGSDSA 
WEY 



21. 
I 

NKYLTAEAFG 
EREVPGPDCR 1 ' 
QVNIYSVTRK' 
RLVARPEPAT- 
AQWLQAANE, 
GGVQSTASSK 
INRPIIVFRG 
VTSSGDTPVD 



31 
I 

FKVKASASSL 
FLIVAHDDGR 
RYAHLSARPA 
GYTIiEFRSGK 
RNVSTRQGMD 
NASCYFDIEW 
EHGFIGCRKV 
FFFEFCDYNK 



41 
I 

KKKQIWTLEQ 
WSLQSEAHRR 
DEIAVDRDVP 
VAFRDCEGRY 
LSANQDEETD 
RDRRITLRAS 
TGTLDANRSS 
VAIKVGGRYL 



51 
I 

PPDEAGSAAV 
YFGGTEDRLS 
WGVDSLITLA 
LAPSGPSGTL 
QETFQLEIDR 
NGKFVTSKKN 
YDVFQLEFKD 
KGDHAGVLKA 



Seq ID NO: 342 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence : 6 6 0 .. 1 7 0 5 



1 
I 

CGCTCCGCAC 
GGGATTCTTG. 
CCGGGTTGGG 
CTCGAGCGGG 
CCGGCGTTCG 
CACCCCACTG 
GCTACCATGA 
ACCGTGCGTG: 
CCCGGAGGGG 
TGCACCCTGG 
CTCGAAAGCG 
TGCGCGGTAC 
ACCTGCGCGC: 
GCCCCGGGGC 
TGGACTTCAG- 
CAGTTACTTG; 
TGTGTCCCTG 
TAGACGACTT 
GCCGCTCTTG 
CCAGGCGCCC 
TCGACGAGAA 
TTCCTGAGAT 
AAGCCGAGTC 
CGACTTCCTC 
TGAGCACAGC 
TCTGCTTTCA 
TGGAGAGTGA 
GGGTGAAAGT 
CCCCTCTTGG 



11 
I 

ACATTTCCTG 
GCAGTTGGGG" 
GAAGCCAGCT* 
ACAGATCCAA 
CCCTGTGCCT 
CCGACCGTGC 
AGCGGCAGGG 
CGGGCGCCGA 
GCTCCAAAGA 
AGAACGAGCC 
ACACGCTGCA 
TCCAGGCCAC 
CAACGGCTAC 
CGCCTCTAAC 
TCCACCTGGG 
CATCGCGGAC 
CCCCGGGAGG 
GGGAGGCTTT, 
TGTGACCAGT- 
GCCGGCCACT 
GCTGGGAGAG 
TCCTCGATGG 
AAAGGCCACT 
TGCCACTCCT 
AGTAGTAGTG 
CGAAAGCCCC 
TCCTGAGCCC 
CGGGGACTGT 
CTCTAGTGAT 



21' 

1 = 

TCGCGGCCTA 
GGTCCGTCGG 
GTAGAGGGCG 
GTTGGGAGCA 
CCTCTGGCAG 
TGGCTGCTCG 
GGCCGAGGAG 
GCTGCGCGCT 
CCTGCTGTTC 
TTTGCGGGGT 
GTGGGTGGAG 
CGGTGGGGTC 
CTGTGCAAGT 
TTGAGCTATC 
ACCGAGGTGA 
GAAATCGGCG 
TACCTCCGTG 
GCCTGCGAAT; 
GGGGAAGGAC 
GCAACCAGCC 
ACACCACTTG 
GGATCACAGA 
ATCACCCCAT 
CAGGCTTTCG 
TTGGTGATCT 
TCTTCCCAGC 
GCTGCTTTGG 
GATCTGCGGG 
G CAT AG 



31 
I 

AGGGAAACTG 
GAGCGAGGGC 
GTGACCGCGC 
GCTCTGCGTG 
GCGCTCTGGC 
GCCTCGGGGG 
GCCTGCATCC 
GTGCTCGCGC 
TGGGTCGCAC 
TTCTCCTGGC 
GAGCCCCAAC 
GAGCCCGCAG 
ACCAGTTTGA 
GCGCGCCCTT 
GTGCGCTCTG 
CTCGCTGGGA 
CTGGCAAATG 
GTGCTACGGG 
AGCCGACCCT. 
CCGTGCCGCA 
TCCCTGAACA- 
GCACGATGTC 
CAGGGAGCGT- 
ACTCCTCCTC 
TGACCATGAC 
CAAGGAAGGA 
GCTCCAGTTC 
ACAGAGCAGA 



41 
I 

TTGGCCGCTG 
GGAGGGGAAG 
TCCAGACACA 
CGGGGCCTCA 
CCGGGCCGGG 

cctgctacag: 
tgcgaggtgg 
tcctgcgggc 

TGGAGCGCAG 
TGTCCTCCGA 
GCTCCTGCAC 
CTGGAAGGAG 
GGTCTTGTGT 
CCAGCTGCAC 
CCGGGGACAG. 
CAAACTCTCG 
CGCAGAGCTC 
CTTCGAGCTG: 
TGGGGGGACC 
GAGAACATGG 
AGACAATTCA 
TACCCTTCAA 
GATTTCCAAG 
TGCCGTGGTC 
AGTACTGGGG 
GTCTATGGGC 
TGCACATTGC 
GGGTGCCTTG 



51 - 
I 

GGCCCGCGGG 

GGAGGGGGAA 

GCTCTGCGTC 

GAGAATGAGG- 

CGGCGGCGAA- 

CCTGCACCAC ■ 

GGCGCTCAGC 

AGGCCCAGGG. 

GCGTTCCCAC 

CCCCGGCGGT 

CGCGCGGAGA 

ATGCGATGCC" 

CCTGCGCCGC 

AGCGCCGCTC 

CTCCCGATCT 

GGCGATGTGT: 

CCTAACTGCC 

GGGAAGGACG 

GGGGTGCCCA 

CCAATCAGGG 

GTAACATCTA 

ATGTCCCTTC 

TTTAATTCTA 

TTCATATTTG 

CTTGTCAAGC 

CCGCCGGGCC 

ACAAACAATG 

CTGGCGGAGT 



Seq ID NO: 343 Protein sequence 
Protein Accession #: FGENESH predicted 



1 
I 

MGKDFMTKTP 
XARIYKELEQ 
GLGKPAVEGG 
PHCRPCWLLG 
RRGLQRPAVL 
RGTPGHRWGR 
DFSPPGTEVS 
DDLGGFACEC 
DEKIiGETPLV 
TSSATPQAFD 
ESDPEPAAUG 



11 

i 

KAFATKAKID 
IYKKKKPTKT 
DRAPDTAliRP 
LGGLLQPAPR 
GRTGAQAFPL 
ARSHKEMRCH 
ALCRGQLPIS 
ATGFELGKDG 
PEQDNSVTSI 
SSSAWFIFV 
SSSAHCTNNG 



21 

f: ■. 

KWDLIKUCSF 
LRTHFLSRPK 
RAGQIQVGSS 
YHEAAGGRGG 
HPGERAFAGF 
LRANGYLCKY 
VTCIADEIGA 
RSCVTSGEGQ 
PEIPRWGSQS 
STAWVLVIL 
VKVGDCDIiRD 



31 
I 

CTAKETI IRV 
GNCWPLGPRG 
SACGASENEA 
LHPARWGAQH 
LLAVLRPRRS 
QFEVLCPAPR 
RWDKLSGDVL 
PTLGGTGVPT 
TMSTLQMSLQ 
TMTVLGLVKL 
RAEGALLAES 



41 
I 

NSQPTDWQKT 
DSWQLGGPSG 
GVRPVPPLAG- 
RACGRRAARC 
RKRHAAVGGG 
PGAASNLSYR 
CPCPGRYLRA 
RRPPATATSP 
AESKATITPS 
CFHESPSSQP 
PIiGSSDA 



51 
I 

FAIYPSDKGV 
ARAEGKGGGT 
AtiARAGRRRT 
ARAPAGRPRA 
APTLLHRAEM 
APFQLHSAAL 
GKCAELPNCL 
VPQRTWPIRV 
GSVISKFNST 
RKESMGPPGL 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



60 
120 
180- 
240 
300 
360 
420 
480 
540 
600 



315 



180 " 

240 

300 

360 

420 



13 80-' 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 



WO 02/086443 

Seq ID NO: 344 DNA sequence 
Nucleic Acid: Accession #:NM_0120?2 

Coding sequence: 149-2107 

1 ii 2i . r t " 

aaagccctca gcctttgtgt ccttctctgc gccggagtgg ctgcagctca • CCCCTCAGCT so 

CCCAGCTGGG AGCCGAGATA GAAGCTCCTG TCGCCGCTGG GCTTCTCGCC 120 
TCCCGCAGAG G^S AGACCGGGAT GGCCACCTCC ATGGGCCTGC TGCIGCIGCT 
GCTGCTGCTC CTGACCCAGC CCGGGGCGGG GACGGGAGCT GACACGGAGG CGGTGGTCTG 
SSSgS GCCTGCTACA CGGCCCACTC GGGCAAGCTG AGCGCTGCCG aggcccagaa 
CCACTGCAAC CAGAACGGGG GCAACCTGGC CACTGTGAAG AGCAAGGAGG AGGCCCAGCA 
CGTCCAGCGA GTACTGGCCC AGCTCCTGAG GOGGGAGGCA GCCCTGAOGG CGAGGATGAG ^ 
S^TCTGG ATTGGGCTCC AGCGAGAGAA GGGCAAGTGC CTGGACCCTA GTCTGCCGCT. 480 
S^ScTTC AGCTGGGTGG GCGGGGGGGA GGACACGCCT TACTCTAACT GGCACAAGGA S40 
£SUc TCGTGCATCT CCAAGCGCTG TGTGTCTCTG CTGCTGGACC TCTCCCAGCC 600 

^cotccc aaccgcctgc CCAAGTGGTC tgagcgcccc.tgtgggagcc caggctcccc mo 

SgSSaAC ATTGAGGGCT TCGIGTGCAA GTTCAGCTTC AAAGGCATGT GCOGGCCTCT 720 

S^TCGTC GGCCCAGGTC AGGTGACCTA CACCACCCCC TTCCASACCA CCAGTTCCTC 780 

CTTGGAGGCT GTGCCCTTTG CCTCTGOGGC CAATGTAGCC TGTGGGGAAG GTGACAAGGA 840 

rGAGACTCAG AGTCATTATT TCCK3TGCAA GGAGAAGGCC CCOGATGTGT/ TCGACTGGGG 900 

SgS^GC ^CTCTGTG tSgCCCCAA GTATGGCTGC AACTTCAACA- ATGGGGGCTG 960 

CCACCAGGAC TGC^TGAAG GGGGGGATGG CXCCTTCCTC TGOGGCTGCC GACCAGGATT 1020 

SScEa TGACCTGTGC CTCTCGAAAC, CCTTGCAGCr CCAGCCCATG. 1080 

TCGTGGGGGG . GCCACGTGOG TCCTGGGACC CCATGGGAAA' AACTACACGT GCCGCTGCCCr. 1140 

CCAAGGGTAC CAGCTGGACT CGAGTCAGCT GGACTGTGTG ' GACGTGGATG ^TGCCAGGA 200 

SdCCCrtGT. GCCCAGGAGT GTGTCAACAC CCCTGGGGGC TTCOGCTGCG AMGCTGGGT 1260 

SgSaTGAG CCGGGCGGTC CTGGAGAGGG GGCCTGTCAG;GATGTGGATG AOTOTGCTCT 1320 
gStCGCTM CCTTGOGCCC AGGGCTGCAC CAACACAGAT GGCTCATTTC ACTGCTCCTG 
" TACGTCCTGG CCGGGGAGGA CGGGACTO*. ™^<* £££££ 
TGTGGGCCCG GGGGGCCCCC TCTGCGACAG CTTGTGCTTC ^«^°. ??TSS!!^ 
CTGTGGCTGC CTGCCAGGCT GGGTGCTGGC CCCAAATGGG GTCTCTTGCA - CCATGGGGCC 

SacSccat ctgggccccc cgatgaggag.gaca«ggag agaaagaagg 

GAGCACCGTG CCCCGCGCTG CAACAGCCAG TCCCACAAGC GGCCCCGAGG GCACCCCCAA 

otcScaccc accaSaagta gaccttcgct gtcatctgac gcccccatca catctgcccc 

aSgSg CTGGCCCCCA . GTGGGTCCTC AGGCGTCTGG AGGGAGCCCA GCATCCATCA 

raScS gcctctggcc cccaggagcc tgcaggtggg gactcctccg tggccacaca 
SaSa^gS gSgacg ggcaaaagct gcttitattc tacatcctag gcaccgtggt 

GGCCATCCTA CTCCTGCTGG CCCTGGCTCT GGGGCTACTG GTCTATOGCA AGCGGAGAGC 

SaSS gIgSgaIgg agaagaagcc ccagaatgcg gcagacagtt actcctgggt 
tccj^agcga gctgagagca gggccatgga gaaccagtac. agtccgacac ctgggacaga 
ctotgaSg Stggcc ctagagacac tagagtcacc agccaccatc ctcagagctt 
™£-tcccc a^tccaaagg ggcacccaca tttttttgaa agactggact ggaatcttag- 2220 

^£a£g TaISct CCTTAAAGGC. CCCTTGGAAC ATGCAGGTAT TTrCTACGGG 28 

^TTCATGT TCCTGAAGTG . GAAGCTGTGT : GTTGGCGTGC CACGGTGGGG ATTTCGTGAC 2340 

JSaTAATGA TTGTTACTCG CCCTCCCTTT = TCAAATTCCA' ATGTGACCAA TTCCGGATCA 2400 

^GTOTCAGG AGGCTGG3GC T AAGGGGCT C CCCTGAATAT CTTCTCTGCT CACTTCCA^^ 2460 

ATCTAAGAGG AAAAGGTCAG TTGCTCATGC : TGATTAGGAT TGAAATGATT TGTTTCTCm 2S20 

CCTAGGATGA AAACTAAATC AATTAATTAT TCAATTAGGT AAGAAGATCT <™TGG 25B0 

TCAAAGGGAA CATGTTCGGA CTGGAAACAT TTCTTTACAT TTGCATTCCT CCATTTCGCC 2640 

S^SIaa?g tgatactgw GACATCCTCC AGAATGGCCA ^TGCAAJ I™ 

TAACCTCTTA GGTGGCAAGG AGGCAGGAAG IGCCTCTTTA GTTCTTACAT TTCTAATAGC 2760 
STCGGmA TTTGCAAAGG AAGCTTGAAA AATATGAGAA AAGTTGCTTG AAGTGCATTA 

SgS?g SSaca:taatctacgg ggciagggcg agagaggcca ^gatttgtt 

aCAGATACT TGAATTAATT CATCCAAATG TACTGAGGTT ACCACACACT TGACTACGGA 

^GaS ACTAACAAGG AAACAAATTC AAGGACAACC TGTCTTTGAG CCAGGGCAGG. 3000 

CCTCAGACAC CCTGCCTGTG GCCCOGCCTC: CACTTCATCC. TGCCCGGAAT GCCAGTGCTC 3060 

SI^AGA CAGAGGAAGC CCTGCAGAAA GTTCCATCAG - GCTGTTTCCT' AAAGGATGTG 3120 

TGAACGGGAG ATGATGCACT GTGTTTTGAA' AGTTGTCATT. TTAAAGCATT" TTAGCACACT' 3180 

^aS CAGTTGATGC AGCATCCTOA GATTTTAAAT.-CCTGAAGTGT CGCTGGOGCA- 3240 

S^ACCAMT- AGGGAGCTAG TCAGGCAGTT TGCTTAAGGA ACTTTTGTTC TCTGTCTCTT 3300 

TTCCTTAAAA TTGGGGGTAA GGAGGGAAOG AAGAGGGAAA GAGATGACTA ACTAAAATCA 3360 

SS^C IISa^.CAaXgCCATT. TAAATTATAT CCTCATTTTA AAACTTACAT 3420 

i^IiIliT TTCTCCCTAT GATAATGCAG TCGATAGTGT GCACTCTTTC TCTCTCTCTC 3480 

?SS^AC SSc A««aCAC ACACACACAC AGAGACACGG CACCATTCTG- 3540 

otSSSc T^gScACAT- TCCTGGGGGT. CACCGATGGT CAGAGTCACT AGAAGTTACC 3600 

Z^^C TGGGAGGCCT CATGTCTCCT GTGGGCTTTT TACCACCACT ^GCAGGAGA, 3660 

ACAGACAGAG GAAATGTGTC TCCCTCCAAO GCCCCAAAGC CICAGAGAAA OGGTOTTTCT 3720 

"CT TAGCAATGCA TCGGTCTCTG AGGTGACACT CTGGAGTGGT WAAGGGGCA 3 ,80 
CAAGGIGCAG GGTTAATACT CTTGCCAGTT. TTGAAATATA GATGCTATGG TTCAGATTGT 
TTTTAATAGA AAACTAAAGG GGCAGGGGAA GTGAAAGGAA AGATGGAGGT TTTGTGCGGC 
TCGATGGGGC ' ATTTGGAACT TCTTTTTAAA GTCATCTCAT GGTCTCCAGT TTTCAGTTGG 

Ia^^otg tttaacacw AAGGGAGACA aaggctgtgt ccatttggca aaaotcctt 

GGCCACGAGA CTCTAGGTGA TGTGTGAAGC TGGGCAGTCT GTGGTGTGGA GAGCAGCCAT 
CTGTCTGGCC ATTCAGAGGA TTCTAAAGAC ATCGCTGGAT. ?CGCTGCTGA CCAACATCAG 
CACTTAAATA AATGCAAATG CAACATTTCT CCCTCTGGGC CTTGAAAATC CTOOCCTTA 
^^GGGG TGAAGGAGAC" ATTTCTGTCC TTGGCTTCCC ACAGCCCCAA CGCAGTCTGT 

gSSS tgggatccaa cgagccctcc tattttcaca f r c J^ SStc 

GCCCAGGCCC ATOGTCTGTT CTCTGAATGC AGCCCTGTTC T»*"A^ 
GAACCCCTCT GTGGAACCCA CAAGGGGAGA AATGGGTGAT AAAGAATCCA GTTCCTCAAA 
^ccctg gcaggcSgg TCCCTCTCCT GCTGGGTGGT GCTTTCTCTT gcacaccact 

™ == s = 

CACAGTGGGG AATCCA^GG^ T^CAGTATG GGGAGAGGTG CACCCTGCCA CCTGCTAACT 4800 



2820 
2880 
2940 



3840 
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• TCTCGCTAGA CACAGTGTTT CTGCCCAGGT GACCTGTTCA GCAGCAGAAC AAGCCAGGGC 4860 

CATGGGGACG GGGGAAGTTT TCACTTGGAG ATGGACACCA AGACAATGAA GATTTGTTGT 4920 

CCAAATAGGT CAATAATTCT GGGAGACTCT TGGAAAAAAC TGAATATATT CAGGACCAAC 4980 

TCTCTCCCTC CCCTCATCCC ACATCTCAAA GCAGACAATG TAAAGAGAGA AC ATCTCACA 5040 

CACCCAGCTC GCCATGCCTA CTCATTCCTG AATTTCAGGT GCCATCACTG CTCTTTCTTT 5100. 
CTTCTTTGTC ATTTGAGAAA GGATGCAGGA GGACAATTCC CACAGATAAT CTGAGGAATG- 5160 

CAGAAAAACC AGGGCAGGAC AGTTATCGAC AATGCATTAG AACTTGGTGA GCATCCTCTG 5220 

TAGAGGGACT. CCACCCCTGC TCAACAGCTT- GGCTTCCAGG CAAGACCAAC CACATCTGGT 5280 

CTCTGCCTTC GGTGGCCCAC ACACCTAAGC GTCATCGTCA TTGCCATAGC ATCATGATGC 5340 
AACACATCTA CGTGTAGCAC TACGACGTTA TGTTTGGGTA ATGTGGGGAT GAACTGCATG ." 5400 

AGGCTCTGAT TAAGGATGTG. GGGAAGTGGG. CTGCGGTCAC TGTCGGCCTT GCAAGGCCAC 5460 

CTGGAGGCCT. GTCTGTTAGC CAGTGGTGGA GGAGCAAGGC TTCAGGAAGG GCCAGCCACA 5520 

TGCCATCTTC CCTGCGATCA' GGCAAAAAAG TGGAATTAAA ; AAGTCAAACC TTTATATGCA 5580 

TGTGTTATGT CCATTTTGCA GGATGAACTG AGTTTAAAAG AATTTTTTTT TCTCTTCAAG 5640 

TTGCTTTGTC TTTTCCATCC .TCATCACAAG CCCTTGTTTG AGTGTCTTAT" CCCTGAGCAA 5700 

TCTTTCGATG-- GATGGAGATG ATCATTAGGT ACTTTTGTTT CAACCTTTAT TCCTGTAAAT 5760 

ATTTCTGTGA AAACTAGGAG AACAGAGATG • AGATTTGACA AAAAAAAATT GAATTAAAAA 5820. 

TAACACAGTC , TTTTTAAAAC TAACATAGGA AAGCCTTTCC TATTATTTCT CTTCTTAGCT 5880 

TCTCCATTGT. CTAAATCAGG AAAACAGGAA* AACACAGCTT TCTAGCAGCT GCAAAATGGT 5940 

TTAATGCCCC CTACATATTT CCATCACCTT GAACAATAGC TTTAGCTTGG GAATCTGAGA 6000 

TATGATCCCA'GAAAACATCT GTCTCTACTT CGGCTGCAAA ACCCATGGTT, TAAATCTATA 6060 

TGGTTTGTGC ATTTTCTCAA CTAAAAATAG AGATGATAAT CCGAATTCTC CATATATTCA 6120 

CTAATCAAAG ACACTATTTT, CATACTAGAT TCCTGAGACA AATACTCACT' GAAGGGCTTG 6180 

TTTAAAAATA AATTGTGTTT TGGTCTGTTC TTGTAGATAA TGCCCTTCTA TTTTAGGTAG 6240 

AAGCTCTGGA ATCCCTTTAT, TGTGCTGTTG CTCTTATCTG CAAGGTGGCA AGCAGTTCTT 6300 

TTCAGCAGAT TTTGCCCACT. ATTCCTCTGA GCTGAAGTTC. TTTGCATAGA TTTGGCTTAA 6360 

GCTTGAATTA GATCCCTGCA- AAGGCTTGCT CTGTGATGTC AGATGTAATT GTAAATGTCA 6420 

GTAATCACTT CATGAATGCT AAATGAGAAT GTAAGTATTT TTAAATGTGT GTATTTCAAA 6480 

TTTGTTTGAC TAATTCTGGA ATTACAAGAT TTCTATGCAG GATTTACCTT CATCCTGTGC 6540 

ATGTTTCCCA' AACTGTGAGG ; AGGGAAGGCT CAGAGATCGA GCTTCTCCTC TGAGTTCTAA 6600- 

■CAAAATGGTG" CTTTGAGGGT CAGCCTTTAG GAAGGTGCAG CTTTGTTGTC CTTTGAGCTT 6660 
TCTGTTATGT GCCTATCCTA ATAAACTCTT AAACACATT 

Seq ID NO: 345 Protein sequence 
Protein Accession #: ." NP^036204 

1 11 21 31 41 51 

I I I 1 I . I 

MATSMGLLLL LLLLLTQPGA GTGADTEAW CVGTACYTAH SGKLSAAEAQ NHCNQNGGNL 60 

ATVKSKEEAQ HVQRVLAQLL RREAALTARM- SKFWIGLQRE KGKCLDPSLP LKGFSWVGGG 120 

. EDTPYSNWHK: ELRNSCISKR CVSLLLDLSQ PLLPNHLPKW SEGPCGSPGS PGSNIEGFVC 180 

KFSFKGMCRP IiALGGPGQVT YTTPFQTTSS SLEAVPFASA ANVACGEGDK DETQSHYTLC 240 

KEKAPDVFDW- GSSGPLCVSP KYGCNFNNGG- CHQDCFEGGD- GSFLCGCRPG FRLLDDLVTC 3 00 

ASRNPCSSSP. CRGGATCVLG PHGKNYTCRC PQGYQLDSSQ LDCVDVDECQ DSPCAQECVN 360 

TPGGFRCECW-VGYEPGGPGE GACQDVDECA LGRSPCAQGC TNTDGSFHCS CEEGYVLAGE 420 

DGTQCQDVDE : CVGPGGPLCD SLCFNTQGSF HCGCLPGWVIi APNGVSCTMG. PVSLGPPSGP 480 

PDEEDKGEKE GSTVPRAATA SPTRGPEGTP KATPTTSRPS LSSDAPITSA PLKMLAPSGS 540 

SGVWREPSIH- HATAASGPQE PAGGDSSVAT QNNDGTDGQK tiLLFYILGTV VAILLLLALA 600 
LGLLVYRKRR AKREEKKEKK PQNAADSYSW VPERAESRAM ENQYSPTPGT DC 

Seq ID NO:. 346 DNA sequence 
Nucleic Acid Accession #: Z31560 
Coding sequence: <l-966 . 

1 11 .21 . 31 41 51 

i i i ; i i r 

CACAGCGCCC GCATGTACAA CATGATGGAG ACGGAGCTGA AGCCGCCGGG CCCGCAGCAA 60 

ACTTCGGGGG GCGGCGGCGG CAACTCCACC GCGGCGGCGG CCGGCGGCAA CCAGAAAAAC 120 

AGCCCGGACC GCGTCAAGCG GCCCATGAAT GCCTTCATGG TGTGGTCCCG CGGGCAGCGG 180 

CGCAAGATGG CCCAGGAGAA CCCCAAGATG CACAACTCGG AGATCAGCAA GCGCCTGGGC 240 
GCCGAGTGGA- AACTTTTGTC GGAGACGGAG AAGCGGCCGT TCATCGACGA GGCTAAGCGG . 300 

CTGCGAGCGC TGCACATGAA GGAGCACCCG GATTATAAAT ACCGGCCCCG GCGGAAAACC 360 

AAGACGCTCA TGAAGAAGGA TAAGTACACG CTGCCCGGCG GGCTGCTGGC CCCCGGCGGC 420 

AATAGCATGG CGAGCGGGGT CGGGGTGGGC GCCGGCCTGG GCGCGGGCGT GAACCAGCGC 480 

ATGGACAGTT ACGCGCACAT GAACGGCTGG AGCAACGGCA GCTACAGCAT GATGCAGGAC 540 

CAGCTGGGCT ACCCGCAGCA CCCGGGCCTC AATGCGCACG GCGCAGCGCA GATGCAGCCC 600 

ATGCACCGCT ACGACGTGAG CGCCCTGCAG TACAACTCCA TGACCAGCTC GCAGACCTAC 660 

ATGAACGGCT CGCCCACCTA CAGCATGTCC TACTCGCAGC AGGGCACCCC TGGCATGGCT 720 

CTTGGCTCCA TGGGTTCGGT GGTCAAGTCC GAGGCCAGCT CCAGCCCCCC TGTGGTTACC 780 

TCTTCCTCCC ACTCCAGGGC GCCCTGCCAG GCCGGGGACC TCCGGGACAT GATCAGCATG 840 

TATCTCCCCG GCGCCGAGGT GCCGGAACCC GCCGCCCCCA GCAGACTTCA CATGTCCCAG 900 

CACTACCAGA GCGGCCCGGT GCCCGGCACG GCCATTAACG GCACACTGCC CCTCTCACAC 960 

ATGTGAGGGC CGGACAGCGA ACTGGAGGGG GGAGAAATTT TCAAAGAAAA ACGAGGGAAA 1020 

TGGGAGGGGT GCAAAAGAGG AGAGTAAGAA ACAGCATGGA GAAAACCCGG TACGCTCAAA 1080 
AAAAA 

Seq ID NO: 347 Protein sequence/ 
Protein Accession CAA83435 

1 11 " ' ii '.* / 31 .41 51 

i - 1 "* ' • .. i - - i i i 

HSARMYNMME TELKPPGPQQ TSGGGGGNST AAAAGGNQKN SPDRVKRPMN AFMVWSRGQR 60 

RKMAQENPKM HNSEISKRLG AEWKLLSETE KRPFIDEAKR LRALHMKEHP DYKYRPRRKT- 120 

KTLMKKDKYT LPGGLLAPGG ' NSMASGVGVG AGLGAGVNQR MDSYAHMNGW SNGSYSMMQD 180 

QIjGYPQHPGL NAHGAAQMQP MHRYDVSALQ YNSMTSSQTY MNGSPTYSMS YSQQGTPGMA '240 

LGSMGSWKS EASSSPPWT SSSHSRAPCQ AGDLRDMISM YLPGAEVPEP AAPSRLHMSQ 300 
HYQSGPVPGT A2NGTLPLSH M . 
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Seq ID NO: 348' DNA~ sequence ' 
Nucleotide.*^ NM_002638 ' 

Coding sequence: \, 120-473 

1 11 21-31 .41 51 ' 

L : I .;. : . '| -. : I 1 I 

CAATACAGCT AAGGAATTAT CCCTTGTAAA TACCACAGAC CCGCCCTGGA GCCAGGCCAA GO 

GCTGGACTGC ATAAAGATTG GTATGGCCTT AGCTCTTAGC CAAACACCTT CCTGACACCA 120 

" TGAGGGCCAG CAGC TT C TTG ATCGTGGTGG TGTTCCTCAT CGCTGGGACG CTGGTTCTAG 180 
AGGCAGCTGT CACGGGAGTT ' CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTGTTCCAT.. 240 

TCAATGGACA- AGATCCCGTT AAAGG ACAAG x TTTCAGTTAA" AGGTCAAGAT AAAGTCAAAG - 300 

CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC ATT ATCTTGA 360 

TCCGGTGCGC CATGTTGAAT. CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 420 

TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA" TGGCCTGTTT CGTTCCCCAG TGAAGGGAGC 480 

CGGTCCTTGC, TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT - CCTAAGTCCC 540 

TGCTGCCCTT XCCCTTCCCA ;CACTGTeCAT TCTTCCTCCC ATTCAGGATG CCCACGGCTG S0O 
GAGCTGCCTC TCTCATCCAC TTTCCAATAA _ A 

Seq ID NO: 349 Protein sequence: 
Protein Accession #: NP_002629 

1 11 21 31 41 51 

I I ■ \ I I - I 

MRASSFLIW VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 60 
AQEPVKGPVS TKPGSCPIIL ^ IRCAMLNPPN; RCLKDTDCPG IKKCCEGSCG MACPVPQ 

Seq ID NO: 350 DNA sequence 
Nucleic Acid Accession 8: NM_007183 
Coding sequence: 75-2468 

1 11 21 31 41 51 

I | • I i I I 

GAATTCCGGA CAGGACGTGA AGATAGTTGG GTTTGGAGGC GGCCGCCAGG CCCAGGCCCG 60 

GTGGACCTGC CGCCATGCAG GACGGTAACT TCCTGCTGTC GGCCCTGCAG CCTGAGGCCG 120 

GCGTGTGCTC CCTGGCGCTG CCCTCTGACC TGCAGCTGGA CCGCCGGGGC GCCGAGGGGC 180 

CGGAGGCCGA GCGGCTGCGG. GCAGCCCGCG TCCAGGAGCA GGTCCGCGCC OGCCTCTTGC . 240 

AGCTGGGACA- GCAGCCGCGG CACAACGGGG CCGCTGAGCC CGAGCCTGAG GCCGAGACTG 300 

CCAGAGGCAC ATCCAGGGGG 1 CAGTACCACA' CCCTGCAGGC TGGCTTCAGC TCTCGCTCTC 360 

AGGGCCTGAG TGGGGACAAG. ACCTCGGGCT tCCGGCCCAT CGCCAAGCCG , GCCTACAGCC 420 

CAGCCTCCTG GTCCTCCCGC TCCGCCGTGG ATCTGAGCTG CAGTCGGAGG CTGAGTTCAG 480 

CCCACAATGG GGGCAGCGCC TTTGGGGCCG' CTGGGTACGG, GGGTGCCCAG CCCACCCCTC . 540 

CCATGCCCAC CAGGCCCGTG TCCTTCCATG AGCGCGGTGG GGTTGGGAGC CGGGCCGACT-. 600 

ATGACACACT CTCCCTGCGC TCGCTGCGGC TGGGGCCCGG- GGGCCTGGAC GACCGCTACA 660 

GCCTGGTGTC TGAGCAGCTG GAGCCCGCGG CCACCTCCAC CTACAGGGCC TTTGCGTACG. 720 

AGCGCCAGGC CAGCTCCAGC TCCAGCCGGG CAGGGGGGCT GGACTGGCCC GAGGCCACTG 780 

AGGTTTCCCC GAGCCGGACC ATCCGTGCCC CTGCCGTGCG GACCCTGCAG CGATTCCAGA 840 

GCAGCCACCG GAGCCGCGGG GTAGGCGGGG CAGTGCCGGG, GGCCGTCCTG GAGCCAGTGG 900 

CTCGAGCGCC ATCTGTGCGC AGCCTCAGCC TCAGCCTGGC TGACTCGGGC CACCTGCCGG 960 

ACGTGCATGG GTTCAACAGC TACGGTAGCC ACCGAACCCT GCAGAGACTC AGCAGCGGTT 1020 

TTGATGACAT TGACCTGCCC TCAGCAGTCA AGTACCTCAT GGCTTCAGAC CCCAACCTGC 1080 

AGGTGCTGGG AGCGGCCTAC ATCCAGCACA AGTGCTACAG CGATGCAGCC GCCAAGAAGC 1140 

AGGCCCGCAG CCTTCAGGCC GTGCCTAGGC TGGTGAAGCT CTTCAACCAC GCCAACCAGG 1200 

AAGTGCAGCG CCATGCCACA GGTGCCATGC GCAACCTCAT CTACGACAAC GCTGACAACA 1260 

AGCTGGCCCT GGTGGAGGAG AACGGGATCT TCGAGCTGCT GCGGACACTG CGGGRGCAGG 1320 

ATGATGAGCT TCGCAAAAAT GTCACAGGGA TCCTGTGGAA CCTTTCATCC AGCGACCACC 1380 

TGAAGGACCG: CCTGGCCAGA GACACGCTGG AGCAGCTCAC GGACCTGGTG TTGAGCCCCC 1440 

TGTCGGGGGC TGGGGGTCCC CCCCTCATCC AGCAGAACGC CTCGGAGGCG GAGATCTTCT 1500 

ACAACGCCAC CGGCTTCCTC AGGAACCTCA GCTCAGCCTC TCAGGCCACT CGCCAGAAGA 1560 

TGCGGGAGTG CCACGGGCTG GTGGACGCCC TGGTCACCTC TATCAACCAC GCCCTGGACG 1620 

CGGGCAAATG CGAGGACAAG AGCGTGGAGA ACGCGGTGTG CGTCCTGCGG AACCTGTCCT 1680 

ACCGCCTCTA CGACGAGATG CCGCCGTCCG CGCTGCAGCG GCTGGAGGGT CGCGGCCGCA 1740 

GGGACCTGGC GGGGGCGCCG CCGGGAGAGG TCGTGGGCTG CTTCACGCCG CAGAGCCGGC 1800 

GGCTGCGCGA GCTGCCCCTC GCCGCCGATG CGCTCACCTT CGCGGAGGTG TCCAAGGACC 1860 

CCAAGGGCCT CGAGTGGCTG TGGAGCCCCC AGATCGTGGG GCTGTACAAC CGGCTGCTGC 1920 

AGCGCTGCGA GCTCAACCGG CACACGACGG AGGCGGCCGC CGGGGCGCTG CAGAACATCA 1980 

" CGGCAGGCGA CCGCAGGTC3G GCGGGGGTGC TGAGCCGCCT GGCCCTGGAG CAGGAGCGTA 2040 

TTCTGAACCC CCTGCTAGAC CGTGTCAGGA CCGCCGACCA CCACCAGCTG CGCTCACTGA 2100 

CTGGCCTCAT CCGAAACCTG TCTCGGAACG CTAGGAACAA GGACGAGATG TCCACGAAGG 2160 

TGGTGAGCCA CCTGATCGAG AAGCTGCCAG GCAGCGTGGG TGAGAAGTCG CCCCCAGCCG 2220 

AGGTGCTGGT CAACATCATA GCTGTGCTCA ACAACCTGGT GGTGGCCAGC CCCATCGCTG , 2280 

CCCGAGACCT GCTGTATTTT GACGGACTCC GAAAGCTCAT CTTCATCAAG AAGAAGCGGG 2340 

ACAGCCCCGA CAGTGAGAAG TCCTCCCGGG CAGCATCCAG CCTCCTGGCC AACCTGTGGC 2400 

AGTACAACAA GCTCCACCGT GACTTTCGGG CGAAGGGCTA TCGGAAGGAG GACTTCCTGG 2460 

GCCCATAGGT GAAGCCTTCT GGAGGAGAAG GTGACGTGGC CCAGCGTCCA AGGGACAGAC 2520 

TCAGCTCCAG GCTGCTTGGC AGCCCAGCCT GGAGGAGAAG GCTAATGACG GAGGGGCCCC 2580 

TCGCTGGGGC CCCTGTGTGC ATCTTTGAGG GTCCTGGGCC. ACCAGGAGGG GCAGGGTCTT 2640 
ATAGCTGGGG ACTTGGCTTC CGCAGGGCAG GGGGTGGGGC * AGGGCTCAAG GCTGCTCTGG "2700 

TGTATGGGGT GGTGACCCAG TCACATTGGC AGAGGTGGGG GTTGGCTGTG GCCTGGCAGT 2760 

ATCTTGGGAT AGCCAGCACT GGGAATAAAG ATGGCCATGA ACAGTCACAA AAAAAAAAAA 2820 
AAAAGGAATT C 



Seq ID NO: 3S1 Protein sequence " 
Protein Accession : 8 : . NP_009114.1 

1 11 21 31 41 .51 
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MQDGNFLLSA LQPEAGVCSL ALPSDLQLDR RGAEGPEAER LRAARVQEQV RARLLQLGQQ 
PRHNGAAEPB PEAETARGTS- RGQYHTLQAG FSSRSQGLSG DKTSGFRPIA KPAYSPASNS 
SRSAVDLSCS RRIiSSAHNGG SAFGAAGYGG AQPTPPKPTR PVSFHERGGV GSRADYDTfcS 
5 LRSLRLGPGG LDDRYSLVSE QLEPAATSTY RAFAYERQAS SSSSRAGGLD WPEATBVSPS 
RTIRAPAVRT- LQRFQSSHRS RGVGGAVPGA VLEPVARAPS VRSLSLSLAD SGHLPDVHGP 
NSYGSHRTLQ RLSSGFDDID LPSAVKYLMA SDPNLQVLGA AYIQHKCYSD AAAKKQARSL 360 
QAVPRLVKLF NHANQEVQRH ATGAMRNL I Y DNADNKLALV EENGIFELLR TLRBQDDELR 420 
KNVTGILWNL SSSDHLKDRL ARDTIiEQLTD LVLSPLSGAG. GPPMQQNAS: EAEIFYNATG . 480 
1 0 " FLRNLSSASQ: ATRQXKRSCH GLVDALVTSI NHALDAGKCE DKSVENAVCV. LRNLSYRLYD 540 
EMPPSALQRL. EGRGRRDLAG . APPGEWGCF. TPQSRRLREL PLAADALTFA EVSKDPKGLB. 600 
WLWSPQIVGL YNRLLQRCEL NRHTTEAAAG ALQNITAGDR RWAGVLSRIA' LEQERILNPL 660 
■ LDRVRTADHH - QLR SLTGL I R NLSRNARNKD EMSTKWSHL IEKLPGSVGE KSPPAEVLVN 720 
IIAVLNNLW ASPIAARDLL YFDGLRKLIF IKKKRDSPDS EXSSRAASSL LANLWQYNKL 780 
■ 15 HRDFRAKGYR KEDFLGP 

Seq ID NO: 352 DNA sequence 
Nucleic Acid Accession #: M31469 
20 Coding sequence: 1-651 

1 11 21 31 41 51 

ATGGCTGCGC AGGGAGAGCC CCAGGTCCAG TTCAAACTTG TATTGGTTGG TGATGGTGGT 60 

25 • ACTGGAAAAA CGACCTTCGT GAAACGTCAT- TTGACTGGTG AATTTGAGAA GAAGTATGTA 120 

GCCACCTTGG GTGTTGAGGT TCATCCCCTA: GTGTTCCACA CCAACAGAGG ACCTATTAAG 180 

TTCAATGTAT GGGACACAGC CGGCCAGGAG - AAATTCGGTG GACTGAGAGA TGGCTATTAT 240 

ATCCAAGCCC AGTGTGCCAT CAT AATGTTT GATGTAACAT CGAGAGTTAC TTACAAGAAT 300 

GTGCCTAACT- GGCATAGAGA TCTGGTACGA GTGTGTGAAA ACATCCCCAT TGTGTTGTGT 360 

30 GGCAACAAAG TGGATATTAA GGACAGGAAA GTGAAGGCGA AATCCATTGT CTTCCACCGA 420 

AAGAAGAATC TTCAGTACTA CGACATTTCT GCCAAAAGTA ACTACAACTT TGAAAAGCCC 480 

TTCCTCTGGC TTGCTAGGAA GCTCATTGGA GACCCTAACT- TGGAATTTGT TGCCATGCCT 540 

GCTCTCGCCC CACCAGAAGT TGTCATGGAC CCAGCTTTGG CAGCACAGTA TGAGCACGAC 600 
TTAGAGGTTG CTCAGACAAC TGCTCTCCCG GATGAGGATG ATGACCTGTG A 



Seq ID NO:> 353 Protein sequence 
Protein Accession 8: • AAA36546 



1 11 21 f 1 i 1 r 

MAAQGEPQVO FKLVLVGDGG TGKTTFVKRH LTGEFEKKYV ATLGVEVHPL VFHTNRGPIK 60 

FNVWDTAGQE. KFGGLRDGYY IQAQCAIIMF DVTSRVTYKN VPNWHRDLVR VCENIPIVLC 120 

GNKVDIKDRK VKAKSIVFHR KKNLQYYDIS AKSNYNFEKP FLWtARKLiIG DPNLEFVAMP 180 

ALAPPEWMD PALAAQYEHD LEVAQTTALP DEDDDI* 



Seq ID NO: 354 DNA sequence 
Nucleic Acid Accession 8: NM_002820 
Coding sequence.: 304-831 



i u 21 31 41. 51 

CCGGTTCGCA AAGAAGCTGA CTTCAGAGGG GGAAACTTTC TTCTTTTAGG AGGCGGTTAG 60 

CCCTGTTCCA CGAACCCAGG AGAACTG CTG GCCAGATTAA TTAGACATTG CTATGGGAGA 120 

55 CGTGTAAACA CACTACTTAT. CATTGATGCA TATATAAAAC* CATTTTATTT TGGCTATTAT: 180 

TTCAGAGGAA- GCGCCTCTGA TTTGTTTCTT~ TTTTCCCTTT. TTGCTCTTTC TGGCTGTGTG- 240 

GTTTGGAGAA'. AGCACAGTTG GAGTAGCCGG TTGCTAAATA AGTCCCGAGC GCGAGCGGAG: 300 

ACGATGCAGC GGAGACTGGT TCAGCAGTGG AGCGTCGCGG TGTTCCTGCT- GAGCTACGCG - 360 

GTGCCCTCCT GCGGGCGCTC . GGTGGAGGGT CTCAGCCGCC GCCTCAAAAG- AGCTGTGTCT 420 

60 GAACATCAGC TCCTCCATGA CAAGGGGAAG TCCATCCAAG . ATTTACGGCG' ACGATTCTTC 480 

CTTCACCATC TGATCGCAGA AATCCACACA GCTGAAATCA GAGCTACCTC: GGAGGTGTCC 540 

CCTAACTCCA AGCCCTCTCC CAACACAAAG AACCACCCCG TCCGATTTGG GTCTGATGAT 600 

GAGGGCAGAT ACCTAACTCA GGAAACTAAC AAGGTGGAGA CGTACAAAGA GCAGCCGCTC 660 

AAGACACCTG GGAAGAAAAA GAAAGGCAAG CCCGGGAAAC GCAAGGAGCA- GGAAAAGAAA 720 

65 AAACGGCGAA CTCGCTCTGC : CTGGTTAGAC TCTGGAGTGA CTGGGAGTGG GCTAGAAGGG 780 

GACCACCTGT CTGACACCTC CACAAOGTCG CTGGAGCTCG ATTCACGGTA ACAGGCTTCT B40 

CTGGCCCGTA GCCTCAGCGG GGTGCTCTCA GCTGGGTTTT GGAGCCTCCC TTCTGCCTTG 900 

GCTTGGACAA ACCTAGAATT TTCTCCCTTT ATGTATCTCT ATCGATTGTG TAGCAATTGA 960 

CAGAGAATAA CTCAGAATAT i TGTCTGCCTT AAAGCAGTAC CCCCCTACCA CACACACCCC 1020 

70 TGTCCTCCAG CACCATAGAG AGGCGCTAGA GCCCATTCCT. CTTTCTCCAC CGTCACCCAA 1080 

CATCAATCCT TTACCACTCT ACCAAATAAT TTCATATTCA AGCTTCAGAA GCTAGTGACC 1140 

ATCTTCATAA TTTGCTGGAG AAGTGTATTT CTTCCCCTTA CTCTCACACC TGGGCAAACT 1200 

TTCTTCAGTG TTTTTCATTT CTTACGTTCT- TTCACTTCAA GGGAGAATAT AGAAGCATTT 12 60 

GAT ATT AT CT ACAAACACTG ' CAGAACAGCA TCATGTCATA AACGATTCTG AGCCATTCAC 1320 

75 ACTTTTTATT TAATTAAATG TATTTAATTA AATCTCAAAT- TTATTTTAAT GTAAAGAACT 1380 

TAAATTATGT TTTAAACACA TGCCTTAAAT TTGTTTAATT AAATTTAACT CTGGTTTCTA 1440 

CCAGCTCATA CAAAATAAAT GGTTTCTGAA AATGTTT AAG TATTAACTTA CAAGGATATA 1500 

GGTTTTTCTC ATGTATCTTT TTGTTCATTG GCAAGATGAA ATAATTTTTC TAGGGTAATG 1560 
CCGTAGGAAA AATAAAACTT CACATTTAAA AAAAA 



Seq ID NO: 355 Protein sequence 
Protein Accession NM.002820 



1 11 21 ' 31 41 51 

^ MQRRLVQQWS VAVFLLSYAV PSCGRSVEGL SRRLKRAVSE HQLLHDKGKS IQDLRRRFFL 60 
HHLIAEIHTA EIRATSEVSP NSKPSPNTKN HPVRFGSDDE GRYLTQETNK VETYKEQPLK 120 
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TPGKKKKGKP GKRKEQSKKK RRTRSAWLDS GVTGSGLEGD HLSDTSTTSL ELDSR 

• Seq ID NO: 356 DNA sequence 
Nucleic Acid Accession #: NM - 017522 
Coding sequence: ' 1-2100 

1 . 11 ■ 21 '31 41 51 

I I- -\l I I I 

ATGGGCCTCC CCGAGCCGGG CCCTCTCCGG, CTTCTGGOGC. TGCTGCTGCT GCTGCTGCTG 60 

CTGCTGCTGC TGCGGCTCCA* GCATCTTGCG: GCGGCAGCGG CTGATCCGCT GCTCGGCGGC 120 
CAAGGGCCGG CCAAGGAGTG- CGAAAAGGAC CAATTCCAGT GCCGGAACGA GOG CTG CATC . 180 

CCCTCTGTGT GGAGATGCGA CGAGGACGAT GACTGCTTAG ' ACCACAGCGA CGAGGACGAC 240 

TGCCCCAAGA AGACCTGTGC AGACAGTGAC TTCACCTGTG ACAACGGCCA CTGCATCCAC 300 

GAACGGTGGA AGTGTGACGG CGAGGAGGAG TGTCCTGATG. GCTCCGATGA GTCCGAGGCC 360 

ACTTGCACCA AGCAGGTGTG TCCTGCAGAG AAGCTGAGCT GTGGACCCAC CAGCCACAAG. 420 
TGTGTACCTG CCTCGTGGCG CTGCGACGGG GAGAAGGACT GCGAGGGTGG AGCGGATGAG". 480 

GCCGGCTGTG . CTACCTCACT GGGCACCTGG CGTGGGGACG AGTTCCAGTG TGGGGATGGG 540 

ACATGTGTCC TTGCAATCAA GCACTGCAAC CAGGAGCAGG ACTGTCCAGA TGGGAGTGAT 600 

GAAGCTGGCT GCCTACAGGG GCTGAACGAG: TGTCTGCACA ACAATGGOGG CTGCTCACAC 660 

ATCTGCACTG ACCTCAAGAT TGGCTTTGAA TGCACGTGCC CAGCAGGCTT CCAGCTCCTG 720 

GACCAGAAGA CTTGTGGCGA- CATTGATGAG . TGCAAGGACC CAGATGCCTG CAGCCAGATC 780 

• TGTGTCAATT ACAAGGGCTA TTTTAAGTGT GAGTGCTACC CTGGCTGCGA GATGGACCTA 840 
CTGACCAAGA ACTGCAAGGC TGCTGCTGGC: AAGAGCCCAT CCCTAATCTT CACCAACCGC 900 
ACGAGTGCGG AGGATCGACC TGTGAAGCGG- AACTATTCAC GCCTCATCCC CATGCTCAAG 960 
AATGTCGTGG CACTAGATGT GGAAGTTGCC ACCAATCGCA TCTACTGGTG TGACCTCTCC 1020 
TACCGTAAGA TCTATAGCGC CTACATGGAC AAGGCCAGTG ACCCGAAAGA GCGGGAGGTC 1080 

• CTCATTGACG AGCAGTTGCA CTCTCCAGAG- GGCCTGGCAG: TGGACTGGGT CCACAAGCAC. 1140 
ATCTACTGGA CTGACTCGGG; CAATAAGACC ATCTCAGTGG CCACAGTTGA" TGGTGGCCGC _ 1200 
CGACGCACTC TCTTCAGCCG: TAACCTCAGT GAACCCCGGG CCATCGCTGT TGACCCCCTG . 1260 
CGAGGGTTCA : TGTATTGGTC TGACTGGGGG GACCAGGCCA AGATTGAGAA ATCTGGGCTC" 1320 
AACGGTGTGG ACCGGCAAAC ACTGGTGTCA- GACAATATTG AATGGCCCAA CGGAATCACC 1380 
CTGGATCTGC TGAGCCAGCG* CTTGTACTGG. GTAGACTCCA AGCTACACCA ACTGTCCAGC 1440 
ATTGACTTCA GTGGAGGCAA CAGAAAGACG. CTGATCTCCT CCACTGACTT CCTGAGCCAC 1500 
CCTTTTGGGA- TAGCTGTGTT TGAGGACAAG GTGTTCTGGA- CAGACCTGGA GAACGAGGCC 1560 
ATTTTCAGTG CAAATCGGCT. CAATGGCCTG . GAAATCTCCA TCCTGGCTGA GAACCTCAAC 1620 
AACCCACATG' ACATTGTCAT. CTTCCATGAG. CTGAAGCAGC CAAGAGCTCC: AGATGCCTGT 1680 
GAGCTGAGTG TCCAGCCTAA" TGGAGGCTGT. GAATACCTGT- GCCTTCCTGC. TCCTCAGATC 1740 
rCCAGCCACT CTCCCAAGTA CACATGTGCC . TGTCCTGACA CAATGTGGCT GGGTCCAGAC, 1800 
ATGAAGAGGT. GCTACCGAGA TGCAAATGAA GACAGTAAGA TGGGCTCAAC AGTCACTGCC" 1860 
GCTGTTATCG GGATCATCGT GCCCATAGTG GTGATAGCCC TCCTGTGCAT GAGTGGATACv" 1920 
CTGATCTGGA GAAACTGGAA GCGGAAGAAC ACCAAAAGCA TGAATTTTGA CAACCCAGTC: 19B0 
TACAGGAAAA CAACAGAAGA AGAAGATGAA GATGAGCTCC ATATAGGGAG AACTGCTCAG" 2040: 
ATTGGCCATG. TCTATCCTGC ACGAGTGGCA TTAAGCCTTG AAGATGATGG ACTACCCTGA- 2100 
GGATGGGATC, ACCCCCTTCG TGCCTCATGG. AATTCAGTCC CATGCACTAC ACTCCGGATG' 2160 
GTGTATGACT GGATGAATGG GTTTCTATAT ATGGGTCTGT GTGAGTGTAT GTGTGTGTGT 2220 
GATTTTTTTT TTTAAATTTA TGTTGOGGAA AGGTAACCAC AAAGTTATGA TGAACTGCAA 2280 
ACATCCAAAG: GATGTGAGAG TTTTTCTATG TATAATGTTT TATACACTTT TTAACTGGTT 2340 
GCACTACCCA' TGAGGAATTC GTGGAATGGC TACTGCTGAC TAACATGATG CACATAACCA 2400 
AATGGGGGCC AATGGCACAG TACCTTACTC ATCATTTAAA AACTATATTT ACAGAAGATG 2460 
TTTGGTTGCT GGGGGGCTTT TTTAGGTTTT GGGCATTTGT TTTTTGTAAA TAAGATGATT 2520 
ATGCTTTGTG GCTATCCATC AACATAAGT 

Seq ID HO: 357 Protein sequence 
Protein Accession 8: NPJJS9992 

1 11 21 • 31 41 51 

I I I I I I 

MGLPSPGPLR LLALLLLLLL LLLLRLQHLA AAAADPLLGG QGPAKECEKD QF^CRNERCI 60 

PSVWKCDEDD DCLDHSDEDD CPKKTCADSD PTCDNGHCIH ERWKCDGBEE CPDGSDESEA 120 

TCTKQVCPAE KLSCGPTSHK CVPASWRCDG BKDCEGGADE AGCATSLGTC RGDEFQCGDG 180 

TCVLAIKHCN QEQDCPDGSD EAGCLQGLNE CLHNNGGCSH ICTDLKIGPE CTCPAGPQLL 240 

DQKTCGDIDB CKDPDACSQIv CVNYKGYFKC ECYPGCEMDL I»TKNCKAAAG KSPSLIFTNR- 300 

TSAEDRPVKR NYSRLIPMLK NWALDVEVA TNRIYWCDLS YPJCIYSAYMD KASDPKEREV 360 

LIDEQLHSPB-GLAVDWVHKH IYWTDSGNKT ISVATVDGGR- RRTLFSRNLS EPRAIAVDPL* 420 

RGFMYWSDWG DQAKIEKSGI* NGVDRQTLVS DNIEWPNGIT LDLLSQRLYW VDSKLHQLSS 480 

IDFSGGNRKT- LISSTDFLSH PFGIAVFEDK- VFWTDLENEA' IFSANRLNGL EISILAENLN 540 

NPHDIVIFHE LKQPRAPDAC ELSVQPNGGC EYLCLPAPQI SSHSPKYTCA CPDTMWLCPD 600 

MKRCYRDANE DSKMGSTVTA AVIGIIVPIV- VIALLCMSGY LIWRNWKRKN TKSMNFDNPV 660 
YRKTTEEEDE DELHI GRTAQ IGHVYPARVA LSLEDDGLP 

Seq ID NO: 358 DNA sequence 
Nucleic Acid Accession #: M27826 
Coding sequence: <l-503 

1 II 21 31 41 51 

I I " I I I I 

AGCCCAAGAA ACATCTCACC AATTTCAAAT CTGATCTATT CGGCTTAGCG ACTGAAGATT 60 

GACGCTGCCC GATCGCCTCG GAAGTCCCCT GGACCATCAC AGAAGCCGAG CTTCGGGTAA 120 

CTCTCACAGT GGAGGGTAAG TCCATCCCCT GTTTAATCGA TACGGGGGCT ACCCACTCCA 180 

CGTTGCCTTC TTTTCAAGGG CCTGTTTCCC TTGCCCCCAT AACTGTTGTG GGTATTGACG 240 

GCCAAGCTTC AAAACCCCTG AAAACTCCCC CACTCTGGTG CCAACTTGGA CAACACTCTT 300 

TTATGCACTC TTTTTTAGTT ATCCCCACCT GCCCACTTCC CTTATTAGGC CGAAATATTT 360 

TAACCAAATT ATCTGCTTCC CTGACTATTC CTGGAGTACA GCTACATCTC ATTGCTGCCC 420 

TTCTTCCCAA TCCAAAGCCT CCTTTGTGTC CTCTAACATC CCCACAATAT CAGCCCTTAC 480 

CACAAGACCT CCCTTCAGCT TAATCTCTCC CACTCTAGGT TCCCACGCCG CCCCTAATCC 540 

CACTTGAAGC AGCCCTGAGA AACATCGCCC ATTCTCTCTC CATACCACCC CCCAAAAATT 600 

TTCGCCGCTC CAACACTTCA ACACTATTTT GTTTTATTTG TCTTATTAAT ATCAGAAGGC 660 



320 



WO 02/086443 a *_ * 

AGGAATGTCA GGCCTCTGAG CCCAGGCCAG GCCATOGCAT CCCCTGTGAC ™CAOGTAT 720 

ACATCCAGAT GGCCTGAAGT AACTGAAGAT CCACAAAAGA AGTAAAAACA GCCTTAACTG 780 

ATGACATTCC ACCATTGTGA TTTGTTCCTG CCCCACCCTA ACTGATCAAT GTACTTTGTA 840 

ATCTCCCCCA CCCTTAAGAA GGTTCTTTGT AATTCTCCCC. ACCCTTGAGA ATGTACTTTG 900 

TGAGATCCAC CCCTGCCCAC CAGAGAACAA CCCCCTTTGA TTGTAATTTT ™™CCTT 960 

CCCAAATCCT ATAAAACAGC CCCACCCCTA TCTTCCTTCA CTGACTCTCT TTTCGGACTC 1020 
AGCCACCGGC ACCCAGGTGA AATAAACAGC TTTATTGCTC AC 

Seq ID NO: 359 Protein sequence 
Protein Accession ft: • AAA65999 

1 11 .21 31 41 51 

PKKHLTNFKS DLFGLATEDW- RCPIASEVPW TITEAELRVT- LTVEGKSIPC LIDTGATHST 60 
LPSFQGPVSL APITWGIDG QASKPLKTPP LWCQLGQHSF: MHSFLVIPTC PLPLLGRNIL 120 
TKLSASLTIP GVQLHLIAAL ' LPNPKPPLCP LTSPQYQPU QDLPSA 

Seq ID NO: 360 DNA sequence 
Nucleic Acid Accession #: NM_001854 
Coding sequence: 162-5582 



60 
120 
180 
240 
300 
360 
420 



1 11 21 31 41 51 

AACCATCAAA" TTTAGAAGAA AAAGCCCTTT GACTTTTTCC CCCTCTCCCT CCCCAATGGC 
TGTGTAGCAA ACATCCCTGG CGATACCTTG GAAAGGACGA AGTTGGTCTG CAGTCGCAAT 
TTCGTGGGTT GAGTTCACAG TTGTGAGTGC GGGGCTCGGA* GATGGAGCCG TGGTCCTCTA- 
GGTGGAAAAC GAAACGGTGG CTCTGGGATT TCACCGTAAC AACCCTCGCA TTGACCTTCC- 
TCTTCCAAGC TAGAGAGGTC AGAGGAGCTG CTCCAGTTGA' TGTACTAAAA GCACTAGATT 
TTCACAATTC- TCCAGAGGGA- ATATCAAAAA CAACGGGATT- TTGCACAAAC AGAAAGAATT 
CTAAAGGCTC AGATACTGCT . TACAGAGTTT. CAAAGCAAGC ACAACTCAGT GCCCCAACAA 
AACAGTTATT TCCAGGTGGAv ACTTTCCCAG AAGACTTTTC AATACTATTT. ACAGTAAAAC 480 
CAAAAAAAGG AATTCAGTCT TTCCTTTTAT' CTATATATAA TGAGCATGGT ATTCAGCAAA ■ 540 
TTGGTGTTGA GGTTGGGAGA TCACCTGTTT TTCTGTTTGA- AGACCACACT GGAAAACCTG 600 
CCCCAGAAGA- CTATCCCCTC TTCAGAACTG TTAACATCGC. TGACGGGAAG TGGCATCGGG 680 

tagcaatcag: cgtggagaag aaaactgtga caatgattgt; ^taag J^joa 7 0 

CGAAACCACT TGATAGAAGT GAGAGAGCAA TTGTTGATAC CAATGGAATC ACGGTTTTTG 780 
GAACAAGGAT TTTGGATGAA GAAGTTTTTG AGGGGGACAT, TCAGCAGTTT. TTGATCACAG- 840 
GTGATCCCAA GGCAGCATAT GACTACTGTG AGCATTATAG TCCAGACTGT GACTCTTCAG- 900 
CACCCAAGGC TGCTCAAGCT CAGGAACCTC AGATAGATGA GTATGCACCA-GAGGATATAA' 960 
TCGAATATGA' CTATGAGTAT GGGGAAGCAG AGTATAAAGA GGCTGAAAGT GTAACAGAGG - 1020 

AAGAATACAA CTATGGAACA- ATGGAAAGTT ACCAGACAGA* AGCTCCTAGG CATGTTTCTG 1140 
GGACAAATGA GCCAAATCCA GTTGAAGAAA TATTTACTGA- AGAATATCTA ACGGGAGAGG 1200 
ATTATGATTC CCAGAGG AAA AATTCTGAGG ATACACTATA TGAAAACAAA GAAATAGACG. 1260 
GCAGGGATTC TGATCTTCTG : GTAGATGGAG ATTTAGGCGA ATATGATTTT TATGAATATA: 1320 
^SaTATGA- AGATAAACCA ACAAGCCCCC CTAATGAAGA- ATTTGGTCCA OGTGTACCAG, 1380 
CAGAAACTGA TATTACAGAA ACAAGCATAA ATGGCCATGG TGCATATGGA GAGAAAGGAC 1440 
AgTaAGGAGA ACCAGCAGTG GTTGAGCCTG. GTATGCTTGT CGAAGGACCA CCAGGACCAG 1500 
CAGGACCTGC AGGTATTATG- GGTCCTCCAG GTCTACAAGG CCCCACTGGA CCCCCTGGTG?. 1560 
ACCCTGGCGA TAGGGGCCCC; CCAGGACGTC CTGGCTTACC AGGGGCTGAT GGTCTACCTG 162 0 
GTCCTCCTGG TACTATGTTG ATGTTACCGT. TCCGTTATGG TGGTGATGGT- TCCAAAGGAC 
CAACCATCTC TGCTCAGGAA GCTCAGGCTC AAGCTATTCT. TCAGCAGGCT CGGATTGCTC 
TGAGAGGCCC ACCTGGCCCA ATGGGTCTAA- CTGGAAGACC AGGTCCTGTG GGGGGGCCTG 
GTTCATCTGG GGCCAAAGGT GAGAGTGGTG ATCCAGGTCC TCAGGGCCCT CGAGGCGTCC 
AGGGTCCCCC TGGTCCAACG GGAAAACCTG GAAAAAGGGG TCGTCCAGGT GCAGATGGAG 
GAAGAGGAAT. GCCAGGAGAA CCTGGGGCAA AGGGAGATCG AGGGTTTGAT GGACTTCCGG- 1980 
GTCTGCCAGG TGACAAAGGT CACAGGGGTG AACGAGGTCC TCAAGGTCCT CCAGGTCCTC 2040 
CTGGTGATGA TGGAATGAGG GGAGAAGATG GAGAAATTGG ACCAAGAGGT CTTCCAGGTG 
AAGCTGGCCC ACGAGGTTTG. CTGGGTCCAA * GGGGAACTCC AGGAGCTCCA GGGCAGCCTG ; 
GTATGGCAGG TGTAGATGGC CCCCCAGGAC CAAAAGGGAA CATGGGTCCC ^GGGGAGC 
CTGGGCCTCC AGGTCAACAA GGGAATCCAG GACCTCAGGG TCTTCCTGGT CCACAAGGTC 
CAATTGGTCC TCCTGGTGAA AAAGG ACCAC ' AAGGAAAACC AGGACTTGCT GGACTTCCTG 
GTGCTGATGG GCCTCCTGGT CATCCTGGGA AAGAAGGCCA GTCTGGAGAA AAGGGGGCTC 
TGGGTCCCCC TGGTCCACAA GGTCCTATTG GATNNCCGGG CCCCCGGGGA GTAAAGGGAG 
CAGATGGTGT CAGAGGTCTC AAGGGATCTA AAGGTGAAAA GGGTGAAGAT GGTTTTCCAG 
GATTCAAAGG TGACATGGGT- CTAAAAGGTG ACAGAGGAGA AGTTGGTCAA ATTGGCCCAA 
GAGGGNAAGA TGGCCCTGAA GGACCCAAAG GTCGAGCAGG CCCAACTGGA (^CCCAGGTC 
CTTCAGGTCA AGCAGGAGAA AAGGGAAAAC TTGGAGTTCC AGGATTACCA GGATATCCAG 
GAAGACAAGG TCCAAAGGGT TCCACTGGAT TCCCTGGGTT TCCAGGTGCC AATGGAGAGA 2760 
AAGGTGCACG GGGAGTAGCT GGCAAACCAG GCCCTCGGGG TCAGCGTGGT CCAACGGGTC 2820 
CTCGAGGTTC AAGAGGTGCA AGAGGTCCCA CTGGGAAACC TGGGCCAAAG GGCACTTCAG- 2880 
CCCTCCTGGC CCTCCAGGTG AAAGAGGTCC TCAAGGACCT CAGGGTCCAG: 2940 
TTGGATTCCC TGGACCAAAA GGCCCTCCTG GACCACCAGG AAGGATGGGC TGCCCAGGAC 3000 
ACCOCGGGCA ACGTGGGGAG ACTGGATTTC AAGGCAAGAC CGGCCCTCCT GGGCCAGGGG 3060 
£g?gS acc^caggga CCAACCGGTG AGACTGGTCC AATAGGGGAA CGTGGGTATC 3120 
CTGGTCCTCC TGGCCCTCCT GGTGAGCAAG GTCTTCCTGG TGCTGCAGGA AAAGAAGGTG 3180 
CAAAGGGTGA TCCAGGTCCT GAAGGTATCT CAGGGAAAGA TGGACCAGCA GGATTACGTG. 3240 
GTTTCCCAGG GGAAAGAGGT CTTCCTGGAG CTCAGGGTGC ACCTGGACTG AAAGGAGGGG? 3300 
AAGGTCCCCA GGGCCCACCA GGTCCAGTTG GCTCACCAGG AGAACGTGGG TCAGCAGGTA, 3360 
CAGCTGGCCC AATTGGTTTA CGAGGGCGCC CGGGACCTCA GGGTCCTCCT GGTCCAGCTG 3420 
GAGAGAAAGG TGCTCCTGGA GAAAAAGGTC CCCAAGGGCC TGCAGGGAGA GATGGAGTTC 3480 
AAGGTCCTGT TGGTCTCCCA GGGCCAGCTG GTCCTGCCGG CTCCCCTGGG GAAGACGGAG. 3540 
ACAAGGGTGA AATTGGTGAG CCGGGACAAA AAGGCAGCAA GGGTGGCAAG GGAGAAAATG 3600 
G^^G.T^AC<jr CTTC^GGAC CAGTTGGTGC CCCTGGAATT GCTGGAGGTG 3660 
ATGGTGAACC AGGTCCTAGA GGACAGCAGG GGATGTTTGG GCAAAAAGGT GATGAGGGTG 
CCAGAGGCTT CCCTGGACCT CCTGGTCCAA TAGGTCTTCA GGGTCTGCCA GGCCCACCTG 
GTGAAAAAGG TGAAAATGGG GATGTTGGTC CATGGGGGCC ACCTGGTCCT CCAGGCCCAA 
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5 
10 
15 
20 
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50 
55 
60 
65 
70 
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GAGGCCCTCA 
CAGTTGGTGG 
GGGAAGCAGG 
CTGGAGCTGC 
ACCCGGGTCC 
GTCAAGATGG 
CTGGCCCATC 
CTGCAGGTGC 
GTCCTCCTGG 
AAGGTCTTCG 
AAGATGGACC 
GCTCCAAGGG 
AAGGGGAAAA 
ATGGGGGAAT 
GTCCTCAAGG 
GTGGTCTTCC 

caatcttgtc 
ataatattct; 
aacaagacat- 
gtaaagacct 
accaaggttg 

CTTGCATTTA- 
AACCAGGAAGr 
AAGGAAATTC- 
GGCAAAATTT 
GTTATGACAA 
ATCCTTTTAT 
TCATTGAAAT 
ACTTTGGTGA" 
AAGATTAAGA 
TTTTGTGCCA 
TACCATT TAG 
ATCATAAAGA- 
TTCTCAACTC 
AATAT AT ATT " 
TGTGTTTAAT' 
CCAAAACTTG 
GATGGCAATA' 
TTTCTTTGGT 



AGGTCCCAAT 
TGTTGGAGAA" 
TGTAGGCGGt. 
TGGACCTCCA? 
-TGTTGGTTTT. 

tgttggtggt 

TGGTGAGGCT 
AGAGGGAAGA 
AAAAACCGGC: 
GGGCATCCCT 
ACCTGGTCCT 
TGAAAAGGGA" 
AGGTGACCGA 
TCCTGGTCCT 
CCCAAAGGGT 
AGGGCCTCCT 
CTCCAAAAAA 
TGATTACTCG 
CGAGCATATG 
GCAACTCAGC 
CTCAGGAGAT 
TCCAGACAAA- 
. TTGGTTTAGT 
CATCAAT ATG '■ 
CACCTACCAC 
AGCACTT CGC 
CAAAACACTG" 
CAATACACCA-' 
TCAGAATCAff? 
CAAAGAACAT 
CATGCAAGTT 
GAAATACOGA 
TATAAGTTGG, 
TCCTTTTCCT 
CATAAAAAAT 
AAATTGTAAT 
CACGTGTCCC: 
ATATATGTAT 
TAATGATGAA 



GGAGCTGATG 
AAGGGTGAAC 
CCCAAAGGAG 
GGTGCCAAGG 
CCTGGAGATC 
GACAAGGGTG 
GGCCCACCAG 
CAAGGTGAAA' 
CCAGTCGGTC 
GGTCCTGTGG 
ATGGGACCTC 
CATCCTGGTT 
GGGCTCCCTG 
GCTGGTCCCT 
AACAAAGGCT 
GGGCCTCCAG 
ACGAGAAGAC 
GATGGAATGG 
AAATTTCCAA 
CATCCTGACT 
TCCTTCAAAG 
AAATCTGAGG 
GAATTTAAGA 
GTGCAAATGA 
TGTCATCAGT 
TTCCTGGGAT 
TATGATGGTT 
AAAATTGATC 
AAGTTCGGAT 
ATCAAATCAA 
TTGAATAAGG 
TGCCTTTGTG 
.TGTGGCTAAG 
ATTTGAATTT 
ATGGTGCTCA 
TATTTTGTGT 
TGAATTCCGC 
TATGAAAATG 
ATTCCTTTGT 



GACCACAAGG 
CTGGAGAAGC 
AAAGAGGAGA 
GGCOGCCAGG 
CTGGTCCTCC 
AAGATGGAGA 
GTCCTCCTGG 
AAGGTGCTAA 
CTCAGGGACC 
GAGAACAAGG 
CTGGCTTACC 
TAATTGGCCT 
GAACTCAAGG 
TAGGTCCACC 
CTACTGGACC: 
GTCCACCTGG 
ATACTGAAGG 
AAGAAATATT 
TGGGTACTCA 
TCCCAGATGG 
TTTACTGTAA 
GAGTAAGAAT 
GGGGAAAACT 
CATTCCTGAA 
CAGCAGCCTG 
CAAATGATGA 
GTACGTCCAG 
AAGTACCTAT 
TTGAAGTTGG 
CAGAAAATGT. 
ATGTATGGAA 
GGGGCAGAAT. 
ATGGAAACAG 
CTTTGGTGCT 
TTCTCATCCA 
ACAGTTCTAT 
TGACTCTAAT 
AAGTTATGAT 
GTGTGTTT 



ACCCCCAGGT 
AGGAAACCCA 
GAAAGGGGAA 
TGATGATGGC 
TGGGGAACTT 
TCCTGGTCAA 
AAAACGAGGT 
GGGGGAAGCA' 
TGCAGGAAAG. 
TCTCCCTGGA 
TGGTCTCAAA 
GATTGGTCCT 
ATCTCCAGGA 
TGGTCCTCCA' 
CGCTGGCCAG 
TGAAGTCATT 
CATGCAAGCA- 
TGGTTCCCTC 
GACCAATCCA 
TGAATATTGG 
TTTCACATCT 
TTCATCATGG 
GCTTTCATAC 
ACTTCTGACT 
GTATGATGTG 
GGAGATGTCC 
AAAAGGCTAT 
TGTTGATGTC 
TCCTGTTTGT 
ACCTTGGTGC 
AACAACGCTG 
CACAGACAAA 
GGCTGATTCT 
GTAGAAAACA 
TCCAGGATGT 
ACTGTTATCT 
TTATGAGGAT 
TTCCGATGAC 



TCTGTTGGTT 

GGGCCTCCTG 

GCTGGTCCAC- 

CCTAAGGGTA 

GGCCCTGCAG 

CCGGGTCCTC 

CCTCCTGGAG 

GGTGCAGAAG 

CCTGGTCCAG 

GCTGCAGGCC 

GGTGACCCTG 

CCAGGAGAAC 

GCAAAAGGGG 

ggcttaccag 
aaaggtgaca 
cagcctttac 
gatgcagatg". 
aattccctga- 
gcccgaactt 
attgatccta 

GGTGGTGAGA 

CCAAAGGAGA 

TTAGATGTTG . 

GCCTCTGCTC 

TCATCAGGAA 

TATGACAATA " 

GAAAAAACTG 

ATGATCAGTG ' 

TTTCTTGGCT * 

CACCAACCCA 

CATATACAGG 

AGCTTTGAAA- 

TGATTCCCAA' 

AAAAAAGAAA- 

ACTAAAACAG 

GTGTCCATTT 

GCCGAACTCT 

CCTAAGTCCC 
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Seq ID NO: 361 Protein sequence 
Protein Accession #: NP_001845 



45 T 



MEPWSSRWKT 
CTNRKNSKGS 
EHGIQQIGVE 
DCKKKTTKPU 
PDCDSSAPKA 
NIVDDFQEYN 



AYGEKGQKGE 
GADGLPGPPG 
GPVGGPGSSG 
GPDGLPGLPG 
GAPGQPGMAG 
GLAGIiPGADG 
GEDGFPGFKG 
GLPGYPGRQG 
GPKGTSGGDG 
GPPGPGGWG 
GPAGLRGPPG 
GPPGPAGEKG 
GGKGENGPPG 
GLPGPPGEKG 
GNPGPPGEAG 
GELGPAGQDG 
G SAG AEG PPG 
GLKGDPGSKG 
GPPGIiPGPQG 
MQADADDNIL 
EVWIDPMOGC 
LSYLDVEGNS 
EMSYDNNPFI 
PVCFLG 



11 
I 

KRWLWDFTVT 
DTAYRVSKQA 
VGRSPVFIjFE 
DRSERAIVDT 
AQAQEPQIDE 
YGTMESYQTE 
DLLVDGDLGE 
PAWEPGMDV 
TMLMLPFRYG 
AKGESGDPGP 



VDGPPGPKGN: 
PPGHPGKEGQ 
DMGLKGDRGE.. 
PKGSTGFPGP 
PPGPPGERGP 
PQGPTGETGP 
ERGLPGAQGA 
APGEKGPQGP 
PPGLQGPVGA^ 



VGGPKGERGE 
VGGDKGEDGD 
KTGPVGPQGP 
EKGHPGLIGL 
PKGNKGSTGP 
DYSDGMEEIF 
SGDSFKVYCN 
INMVOMTFLK 
KTLYDGCTSR 



21 

! 

TLAi/TFLFQA 
QIiSAPTKQLF 
DHTGKPAPED 
NGITVFGTRI- 
YAPEDIIEYD' 
APRHVSGTNE 
YDFYEYKEYE 
EGPPGPAGPA 
GDGSKGPTIS 
QGPRGVQGPP 
QGPPGPPGDD 
MGPQGEPGPP 
SGEKGALGPP 
VGQIGPRGXD 
PGAHGEKGAR 
QGPQGPVGFP 
IGERGYPGPP 
PGLKGGEGPQ 
AGRDGVQGPV 
PGIAGGDGEP 
PGPPGPRGPQ 
KGEAGPPGAA 
PGQPGPPGPS 
AGKPGPEGLR 
IGPPGEQGEK 
AGQKGDSGLP 
GSUISIiKQDI 
FTSGGETCIY 
LLTASARQNF 
KGYEKTVIEI 



31 
I 

REVRGAAPVD 
PGGTFPEDPS 
YPLFRTVKIA 
LDEEVFEGDI 1 
YEYGEAEYKE 
PNPVEBIFTE- 
DKPTSPPNEE 
GIMGPPGLQG 
AQEAQAQAIL 
GPTGKPGKRG 
GMRGEDGEIG 
GQQGNPGPOG 
GPQGPIGXPG 
GPEGPKGRAG 
GVAGKPGPRG 
GPKGPPGPPG 
GPPGEQGLPG 
GPPGPVGSPG 
GLPGPAGPAG 
GPRGQQGMFG 
GPNGADGPQG 
GPPGAKGPPG 
GEAGPPGPPG 
GIPGPVGEQG 
GDRGLPGTQG 
GPPGPPGPPG 
EHMKFPMGTQ 
PDKKSEGVRI 
TYHCHQSAAW 
NTPKIDQVPI 



41 
I 

VLKALDFHNS 
ILFTVKPKKG 
DGKWHRVAIS 
QQFLITGDPK 
AESVTEGPTV 
EYLTGEDYDS. 
FGPGVPAETD. 
PTGPPGDPGD 
QQARIALRGP 
RPGADGGRGM 
PRGLPGEAGP 
LPGPOGPIGP 
PRGVKGADGV 
PTGDPGPSGQ 
QRGPTGPRGS 
RMGCPGHPGQ 
AAGKEGAKGD 
ERGSAGTAGP 
SPGEDGDKGE 
QKGDEGARGF 
PPGSVGSVGG 
DDGPKGNPGP 
. KRGPPGAAGA 
LPGAAGQDGP 
SPGAKGDGGI 
EVIQPLPILS 
TNPARTCKDL 
SSWPKEKPGS 
YDVSSGSYDK 
VDVMISDFGD 



51 
I 

PEGISKTTGF 

IQSFIiLSIYN 

VEKKTVTMIV 

AAYDYCEHYS 

TEETIAQTEA 

QRKNSEDTLY 

ITETSINGHG 

RGPPGRPGLP 

PGPMGLTGRP 

PGEPGAKGDR 

RGIjLGPRGTP 

PGEKGPQGKP 

RGLKGSKGEK 

AGEKGKIjGVP 

RGARGPTGKP 

RGETGFQGKT 

PGPQGISGKD 

IGLRGRPGPQ 

IGEPGQKGSK 

PGPPGPIGLQ 

VGEKGEPGEA 

VGFPGDPGPP 

EGRQGEKGAK 

PGPMGPPGLP 

PGPAGPLGPP 

SKKTRRHTEG . 

QLSHPDFPDG " 

WFSEFKRGKL 

ALRFLGSNDE 

QNQKFGFEVG 



Seq ID NO: 362 DNA sequence 
Nucleic" Acid Accession #: NM_003107 
Coding sequence: 351-1775 
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600 
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1020 
1080 
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11 



21 31 41 51 

I I I I 

TTCCCCAGCA TTCGAGAAAC- TCCTCTCTAC TTTAGCACGG TCTCCAGACT CAGCCGAGAG 
. ACAGCAAACT gcagcgcggt GAGAGAGCGA GAGAGAGGGA GAGAGAGACT CTCCAGCCTG 
GGAACTATAA CTCCTCTGCG AGAGGCGGAG AACTCCTTCC CCAAATCTTT TGGGGACTTT 



60 
120 
180 



322 



WO 02/086443 

TCTCTCTTTA CCCACCTCCG ." CCCCTGCGAG GAGTTGAGGG GCCAGTTCGG CCGCCGCGOG 240 

CGTCTTCCCG TTCGGCGTGT ' GCTTGGCOCG* GGGAACCGGG AGGGCCCGGC GATCGCGOGG '300 

CGGCCGCCGC GAGGGTGTGA GCGCGCGTGG . GCGCCCGCCG AGCCGAGGCC ATGGTGCAGC 360 

AAACCAACAA TGCCGAGAAC AOGGAAGCGC TGCTGGCCGG CGAGAGCTCG GACTCGGGCG 420 
CCGGCCTCGA GCTGGGAATC GCCTCCTCCC CCACGCCOGG CTCCACCGCC TCCACGGGCG : 480 

GCAAGGCCGA CGACCCGAGC TGGTGCAAGA CCCCGAGTGG GCACATCAAG CGACCCATGA 540 

ACGCCTTCAT GGTGTGGTCG CAGATCGAGC GGCGCAAGAT CATGGAGCAG' TCGCCCGACA 600 

TGCACAACGC CGAGATCTCC AAGCGGCTGG. GCAAACGCTG GAAGCTGCTC AAAGACAGCG 660 

ACAAGATCCC TTTCATTCGA GAGGCGGAGC GGCTGCGCCT CAAGCACATG GCTGACTACC 720 

CCGACTACAA GTACCGGCCC AGGAAGAAGG . TGAAGTCGGG CAACGCCAAC TCCAGCTCCT 780 

CGGCCGCCGC CTCCTCCAAG CCGGGGGAGA AGGGAGACAA GGTCGGTGGC AGTGGCGGGG 840 
GCGGCCATGG GGGCGGCGGC GGCGGCGGGA GCAGCAACGC GGGGGGAGGA GGCGGCGGTG - 900 

CGAGTGGCGG CGGCGCCAAC TCCAAACOGG CGCAGAAAAA GAGCTGCGGC TCCAAAGTGG 960 

CGGGCGGCGC GGGCGGTGGG GTTAGCAAAC CGCACGCCAA GCTCATCCTG GCAGGCGGCG 1020 

GCGGCGGCGG GAAAGCAGCG GCTGCCGCCG CCGCCTCCTT OGCCGCCGAA" CAGGCGGGGG 1080 

CCGCCGCCCT GCTGCCCCTG GGCGCCGCCG COGACCACCA CTCGCTGTAC AAGGCGCGGA 1140 

CTCCCAGCGC CTCGGCCTCC GCCTCCTCGG CAGCCTCGGC CTCCGCAGCG CTCGCGGCCC ,. 1200 

CGGGCAAGCA CCTGGCGGAG AAGAAGGTGA AGCGCGTCTA CCTGTTCGGC GGCCTGGGCA* .1260 

CGTCGTOGTC GCCCGTGGGC GGCGTGGGCG. CGGGAGCOGA CCCCAGCGACCCCCTGGGCC 1320 

TGTACGAGGA' GGAGGGCGCG GGCTGCTCGC CCGACGCGCC CAGCCTGAGC GGCCGCAGCA 1380 

GCGCCGCCTC GTCCCCOGCC GCCGGCCGCT CGCCCGCCGA CCACCGCGGC TACGCCAGCC . 1440 

TGCGCGCCGC CTCGCCCGCC CCGTCCAGCG- CGCCCTCGCA CGCGTCCTCC TCGGCCTCGT 1500 

CCCACTCCTC CtCTTCCTCC-.TCCTCGGGCT CCTCGTCCTC, CGACGACGAG TTCGAAGACG ■ 1560 

ACCTGCTCGA CCTGAACCXTC AGCTCAAACT TTGAGAGCAT GTCCCTGGGC AGCTTCAGTT 1620 

CGTCGTCGGC GCTCGACCGG GACCTGGATT TTAACTTCGA GCCCGGCTCC GGCTCGCACT 1680 

TCGAGTTCCC GGACTACTGC ACGCCCGAGG TGAGCGAGAT GATCTCGGGA- GACTGGCTCG 1740 

AGTCCAGCAT CTCCAACCTG GTTTTCACCT ACTGAAGGGC GCGCAGGCAG GGAGAAGGGC 1800 

CGGGGGGGGT AGG AGAGG AG AAAAAAAAAG TGAAAAAAAG AAACGAAAAG GACAGACGAA I860 

GAGTTTAAAG AGAAAAGGGA AAAAAGAAAG AAAAAGTAAG CAGGGCTCGT TCGCCCGCGT 1920 

TCTCGTCGTC GGATCAAGGA GCGCGGCGGC GTTTTGGACC CGCGCTCCCA TCCCCCACCT 1980 

TCCCGGGCCG GGGACCCACT CTGCCCAGCC GGAGGGACGC GGAGGAGGAA GAGGGTAGAC 2040 

AGGGGCGACC TGTGATTGTT GTTATTGATG..TTGTTGTTGA TGGCAAAAAA AAAAAGCGAC 2100 

TTCGAGTTTG CTCCCCTTTG CTTGAAGAG A- CCCCCTCCCC CTTCCAACGA GCTTCCGGAC 2160 

TTGTCTGCAC CCCCAGCAAG AAGGCGAGTT AGTTTTCTAG' AGACTTGAAG. GAGTCTCCCC 2220 

CTTCCTGCAT CACCACCTTG GTTTTGTTTT ATTTTGCTTC TTGGTCAAGA AAGGAGGGGA 2280 

GAACCCAGCG CACCCCTCCC CCCCTTTTTT TAAACGCGTG ATGAAGACAG AAGGCTCCGG 2340 

GGTGACGAAT TTGGCCGATG GCAGATGTTT TGGGGGAACG CCGGGACTGA GAGACTCCAC 2400 

GCAGGCGAAT TCCCGTTTGG GGCCTTTTTT TCCTCCCTCT TTTCCCCTTG CCCCCTCTGC 2460 

AGCCGGAGGA . GGAGATGTTG AGGGGAGGAG GCCAGCCAGT GTGACCGGCG CTAGGAAATG 2520 

ACCCGAGAAC CCCGTTGGAA GCGCAGCAGC GGGAGCTAGG GGCGGGGGCG GAGGAGGACA ' 2580 

CGAACTGGAA GGGGGTTCAC GGTCAAACTG AAATGGATTT GCACGTTGGG GAGCTGGCGG 2640 

CGGCGGCTGC TGGGCCTCCG CCTTCTTTTC TACGTGAAAT CAGTGAGGTG AGACTTCCCA 2700 

GACCCCGGAG GCGTGGAGGA GAGGAGACTG TTTGATGTGG TACAGGGGCA GTCAGTGGAG 2760 
GGCGAGTGGT TTCGGAAAAA AAAAAAGAAA AAAAGGG 

Seq ID NO: 363 Protein sequence 
Protein Accession #: NP_00309B 

1 11 . 21 31 41 51 

I ' I, • I ' I I I 

MVQOTNNAEN TEALLAGESS DSGAGLELGI ASSPTPGSTA STGGKADDPS WCKTPSGHIK '60 

RPMNAFMVWS QIERRKIMEQ SPDMHNAEIS KRLGKRWKLL KESDKIPPIR EAERLRLKHM 120 

ADYPDYKYRP RKKVKSGNAN SSSSAAASSK PGEKGDKVGG SGGGGHGGGG GGGSSNAGGG 180 

GGGASGGGAN' SKPAQKKSCG SKVAGGAGGG VSKPHAKLIL AGGGGGGKAA AAAAASFAAE 240 

QAGAAALLPL GAAADHHSLY KARTPSASAS ASSAASASAA LAAPGKHLAE KKVKRVYLFG 300 " 

GLGTSSSPVG. GVGAGADPSD PLGLYEEEGA GCSPDAPSLS GRSSAASSPA AGRSPADHRG 360 

YASLRAASPA PSSAPSHASS SASSHSSSSS SSGSSSSDDE FEDDLLDLNP SSNPESMSLG . 420 
SPSSSSALDR DLDFNPEPGS GSHFEFPDYC TPEVSEMISG DWLESSISNL VPTY 



Seq ID NO: 364 DNA sequence 
Nucleic Acid Accession ft : U1086O 
Coding sequence: 123-2204 

1 11 21 . 31. 41 51 

I I I I I I 

TGCCGGCTGC TCCTCGACCA GGCCTCCTTC TCAACCTCAG CCCGCGGCGC CGACCCTTCC 60 

GGCACCCTOC CGCCCCGTCT CGTACTGTCG CCGTCACCGC CGCGGCTCCG GCCCTGGCCC 120 

CGATGGCTCT GTGCAACGGA GACTCCAAGC TGGAGAATGC TGGAGGAGAC CTTAAGGATG 180 

GCCACCACCA CTATGAAGGA GCTGTTGTCA TTCTGGATGC TGGTGCTCAG TACGGGAAAG 240 

TCATAGACCG AAGAGTGAGG GAACTGTTCG TGCAGTCTGA AATTTTCCCC TTGGAAACAC 300 

CAGCATTTGC TATAAAGGAA CAAGGATTCC GTGCTATTAT CATCTCTGGA GGACCTAATT 360 

CTGTGTATGC TGAAGATGCT CCCTGGTTTG ATCCAGCAAT ATTCACTATT GGCAAGCCTG 420 

TTCTTGGAAT TTGCTATGGT ATGCAGATGA TGAATAAGGT ATTTGGAGGT ACTGTGCACA 480 

AAAAAAGTGT CAGAGAAGAT ' GGAGTTTTCA ACATTAGTGT GGATAATACA TGTTCATTAT 540 

TCAGGGGCCT TCAGAAGGAA : GAAGTTGTTT TGCTTACACA TGGAGATAGT GTAGACAAAG 600 

TAGCTGATGG ATTCAAGGTT GTGGCACGTT CTGGAAACAT AGTAGCAGGC ATAGCAAATG 660 

AATCTAAAAA GTTATATGGA GCACAGTTCC ACCCTGAAGT TGGCCTTACA GAAAATGGAA 720 
AAGTAATACT GAAGAATTTC CTTTATGATA TAGCTGGATG CAGTGGAACC TTCACCGTGC ' 780 

AGAACAGAGA ACTTGAGTGT ATTCGAGAGA TCAAAGAGAG AGTAGGCACG TCAAAAGTTT " 840 
TGGTTTTACT CAGTGGTGGA GTAGACTCAA CAGTTTGTAC. AGCTTTGCTA AATCGTGCTT : 900 

TGAACCAAGA ACAAGTCATT GCTGTGCACA TTGATAATGG CTTTATGAGA AAACGAGAAA 960 

GCCAGTCTGT TGAAGAGGCC CTCAAAAAGC TTGGAATTCA GGTCAAAGTG ATAAATGCTG 1020 

CTCATTCTTT . CTACAATGGA i ACAACAACCC TACCAATATC AGATGAAGAT AGAACCCCAC 1080 

GGAAAAGAAT TAGCAAAACg' TTAAATATGA CCACAAGTCC TGAAGAGAAA AGAAAAATCA 1140 

TTGGGGATAC TTTTGTTAAG * ATTGCCAATG AAGTAATTGG AGAAATGAAC TTGAAACCAG 1200 

AGGAGGTTTT CCTTGCCCAA ( GGTACTTTAC GGCCTGATCT AATTGAAAGT GCATCCCTTG 1260 
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• TTGCAAGTGG 
AGT TGAGAGA '. 
GAATTTTGGG 
GTCCTGGCCT 
CTGAAACCAA 
ATACCCTATT 
AAATTACCAG 
AGGGTGACTG. 

. GGGAATCACT 
TTGTTTATAT 
TGACAACAGG 
GGGAGTCTGG 
TTGATCGGGA 
TTATTACTAG 
AGGTGGTATT' 
ATGACTTAAC ■ 



CAAAGCTGAA 
GGAGGGAAAA 
CAGAGAACTT. 
GGCAATCAGA. 
CAATATTTTG . 
ACAGAGAGTC 
TCTGCATTCA' 
TCGTTCCTAC 
TATTTTTCTG 
ATTTGGCCCA 
GGTGCTCAGT • 
GTATGCTGGG. 
CCCACTTCAA 
TGACTTCATG 
AAAGATGGTC 
ATCAAAGCCC 



CTCATCAAAA 
GTAATAGAAC 
GGACTTCCAG 
GTAATATGTG 
AAAATAGTAG 
AAAGCCTGCA' 
CTGAATGCCT 
AGTTACGTGT 
GCTAGGCTTA* 
CCAGTTAAAG 
ACTTTACGCC 
AAAATCAGCC 
AAGCAGCCTT 
ACTGGTATAC: 
ACTGAGATTA 
CCAGGAACTA 



CCCATCACAA 
CTCTGAAAGA 
AAGAGTTAGT 
CTGAAGAACC 
CTGATTTTTC 
CAACAGAAGA 
TCTTGCTGCC 
GTGGAATCTC 
TACCTCGCAT 
AACCTCCTAC. 
AAGCTGATTT 
AGATGCCGGT; 
CATGCCAGAG' 
CTGCAACACC 
AGAAGATTCC 
CTGAGTGGGA 



TGACACAGAG 
TTTTCATAAA 
TTCCAGGCAT 
TTATATTTGT 
TGCAAGTGTT 
GGATCAGGAG 
AATTAAAACT 
CAGTAAAGAT 
GTGTCACAAC 
AGATGTTACT 
TGAGGCCCAT 
GATTTTGACA 
ATCTGTGGTT 
TGGCAATGAG 
TGGTATTTCT 
GTAATAAACT 



CTCATCAGAA 

GATGAAGTGA. 

CCATTTCCAG 

AAGGACTTTC 

AAAAAGCCAC 

AAGCTGATGC 

GTAGGTGTGC 

GAACCTGACT. 

GTTAACAGAG 

CCCACTTTCT 

AACATTCTCA 

CCATTACATT • 

ATTCGAACCT 

ATCCCTGTAG* 

CGAATTATGT 

TC 



Seq ID NO: 365 Protein sequence 
Protein Accession #: AAA60331 



1 
I 

MALCNGDSKD 
APAIKEQGFR" 
KSVREDGVFN 
SKKLYGAQFH 
VLLSGGVDST 
HSFYNGTTTL 
EVFLAQGTLR 
ILGRELGLPE 
TLLQRVKACT 
ESLIPLARLI 
ESGYAGKISQ 
WLKMVTEIK 



111 
I 

ENAGGDLKDG 
AIIISGGPNS 
1SVDNTCSLF 
PEVGLTENGK 
VCTALLNRAL 
PISDEDRTPR. 
PDLIE5ASLV 
ELVSRHPFPG - 
TEEDQEKLMQ. 
PRMCHNVNRVr 
MPVILTPLHF 
KIPGISRIMY 



21 
I 

HHHYEGAWI 
VYAEDAPWFD 
RGLQKEEWL 
VILKNFLYDI- 
NQEQVIAVHI 
KRISKTLNMT 
ASGKAELZKT 
PGLAIRVICA 
ITSLHSLNAF 
VYIFGPPVKE 
DRDPLQKQPS 
DLTSKPPGTT 



31 
I 

LDAGAQYGKV 
PAIFTIGKPV 
LTHGDSVDXV 
AGCSGTFTVQ 
DNGFMRKRES 
TSPEEKRKII-' 

hhndtelirk 
eepyickdfp' 
llpiktvgvq; 

PPTDVTPTFIi 
CQRSWIRTF 



41 
I 

IDRRVRELFV 
LGICYGMQMM 
ADGFKWARS 
NRELECIREI 
QSVEEALFCKL 
GDTFVKIANE 
LREEGKVZEP 
ETNNILKIVA" 
GDCRSYSYVC 
TTGVLSTIiRQ 
ITSDFMTGIP 



51 
I 

QSEIPPLETP 
NKVFGGTVHK 
GMTVAGIANE 
KERVGTSKVL" 
GIQVKVINAA 
VIGEMNLKPE- 
LXDFHKDEVR- 
DFSASVKKPH- 
GISSKDEPDW 
ADFEAHNILR 
ATPGNEIPVE 



Seq ID NO: 366 DNA sequence 
Nucleic Acid Accession #: NM_004219 
Coding sequence: 46-654 



1 
I 

GCGGCCTCAG 
TATGTTGATA 
CTGGGGTCTG 
TTTGGCAAAA 
ACTGTCAACA 
CCAAGCTTTT 
GCCTCAGATG 
GAGAGTTTTG 
CTCATGATCC 
CCTGTGAAGA" 
CTGTCGACCC 
TAGTGCTTCA 
AAAAAAAA 



11 
I 

ATGAATGOGG 
AGGAAAATGG 
GACCTTCAAT. 
CGTTCGATGC 
GAGCTACAGA 
CTGCCAAAAA 
ATGCCTATCC 
ACCTGCCTGA 
TTGACGAGGA 
TGCCCTCTCC 
TGGATGTTGA 
GAGTTTGTGT 



21 
I 

CTGTTAAGAC 
AGAACCAGGC 
CAAAGCCTTA 
CCCACCAGCC 
AAAGTCTGTA 
GATGACTGAG 
AGAAATAGAA 
AGAGCACCAG 
GAGAGAGCTT 
ACCATGGGAA 
ATTGCCACCT 
GTATTTGTAT 



31 
I 

CTGCAATAAT 
ACCCGTGTGG 
GATGGGAGAT 
TTACCTAAAG 
AAGACCAAGG 
AAGACTGTTA 
AAATTCTTTC 
ATTGCGCACC 
GAAAAGCTGT 
TCCAATCTGT 
GTTTGCTGTG 
TAATAAAGCA 



41 
I 

CCAGAATGGC 
TTGCTAAGGA 
CTCAAGTTTC 
CTACTAGAAA 
GACCCCTCAA 
AAGCAAAAAG 
CCTTCAATCC 
TCCCCTTGAG 
TTCAGCTGGG 
TGCAGTCTCC 
ACATAGATAT 
TTCTTCAACA 



51 

I 

TACTCTGATC 
TGGGCTGAAG. 
AACACCACGT 
GGCTTTGGGA- 
ACAAAAACAG 
CTCTGTTCCT 
TCTAGACTTT 
TGGAGTGCCT 
CCCCCCTTCA 
TTCAAGCATT 
TTAAATTTCT 
GAAAAAAAAA 



Seq ID NO: 367 Protein sequence 
Protein Accession 8: NP_004210 



11 



21 



31 



SI 



41 

I I I ! I I 

MATLIYVDKE NGBPGTRWA KDGLKLGSGP SIKALDGRSQ VSTPRFGKTF DAPPALPKAT 
RKALGTVNRA TEKSVKTKGP LKQKQPSFSA KKMTEKTVKA KSSVPASDDA YPEIEKFFPF 
NPLDFESFDL PEEHQIAHLP LSGVPLMILD EERELEKLFQ LGPPSPVKMP SPPWESNLLQ 
SPSSILSTLD VELPPVCCDI DI 



Seq ID NO:' 368 DNA sequence 
Nucleic Acid Accession #: NM_000597 
Coding sequence: . 118-1104 



1 ' 
I 

ATTCGGGGCG 
CCTGCCCGCC 
CTGCCGAGAG 
CCGCTGCTGC 
CTGTTCCGCT 
GCGCCGCCCG 
GTCCGGGAGC 
GGCGTCTACA 
CTGCCCCTGC 
TATGGCGCCA 
GTGGAGAACC 
AAGCCCCTCA 
CACCGGCAGA 
CGACCACCCC. 



11 

I 

AGGGAGGAGG 
CGCCCGCTCG 
TGGGCTGCCC 
TGCTGCTACT. 
GCCOGCCCTG 
CCGCGGTGGC 
CGGGCTGCGG 
CCCCGCGCTG 
AGGCGCTGGT 
GCCCGGAGCA 
ACGTGGACAG 
AGTCGGGTAT 
TGGGCAAGGG 
CTGCCAGGAC 



21 
I 

AAGAAGCGGA 
CTCGCTCGCC 
OGCGCTGCCG 
GGGCGCGAGT 
CACACCCGAG 
CGCAGTGGCC 
CTGCTGCTCG 
CGGCCAGGGG 
CATGGGCGAG 
GGTTGCAGAC 
CACCATGAAC 
GAAGGAGCTG 
TGGCAAGCAT 
TCCCTGCCAA 



31 
I 

GGAGGCGGCT 
CGCCGCGCCG 
CTGCCGCCGC 
GGCGGCGGCG 
CGCCTGGCCG 
GGAGGCGCCC 
GTGTGCGCCC 
CTGCGCTGCT 
GGCACTTGTG 
AATGGCGATG 
ATGTTGGGCG 
GCCGTGTTCC 
CACCTTGGCC 
CAGGAACTGG 



41 

1 

CCCGCTCGCA 
CGCTGCCGAC 
CGCCGCTGCT 
GCGGGGCGCG 
CCTGCGGGCC 
GCATGCCATG 
GG CTGGAGGG 
ATCCCCACCC 
AGAAGCGCCG 
ACCACTCAGA 
GGGGAGGCAG 
GGGAGAAGGT 
TGGAGGAGCC 
ACCAGGTCCT 



51 
I 

GGGCCGTGCA 
CGCCAGCATG 
GCCGCTGCTG 
CGCGGAGGTG 
CCCGCCGGTT 
CGCGGAGCTC 
CGAGGCGTGC 
GGGCTCCGAG 
GGACGCCGAG 
AGGAGGCCTG 
TGCTGGCCGG 
CACTGAGCAG 
CAAGAAGCTG 
GGAGCGGATC 



1320 

1380 

1440 

1500 

1560 

1620. 

1680 • 

1740 

1800' 

1860 

1920 

1980 

2040 

2100 

2160 
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300 
360 
420 
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660 



60 
120 
180 
240 
300 
360 
420 
480 
540 
■ 600 
.660 
720 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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WO 02/086443* 

TCCACCATGC- GCCTTCCGGA 
CCCAACTGTG ACAAGCATGG 
CAGCGTGGGG . AGTGCTGGTG 
ACCATCCGGG GGGACCCCGA 
GTGCACACCC AGCGGATGCA 
GCCCCTCTCC AAACACCGGC 
TTCCAGTTCT GACACACGTA 
CCCGGCCTCT CTCTTCCCAG 
GAGGAAGGGG GTTGTGGTCG 
TTTATTTTTG AACCCCTGTG 



TGAGCGGGGC • CCTCTGGAGC ACCTCTACTC CCTGCACATC 
CCTGTACAAC CTCAAACAGT GCAAGATGTC TCTGAACGGG 
TGTGAACCCC AACACCGGGA AGCTGATCCA GGGAGCCCCC 
GTGTCATCTC TTCTACAATG AGCAGCAGGA GGCTTGCGGG 
GTAGACCGCA GCCAGCCGGT GCCTGGCGCC CCTGCCCCCC. 
AGAAAACGGA- GAGTGCTTGG. GTGGTGGGTG CTGGAGGATT^ 
TTTATATTTG GAAAGAGACC AGCACCGAGC TCGGCACCTC 
CTGCAGATGC CACACCTGCT CCTTCTTGCT TTCCCCGGGG 
GGGAGCTGGG. GTACAGGTTT GGGGAGGGGG AAGAGAAATT 
ATAAGATTAA AGGAAGGAAA AGT 



900 
960 
1020 

: iobo 

1140 
1200 
1260 
1320 
1380 



PCI7US02/12476- 



Seq ID NO: 369 Protein sequence 
Protein Accession #: NP_000S88 

1 ' 11 " 21 31 41 51 . 

II! I I I 

MLPRVGCPAL PLPPPPLLPL LPLLLLLLGA SGGGGGARAE VLFRCPPCTP ERLAACGPPP 
VAPPAAVAAV. AGGARMPCAE LVREPGCGCC SVCARLEGEA CGVYTPRCGQ GLRCYPHPGS 
ELPl»QALVMG EGTCEKRRDA EYGASPEQVA DNGDDHSEGG • LVENHVDSTM NMLGGGGSAG' 
RKPUCSGWXE LAVFREKVTE" QHRQMGKGGK HHW3LEEPKK LRPPPARTPC QQEU5QVLER 
ISTMRLPDER- GPLEHLYSLH IPNCDKHGLY NLKQCKMSLN GQRGECWCVN PNTGKLIQGA 
PTIRGDPECH LFYNEQQEAC GVKTQRMQ 



Seq ID NO: 370 DNA sequence 
Nucleic Acid Accession &: NMJJ04264 
Coding sequence: 6-440 



1 .11 21 31 41 

I I I 1 1 

•GGAACATGGC GGATCGGCTC ACGCAGCTTC AGGACGCTGT GAATTCGCTT 
TTTGTAATGC CATTGGAGTA TTGCAGCAAT GTGGTCCTCC TGCCTCTTTC 
AGACAGCAAT TAACAAAGAC CAGCCAGCTA ACCCTACAGA AGAGTATGCC 
CAGCACTGAT TGCACGAACA GCAAAAGACA TTGATGTTTT GATAGATTCC 
AAGAATCTAC AGCTGCTTTA CAGGCTGCTA GCTTGTATAA GCTAGAAGAA 
AAGCTGCTAC ATGTGTGGAG GATGTTGTTT ATCGAGGAGA CATGCTTCTG 
AAAGCGCACT TGCTGATATT GCACAGTCAC AGCTGAAGAC' AAGAAGTGGT 
AGTCTCTTCC . AGACTCATAG CATCAGTGGA TACCATGTGG" CTGAGAAAAG 
GTGCCATTAA GAATTCTGCA TCAGACTTAG AT ACAAG CCT TACCAACAAT 
TTAAACACTA TGACACATTA CCTTTTTAGC TATTTTTAAT AGTCTTCTAT 
GATAAGCTTA TAAATCATGA TTGAATCAGC TTTAAAGCAT CATACCATCA 
GAGTGAAATT ATTAAGGCAT- GTAATACATT AATGAACATA ATATAAGGAA 
AATTCTGTTA TGACATAATT TATGTCTCCA TTTTGTTGTA TTGGCCAGTA 
C 

Seq ID NO: 371 Protein sequence 
protein Accession NP__004255 

1 11 21 '31 41 51 

I I I I I I 

MADRIiTQLQD AVNSLADQFC NAIGVLQQCG PPASFNNIQT AINKDQPANP TEEYAQLFAA 
L1ARTAKDID VLIDSLPSEE STAALQAASIi YKLEEENHEA ATCVEDWYR GDMLLEKIQS 
ALADIAQSQL KTRSGTHSQS LPDS 

Seq ID NO: 372 DNA sequence 
Nucleic Acid Accession 8: AJ271091 
Coding sequence: 1-1113 



51 
I 

GCAGATCAGT 

AATAATATTC. 

CAGCTTTTTG. 

TTACCCAGTG ' 

GAAAACCATG; 

GAGAAGATAC 

ACCCATAGCC 

AACTGTTTGA 

TACAGAAACA 

TTTCACTCTT 

TTTTTTAACT 

ACATATGTAA 

CTTTTACAAT 



ATGGAGAATC 
CTGCGCGTGG 
CATTTCAAAG 
TTCTTAGACC 
ACAGTACAGA 
CTGTTTTTGG 
AGAGCTAAGG 
ACTCTTACAA 
TTCTCCTGGA 
TATGACACAT 
GAAACTATCA 
CTTCTTGGAA 
AAAGCTGTGG : 
TTCTACATGC 
CTGTGGATTC 
ATTCCAATAT 
AAAGTTAGAT 
ATAAATTTTC 
CATGCCTGTG 



11 
I 

AGGTGTTGAC 
AGCTGAGTGA 
CTCAAGGACA 
TTGTGAAACC 
AGAAAGTGAG 
CTCCTGACTT 
AAGAAGAGCG 
ACTTAAGGAA 
TCTTTGTCAA 
TCCATACTGT 
ATGCAGCAAT 
GAAATTTTAT 
TTTTCTTTGT 
TGACGTGCAT 
CCTTATATCC 
TCAATGAGAC 
TTTCCTTTTT 
. GTCACCTTT A 
ATCCCAGCGC 



21 
I 

GCCGCATGTC 
CGTACAGAAC 
TGGTGCCAAA 
AGAGCCTGTT 
TCAGTGGTGG 
TGATCGTTGG 
CCTAAATAAA 
AGGATACCTG 
CCTGACTGTG 
GGCTGACATG 
TGGAGTCACT 
TTTGTTTATC 
GTTTTATTTG 
TGACATGGAT 
ACTGGGATGT 
CGGACGATTC 
TCTTCAGATT 
TAAACAGCGC 
TTTGGGAGGC 



31 

I 

TACTGGGCTC 
CCTGCCATCA 
GGAGACAATG 
TACAAACTGA 
GAGAGACTCA 
CTGGATGAAT 
CTCCGACTGG 
TTTATGTATA 
CGATTCTGTA 
ATGTATTTCT 
ACGTCACCGG 
ATCTTTGGCA 
TGGAGTGCAA 
TGGAAGGTGC 
TTGGCGGAAG 
AGTTTCACAT 
TATCTTATAA 
AGACTGAAAA 
TGA 



41 
I 

AGCGACACCG 
GCATCACTGA 
TCTATGAATT 
CCCAGAGGCA 
CAAAGCAGGA 
CTGATGCGGA 
AAAGCGAAGG 
ATCTTGTGCA 
TCTTGGGAAA 
GCCAGATGCT 
TGCTGCCTTC 
CCATGGAAGA 
TTGAAATTTT 
TCACATGGCT 
CTGTCTCAGT 
TGCCATATCC 
TGATATTTTT 
TGAGGGCAGG 



51 
I 

CG AG CTATAT 

aaacgtgctg 
tcacctggag: 
ggtaaacatt 
aaagcgacca 

AATGGAGCTC 
CTCTCCTGAA 
ATTCTTGGGA 
AGAGTCCTTT- 
GGCAGTTGTG 
TCTGATCCAG 
AATGCAGAAC 
CAGGTACTCT 
TCGTTACACT 
GATTCAGTCC 
AGTGAAAATC 
AGGTTTATAC 
. CGCAGTGGCT 



Seq ID NO: 373 Protein sequence 
Protein Accession #: CAB69070 

1 li. 21 31 41 51 

MENQVLTPHV -YWAQRHRELY LRVELSDVQN PAISITENVL HFKAQGHGAK GDNVYEPHLE 
PLDLVKPEPV YKLTQRQVNI TVQKKVSQWW , ERLTKQEKRP LFIAPDFDRW LDESDAEMEL 
RAKEEERLNK LRLESEGSPE TLTNLRKGYL FWYNLVQFLG FSWIFVNLTV RFCILGKESP 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
780 
840 
900 
960' 
1020 
1080 



60 
120 
180" 
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YDTFHTVADM MYFCQMLAVV STINAAiGVT -TSPVLPSLIQ LLGRNFILFI IFGTHEEKQN 
KAWFFVFYL WSAIEIFRYS PYMLTCIDMD WKVLTWLRYT LWIPLYPLGC LAEAVSVIQS 
IPIFNETGRF SFTLPYPVKI KVRFSFFLQI .YLIMIFLGLY INFRHLYKQR MJ04RAGAVA . 
HACDPSALGG 



Seq ID NO: 374 DNA sequence 
Uucleic Acid Accession S: NMJ)i6395 
Coding sequence: 1-1113 



PCT/US02/12476 



240 
300 
360 



ATGGAGAATC 
CTGCGCGTGG. 
CATTTCAAAG 
TTCTTAGACC 
ACAGTACAGA 
CTGTTTTTGG 
AGAGCTAAGG 
ACTCTTACAA 
TTCTCCTGGA 
TATGACACAT 
GAAACTATCA 
CTTCTTGGAA 
AAAGCTGTGG 
TTCTACATGC 
CTGTGGATTC 
ATTCCAATAT 
AAAGTTAGAT- 
ATAAATTTTC 
CATGCCTGTG 



11 
I 

AGGTGTTGAC 
AGCTGAGTGA 
CTCAAGGACA 
TTGTGAAACC 
AGAAAGTGAG 
CTCCTGACTT 
AAGAAGAGCG 
ACTTAAGGAA 
TCTTTGTCAA 
TCCATACTGT 
ATGCAGCAAT 
GAAATTTTAT 
TTTTCTTTGT 
TGACGTGCAT 
CCTTATATCC 
TCAATGAGAC 
TTTCCTTTTT 
GTCACCTTTA 
ATCCCAGCGC 



21 
I 

GCCGCATGTC 
CGTACAGAAC 
TGGTGCCAAA 
AGAGCCTGTT 
TCAGTGGTGG 
TGATCGTTGG 
CCTAAATAAA 
AGGATACCTG 
CCTGACTGTG 
GGCTGACATG 
TGGAGTCACT 
TTTGTTTATC 
GTTTTATTTG 
TGACATGGAT 
ACTGGGATGT 
CGGACGATTC 
TCTTCAGATT 
TAAACAGCGC 
TTTGGGAGGC 



31 
I 

TACTGGGCTC 
CCTGCCATCA 
GGAGACAATG 
TACAAACTGA 
GAGAGACTCA 
CTGGATGAAT 
CTCCGACTGG 
TTTATGTATA 
CGATTCTGTA 
ATGTATTTCT 
ACGTCACCGG 
ATCTTTGGCA 
TGGAGTGCAA 
TGGAAGGTGC 
TTGGCGGAAG 
AGTTTCACAT 
TATCTTATAA 
AGACTGAAAA 
TGA 



41 
I 

AGCGACACCG 
GCATCACTGA 
TCTATGAATT 
CCCAGAGGCA 
CAAAGCAGGA 
CTGATGCGGA 
AAAGCGAAGG 
ATCTTGTGCA 
TCTTGGGAAA 
GCCAGATGCT 
TGCTGCCTTC 
CCATGGAAGA 
TTGAAATTTT 
TCACATGGCT 
CTGTCTCAGT 
TGCCATATCC 
TGATATTTTT 
TGAGGGCAGG 



51 
I 

CGAGCTATAT 
AAACGTGCTG 
TCACCTGGAG 
GGTAAACATT 
AAAGCGACCA- 
AATGGAGCTC 
CTCTCCTGAA 
ATTCTTGGGA 
AGAGTCCTTT 
GGCAGTTGTG 
TCTGATCCAG 
AATGCAGAAC" 
CAGGTACTCT 
TCGTTACACT 
GATTCAGTCC 
AGTGAAAATC 
AGGTTTATAC 
CGCAGTGGCT 



60 
120 
180 
240 
300 
360 
420 
4BO 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



Seq ID NOt 375 Protein sequence 
Protein Accession #: NP_057479 

1 11 21 31 41 51 

:i i i i.i i 

MENQVLTPHV. YWAQRHRELY LRVELSDVQN PAISITENVL HFKAQGKGAK GDNVYEFKLE 60 

FLDLVKPEPV YKLTQRQVNI • TVQKKVSQWW ERLTKQEKRP LFLAPDFDRW LDESDAEMEL 120 

RAKEEERLNK LRLESEGSPE TLTNLRKGYL FMYNLVQFLG FSWIFVNLTV RFCILGKESF 180 

YDTPHTVADM MYFCQMLAW ETINAAIGVT TSPVLPSLIQ LLGKNFILFI IFGTMEEMQN- 240 

KAWFFVFYL WSAIEIFRYS FYMLTCIDMD WKVLTWLRYT LWIPLYPLGC LVEAVSVIQS 300 

IPIFNETGRF SFTLPYPVKI KVRFSFPLQI YLIMIFLGLY INFRHLYKQR RRRYGKKRKR 3 60 
STKKKDLDGF LPV 

Seq ID NO: 376 DNA sequence 
Nucleic Acid Accession fh NM_005987 
Coding sequence: 1-270 

1 11 21 31 ,41 51 

I I I I I I ' 

ATGAATTCTC AGCAGCAGAA GCAGCCTTGC ACCCCACCCC CTCAGCCTCA GCAGCAGCAG 60 
GTGAAACAAC CTTGCCAGCC TCCACCCCAG GAACCATGCA TCCCCAAAAC CAAGGAGCCC 120 
TGCCAACCCA AGGTGCCTGA GCCCTGCCAC CCCAAAGTGC CTGAGCCCTG CCAGCCCAAG 180 
ATTCCAGAGC CCTGCCAGCC CAAGGTGCCT GAGCCCTGCC CTTCAACGGT CACTCCAGCA 240 
CCAGCCCAGC AGAAGACCAA GCAGAAGTAA 

Seq ID NO: 377 Protein sequence 
Protein Accession #: NP_005978 

1 11 21 31 41 " 51 

I I III I 

MNSQQQKQPC TPPPQPQQQQ VKQPCQPPPQ EPCIPKTKEP CQPKVPEPCH PKVPEPCQPK 60. 
IPEPCQPKVP EPCPSTVTPA PAQQKTKQK 

Seq ID NO: 378 DNA sequence 
Nucleic Acid Accession #: NM_002105 
Coding sequence: 74-505 

1 11 21 31 41 51 

I I I I ' . ' I * I 

\ACAGCAGTTA CACTGCGGCG GGCGTCTGTT CTAGTGTTTG AGCCGTCGTG CTTCACCGGT ' 60 

CTACCTCGCT AGCATGTCGG GCCGOGGCAA GACTGGCGGC AAGGCCCGCG CCAAGGCCAA 120 

GTCGCGCTCG TCGCGCGCCG GCCTCCAGTT CCCAGTGGGC CGTGTACACC GGCTGCTGCG 180 

GAAGGGCCAC TACGCCGAGC GCGTTGGCGC CGGOGCGCCA k GTGTACCTGG CGGCAGTGCT 240 

GGAGTACCTC ACCGCTGAGA" TCCTGGAGCT. GGCGGGCAAT GCGGCCCGCG ACAACAAGAA 300 

GACGCGAATC ATCCCCCGCC ACCTGCAGCT GGCCATCCGC AACGACGAGG AGCTCAACAA . 360 

GCTGCTGGGC GGCGTGAOGA TCGCCCAGGG AGGCGTCCTG CCCAACATCC AGGCCGTGCT 420 

GCTGCCCAAG AAGACCAGCG CCACCGTGGG GCCGAAGGCG CCCTCGGGCG GCAAGAAGGC 480 

CACOCAGGCC TCCCAGGAGT ACTAAGAGGG CCCGCGCCGC GGCCGGCCGC CCCAGCTCCC 540 

CATGCCACCA CAAAGGCCCT TTTAAGGGCC ACCACCGCCC TCATGGAAAG AGCTGAGCCG 600 

CTTCAGACTG CGGGGCAAGC GGGCCGCGGC TCCCTTCCCC TCCCCTCCCC TCGCCCGCCT . 660 

TCGCCGCCCG GCCTCGAGTC CCCGCCCGCC CCCGCTCCCG TCCCGCACCG CCTGCCGCGT- . 720 

CGGCCTCGGG CCTGCCCTGT CCGCCGTCCG CCCTCCGGTA GGGTTCGGGC CTTCOGGATG 780 

CGGCTTGGGC GCTCTTCGGG GACCTCCGTG GCGCGGAAGA CCCGAGCCTG CCGGGGGGAG . 840 



326 



10 



WO 02/086443 - : 

GCCGGOGGCG CCGCACCTGC CCGCCTCGGC" GTTCGTGACT CAGCCGCCCC ATCCCGAGTC 
GCTAAGGGGC TGCGGGGAGG COGCAGCACC TTCTGGAAGA CTTGGCCTTC CGCTCTGACG 
CAGGGCCGAG GTGGGCAGTC CAGGCCGAGA GCCGGCGGCC CTGAAGGTGA GTGAGGCCCT 
CGGCAGCTGC AGCCGGGGTG TCTGGTACCC CCCCGGCGTG GTGCTTAGCC CAGGACTTTC 
AGACGGCCGC TGGCCGGGAG" GCTTT GG TG G ' GAGAGACGOG ATCGCCGATT TOGGTCTGGC 
GCCCCTTCTG CGGCCGGGAC CCAGGCCTTT CACATCAGCT CTCCCTCCAT CTTCATTCAT 
AGGTCTGCGC TGGGGCCGGG ACGAAGCACT TGGTAACAGG CACATCTTCC TCCCGAGTGA 
CTCCCTCCTA' GGAGGACATT. TAGGGGAGGG CAGAGGCCTG CAGTTTGGCT TCACGGCTGG 
CTATGTGGAC ' AGCAAGAGTC GTTTTGCGGA ACGCGACTGG CAGCCAGGCC TGTCGGGCCC 
CCGAOGCCGC CCCATTTCCC TTCCAGCAAA CTCAACTCGG CAATCCAAGC ACCTAGATAC 
CAGCACAAGT CGGTTAATCC. CTGTCTGGAC TGAGCCTCCG TTGGCTTCTG AACTGGAATT 
CTGCAGCTAA CCCTTCCACG ACTAGAACCT TAGGCATTGG GGAGTTTTAG ATGGACTAAT 
TTTATTAAAG GATTGTTTTT TTTTT 



PCT/US02/12476' 



15 Seq ID m'z 379 Protein sequence 

Protein Accession 8: . NP_002096 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 . 
1360 
1440 
1500 
1560 



20 



ii 



41 



51 



21 31 

1 I I' I I I 

MSGRGKTGGK ARAKAKSRSS RAGLQFPVGR* VHRLLRKGHY AKRVGAGAPV YLAAVLEYLT 
AEILELAGNA ARDttKKTRI Z PRHLQLAIRN DESLNKUiGG VTIAQGGVLP NIQAVLLPKX 
TSATVGPKAP SGGKKATQAS QEY 



60 
120 



25 Seq ID NO:- 380 DNA sequence 

Nucleic Acid Accession 8: AL136942 
Coding sequence: 184-864 

1 11 21 31 41 51 

30 | | I I I I . 

ACGCGTCCGG CAGAAGCTCG GAGCTCTCGG GGTATCGAGG AGGCAGGCCC GCGGGCGCAC 60 

GGGCGAGCGG GCCGGGAGCC GGAGCGGCGG AGGAGCCGGC AGCAGCGGCG CGGCGGGCTC 120 

CAGGCGAGGC GGTCGACGCT, CCTGAAAACT TGCGCGCGCG CTCGCGCCAC TGCGCCCGGA . 180 

GCGATGAAGA TGGTCGCGCC CTGGACGCGG TTCTACTCCA ACAGCTGCTG CTTGTGCTGC 240 

35 CATGTCCGCA" CCGGCACCAT CCTGCTCGGC. GTCTGGTATC TGATCATCAA TGCTGTGGTA 300 

CTGTTGATTT TATTGAGTGC CCTGGCTGAT CCGGATCAGT ATAACTTTTC AAGTTCTGAA 360 

CTGGGAGGTG" ACTTTGAGTT. CATGGATGAT GCCAACATGT GCATTGCCAT: TGCGATTTCT " 420 

CTTCTCATGA: TCCTGATATG TGCTATGGCT ACTTACGGAG GGTACAAGCA ACGCGCAGCC 480 

TGGATCATCC CATTCTTCTG TTACCAGATC TTTGACTTTG CCCTGAACAT GTTGGTTGCA 540 

40 ATCACTGTGC TTATTTATCC AAACTCCATT CAGGAATACA TACGGCAACT GCCTCCTAAT 600 

TTTCCCTACA GAGATGATGT CATGTCAGTG AATCCTACCT GTTTGGTCCT TATTATTCTT 660 

• CTGTTTATTA GCATTATCTT; GACTTTTAAG GGTTACTTGA TTAGCTGTGT TTGGAACTGC 720 

TAGCGATACA TCAATGGTAG: GAACTCCTCT GATGTCCTGG TTTATGTTAC CAGCAATGAC 780 

ACTACGGTGC TGCTACCCCC GTATGATGAT GCCACTGTGA ATGGTGCTGC CAAGGAGCCA 840 

45 CCGCCACCTT- ACGTGTCTGC CTAAGCCTTC AAGTGGGCGG AGCTGAGGGC AGCAGCTTGA 900 

CTTTGCAGAC ATCTGAGCAA TAGTTCTGTT ATTTCACTTT TGCCATGAGC CTCTCTGAGC 960 

TTGTTTGTTG CTGAAATGCT ACTTTTTAAA ATTTAGATGT TAGATTGAAA ACTGTAGTTT 1020 

TCAACATATG CTTTGCTAGA ACACTGTGAT AGATTAACTG TAGAATTCTT CCTGTACGAT 1080 

TGGGGATATA ACGGGCTTCA CTAACCTTCC CTAGGCATTG AAACTTCCCC CAAATCTGAT 1140 

50 GGACCTAGAA GTCTGCTTTT GTACCTGCTG GGCCCCAAAG TTGGGCATTT TTCTCTCTGT 1200 

TCCCTCTCTT TTGAAAATGT AAAATAAAAC CAAAAATAGA CAACTTTTTC TTCAGCCATT 1260 

CCAGCATAGA GAACAAAACC TTATGGAAAC AGGAATGTCA ATTGTGTAAT CATTG TTCTA 1320 

ATTAGGTAAA TAGAAGTCCT. TATGTATGTG TTACAAGAAT TTCCCCCACA ACATCCTTTA 1380 

TGACTGAAGT. TCAATGACAG: TTTGTGTTTG GTGGTAAAGG ATTTTCTCCA TGGCCTGAAT 1440 

55 TAAGACCATT AGAAAGCACC AGGCCGTGGG AGCAGTGACC ATCTACTGAC TGTTCTTGTG 1500 

GATCTTGTGT CCAGGGACAT GGGGTGACAT GCCTCGTATG TGTTAGAGGG TGGAATGGAT 1560 

GTGTTTGGCG CTGCATGGGA TCTGGTGCCC CTCTTCTCCT- GGATTCACAT CCCCACCCAG. 1620 

GGCCCGCTTT TACTAAGTGT TCTGCCCTAG; ATTGGTTCAA" GGAGGTCATC CAACTGACTT 1680 

TATCAAGTGG AATTGGGATA TATTTGATAT ACTTCTGCCT AACAACATGG AAAAGGGTTT: 1740 

60 TCTTTTCCCT GCAAGCTACA TCCTACTGCT TTGAACTTCC AAGTATGTCT AGTCACCTTT 1800 

TAAAATGTAA ACATTTTCAG AAAAATGAGG ATTGCCTTCC TTGTATGCGC TTTTTACCTT 1860 

GACTACCTGA ATTGCAAGGG ATTTTTATAT ATTCATATGT TACAAAGTCA GCAACTCTCC 1920 

XGTTGGTTCA TTATTGAATG TGCTGTAAAT TAAGTCGTTT GCAATTAAAA CAAGGTTTGC 1980 
CCACATCCAA AAAAAAAAAA AAAAA 



65 



75 



Seq ID NO: 381 Protein sequence 
Protein Accession #: CAB66876 



Seq ID NO: 382 DNA sequence 
Nucleic Acid Accession #: NMJJ02510 
Coding sequence: 92-1774 



1 11 21 31 41 51 

70 | ! 1.1 i I 

MKMVAPWTRF YSNSCCLCCH VRTGTILLGV WYIillNAWL LILLSALADP DQYNFSSSEL 60 

GGDFEFMDDA NMCIAIAISL LMILICAMAT YGAYKQRAAW IIPFFCYQIF DFALNMLVAI 120 

TVLIYPNSIQ EYIRQLPPNF PYRDDVMSVN PTCLVLIILL FISIILTFKG YLISCVWNCY. 180 
RYINGRNSSD VLVYVTSNDT TVLLPPYDDA TVNGAAKEPP PPYVSA 



80 1 - H : 21- 31 41 51 

CAGATGCCAG AAGAACACTG TTGCTCTTGG TGGACGGGCC CAGAGGAATT CAGAGTTAAA 60 

CCTTGAGTGC CTGCGTCCGT GAGAATTCAG CATGGAATGT CTCTACTATT TCCTGGGATT 120 

TCTGCTCCTG GCTGCAAGAT TGCCACTTGA TGCCGCCAAA CGATTTCATG ATGTGCTGGG 180- 

85 CAATGAAAGA CCTTCTGCTT ACATGAGGGA GCACAATCAA TTAAATGGCT GGTCTTCTGA 240 

' TGAAAATGAC TGGAATGAAA AACTCTACCC AGTGTGGAAG CGGGGAGACA TGAGGTGGAA 300 

AAACTCCTGG AAGGGAGGCC GTGTGCAGGC GGTCCTGACC AGTGACTCAC CAGCCCTCGT 360 



327 
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GGGCTCAAAT ATAACATTTG CGGTGAACCT GATATTCCCT AGATGCCAAA AGGAAGATGC 420 

CAATGGCAAC ATAGTCTATG AGAAGAACTG CAGAAATGAG GCTGGTTTAT CTGCTGATCC 480 

ATATGTTTAC AACTGGACAG CATGGTCAGA 'GGACAGTGAC GGGGAAAATG GCACCGGCCA 540 

AAGCCATCAT AACGTCTTCC CTGATGGGAA ACCTTTTCCT CACCACCCCG GATGGAGAAG. 600 

ATGGAATTTC ATCTACGTCT TCCACACACT TGGTCAGTAT TTCCAGAAAT TGGGACGATG 660 

rrCAGTGAGA GTTTCTGTGA ACACAGCCAA TGTGACACTT GGGCCTCAAC TCATGGAAGT 720 

GACTGTCTAC AGAAGACATG GACGGGCATA TGTTCCCATC GCACAAGTGA AAG ATGTGTA 780 

CGTGGTAACA' GATCAGATTC CTGTGTTTGT ' GACTATGTTC CAGAAGAACG ATCGAAATTC 840 

ATCCGACGAA ACCTTCCTCA- AAGATCTCCC CATTATGTTT GATGTCCTGA TTCATGATCC . 900 

TAGCCACTTC CTCAATTATT CTACCATTAA CTACAAGTGG AGCTTCGGGG ATAATACTGG 960 
CCTGTTTGTT ■ TCCACCAATC ATACTGTGAA TCACACGTAT GTGCTCAATG . GAACCTTCAG . 1020 

CCTTAACCTC ACTGTGAAAG. CTGCAGCACC AGGACCTTGT CCGCCACCGC CACCACCACC 10 SO 

CAGACCTTCA AAACCCACCC. CTTCTTTAGG ACCTGCTGGT GACAACCCCC TGGAGCTGAG 1140 

TAGGATTCCT. GATGAAAACT G CCAGATT AA CAGATATGGC CACTTTCAAG CCACCATCAC 1200 

AATTGTAGAG. GGAATCTTAG, AGGTTAACAT CATCCAGATG ACAGACGTCC TGATGCCGGT, 1260 

GCCATGGCCT GAAAGCTCCC TAAT AGACTT . TGTCGTGACC TGCCAAGGGA GCATTCCCAC 1320 

GGAGGTCTGT ACCATCATTT* CTGACCCCAC CTGCGAGATC ACCCAGAACA CAGTCTGCAG 1380 

CCCTGTGGAT GTGGATGAGA TGTGTCTGCT GACTGTGAGA CGAACCTTCA ATGGGTCTGG 1440 

GACGTACTGT GTGAACCTCA CCCTGGGGGA TGACACAAGC CTGGCTCTCA CGAGCACCCT 1500 

GATTTCTGTT CCTGACAGAG . ACCCAGCCTC GCCTTTAAGG ATGGCAAACA GTGCCCTGAT 1560 

CTCCGTTGGC TGCTTGGCCA - TATTTGTCAC TGTGATCTCC CTCTTGGTGT ACAAAAAACA 1620 

CAAGGAATAC AACCCAATAG AAAATAGTCC TGGGAATGTG GTCAGAAGCA AAGGCCTGAG 1680 

TGTCTTTCTC AACCGTGCAA AAGCCGTGTT CTTCCCGGGA AACCAGGAAA AGGATCCGCT 1740 

ACTCAAAAAC CAAGAATTTA AAGGAGTTTC. TTAAATTTCG ACCTTGTTTC TGAAGCTCAC 1800 

TTTTCAGTGC CATTGATGTG; AGATGTGCTG GAGTGGCTAT TAACCTTTTT TTCCTAAAGA 1860 

TTATTGTTAA ATAGATATTG TGGTTTGGGG AAGTTGAATT TTTTATAGGT TAAATGTCAT 1920 

TTTAGAGATG GGGAGAGGGA TTATACTGCA GGCAGCTTCA GCCATGTTGT GAAACTGATA 1980 

AAAGCAACTT AGCAAGGCTT CTTTTCATTA TTTTTTATGT TTCACTTATA AAGTCTTAGG 2040 

TAACTAGTAG- GATAGAAACA - CTGTGTCCCG AGAGTAAGGA GAGAAGCTAC TATTGATTAG 2100 

AGCCTAACCC AGGTTAACTG" CAAGAAGAGG CGGGATACTT TCAGCTTTCC ATGTAACTGT 2160 

ATGCATAAAG CCAATGTAGT CCAGTTTCTA AGATCATGTT CCAAGCTAAC TGAATCCCAC 2220 

TTCAATACAC ACTCATGAAC TCCTGATGGA ACAATAACAG GCCCAAGCCT GTGG TATG AT 2280 

GTGCACACTT GCTAGACTCA GAAAAAATAC TACTCTCATA AATGGGTGGG AGTATTTTGG 2340 

TGACAACCTA- CTTTGCTTGG. CTGAGTGAAG GAATGATATT 1 CATATATTCA TTTATTCCAT 2400 

GGACATTTAG, TTAGTGCTTT TTATATACCA GGCATGATGC TGAGTGACAC TCTTGTGTAT 2460 

ATTTCCAAAT TTTTGTATAG TCGCTGCACA TATTTGAAAr' CATATATTAA GACTTTCCAA 2520 

AGATGAGGTC CCTGGTTTTT CATGGCAACT TGATCAGTAA GGATTTCACC TCTGTTTGTA 2580 

ACTAAAACCA TCTACTATAT GTTAGACATG ACATTCTTTT TCTCTCCTTC CTGAAAAATA 2640 
AAGTGTGGGA AGAGACAAAA AAAAAAAAA 

Seq ID NOt 383 Protein sequence 
Protein Accession #: NP_002501 

1 11 21 31 41 51 

MECLYYFLGP LLLAARLPLD AAKRFHDVLG NERPSAYMRE HNQLNGWSSD ENDWNEKLYP SO 

VWKRGDMRWK NSWKGGRVQA VLTSDSPALV GSNITFAVNL IFPRCQKEDA NGNIVYEKNC 120 

RNEAGLSADP YVYNWTAWSB DSDGENGTGQ: SHHNVPPDGK PFPHHPGWRR WNFIYVFHTL 180 

GQYFQKLGRC SVRVSVNTAN VTLGPQLMEV TVYRRHGRAY VPIAQVKDVY WTDQIPVFV 240 

TMFQKKDRNS SDETFLKDLP IMFDVLIHDP SHFLNYSTIN YKWSFGDNTG LFVSTNHTVN 300 

HTYVLNGTFS LNLTVKAAAP GPCPPPPPPP RPSKPTPSLG PAGDNPLELS RIPDENCQIN 360 

RYGHFQATIT IVEGILEVNI IQMTDVLMPV PWPESSLIDF. WTCQGSIPT EVCTIISDPT 420 

CEITQMTVCS PVDVDEMCLL TVRRTFNGSG TYCVNLTLGD DTSLALTSTL ISVPDRDPAS 480 

PLRMANSALI SVGCLAIFVT VISLLVYKKH KEYNPIENSP GNWRSKGLS VFLNRAKAVP 540 
FPGNQEKDPL LKNQEFKGVS 

Seq ID NOr 384 DNA sequence " 
Nucleic Acid Accession #: NM_001134 
Coding sequence: 48-1877 

1 11 -21- ! 31 41 51 

rCCATATTGT GCTTCCACCA CTGCCAATAA CAAAATAACT AGCAACCATG AAGTGGGTGG 60 

AATCAATTTT TTTAATTTTC CTACTAAATT TTACTGAATC CAGAACACTG CATAGAAATG 120 

AATATGGAAT AGCTTCCATA TTGGATTCTT ACCAATGTAC TGCAGAGATA AGTTTAGCTG 180 

ACCTGGCTAC CATATTTTTT GCCCAGTTTG TTCAAGAAGC CACTTACAAG GAAGTAAGCA 240 

AAATGGTGAA AGATGCATTG ACTGCAATTG AGAAACCCAC TGGAGATGAA CAGTCTTCAG 300 

GGTGTTTAGA AAACCAGCTA CCTGCCTTTC TGGAAGAACT TTGCCATGAG AAAGAAATTT 360 

TGGAGAAGTA CGGACATTCA GACTGCTGCA GCCAAAGTGA AGAGGGAAGA CATAACTGTT .420 

TTCTTGCACA CAAAAAGCCC ACTCCAGCAT CGATCCCACT TTTCCAAGTT CCAGAACCTG 480 

TCACAAGCTG TGAAGCATAT GAAGAAGACA GGGAGACATT CATGAACAAA TTCATTTATG 540 

AGATAGCAAG AAGGCATCCC TTCCTGTATG CACCTACAAT TCTTCTTTGG GCTGCTCGCT .600 

ATGACAAAAT AATTCCATCT TGCTGCAAAG CTGAAAATGC AGTTGAATGC TTCCAAACAA 660 

AGGCAGCAAC AGTTACAAAA GAATTAAGAG AAAGCAGCTT GTTAAATCAA CATGCATGTG 720 

CAGTAATGAA AAATTTTGGG ACCCGAACTT TCCAAGCCAT AACTGTTACT AAACTGAGTC 780 

AGAAGTTTAC CAAAGTTAAT- TTTACTGAAA TCCAGAAACT AGTCCTGGAT GTGGCCCATG 840 

TACATGAGCA CTGTTGCAGA GGAGATGTGC TGGATTGTCT GCAGGATGGG GAAAAAATCA 900 

TGTCCTACAT ATGTTCTCAA CAAGACACTC TGTCAAACAA AATAACAGAA TGCTGCAAAC 960 

TGACCACGCT GGAACGTGGT CAATGTATAA TTCATGCAGA AAATGATGAA AAAC CTGAAG 1020 

GTCTATCTCC AAATCTAAAC AGGTTTTTAG GAGATAGAGA TTTTAACCAA TTTTCTTCAG 1080. 

GGGAAAAAAA TATCTTCTTG GCAAGTTTTG TTCATGAATA TTCAAGAAGA CATCCTCAGC 1140 

- TTGCTGTCTC AGTAATTCTA AGAGTTGCTA AAGGATACCA GGAGTTATTG GAGAAGTGTT 1200. 

TCCAGACTGA AAACCCTCTT GAATGCCAAG ATAAAGGAGA AGAAGAATTA CAGAAATACA 1260. 

TCCAGGAGAG CCAAGCATTG GCAAAGCGAA GCTGCGGCCT CTTCCAGAAA CTAGGAGAAT 1320 

ATTACTTACA AAATGCGTTT CtCGTTGCTT ACACAAAGAA AGCCCCCCAG CTGACCTCGT 1380 

CGGAGCTGAT GGCCATCACC AGAAAAATGG CAGCCACAGC AGCCACTTGT TGCCAACTCA 1440 

GTGAGGACAA ACTATTGGCC TGTGGCGAGG GAGCGGCTGA CATTATTATC GGACACTTAT 1500 
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GTATCAGACA TGAAA7GACT. CCAGTAAACC CTGGTGTTGG 
ATGCCAACAG . GAGGCCATGC TTCAGCAGCT TGGTGGTGGA 
CATTCTCTGA TGACAAGTTC ATTTTCCATA- AGGATCTGTG 
"TGCAAACGAT GAAGCAAGAG TTTCTCATTA ACCTTGTGAA 
AGGAACAACT TGAGGCTGTC ATTGCAGATT. TCTCAGGCCT 
GCCAGGAACA GGAAGTCTGC . TTTGCTGAAG. AGGGACAAAA 
CTGCTTTGGG AGTTTAAATT " ACTTCAGGGG: AAGAGAAGAC 
TGTGAACTTT TCTCTTTAAT TTTAACTGAT TTAACACTTT 
AAAGACTTTT ATGTGAGATT • TCCTTATCAC AGAAATAAAA 

Seg ID NO: 385 Protein sequence. 
Protein Accession #: NP_001125 



PCTAJS02/12476 



CCAGTGCTGC 
TGAAACATAT 
CCAAGCTCAG 
GCAAAAGCCA 
GTTGGAGAAA" 
ACTGATTTCA 
AAAACGAGTC. 
TTGTGAATTA 
TATCTCCAAA 



1 
I 

MKWVESIFLI 
KEVSKMVKDA 
RHNCFLAHKK 
WAARYDKIIP 
TKLSQKFTKV 
ECCKLTTLER 
RHPQLAVSVI 
KLGBYYLQNA 
IGHLiCIRKEM 
QGVALQTMKQ 
SKTRAALGV 



11 
I 

FLLNFTESRT 
LTAIEKPTGD". 
PTPASIPLFQ 
SCCKAENAVE 
NFTSIQKLVL 
GQCIIHAEND. 
LRVAKGYQEL; 
FLVAYTKKAP. 
TPVNPGVGQC 
EFLINLVKQK 



21 

I- 

lhrneygias 
eqssgclenq: 
vpepvtscea- 
cpqtkaatvt: 
dvahvhehcc 
ekpeglspnl 
lekcfqtenp 

QLTSSELMAX. 
CTSSYANRRP 
PQITEEQLEA 



31 
I 

ILDSYQCTAE 
LPAFLEELCH 
YEEDRETFMN'' 
KEhKESSLLN. 
RGDVLDCLQD- 
NRFLGDRDFNr 
LECQDKGEEE 
TRKMAATAAT 
CFSSLWDET 
VIADFSGLLE 



41 
I 

ISLADLATIF 
EKEIIiEKYGH 
KFIYEIARRH' 
OHACAVMKNF 
GEKIMSYICS 
QFSSGEKNIF 
LQKYIQESQA 
CCQLSEDKLL 
YVPPAFSDDK 
KCCQGQEQEV 



Seq ID NO: 38S DNA sequence 

Nucleic Acid Accession #: NM_002205.1 

Coding sequence: 1..3149 



1 

I. 

ATGGGGAGCC 
CGCCGACXrCC 
GGCTTCAACT- 
GGATTCTCAG'- 
CCCAAGGCTA 
TGGGGTGCCA 
CTGGAGTCCT 
TGGTTOGGGG 
AGCTGGCGCA 
GATAACTTCA 
GGACAGGGTT 
TTAGGTGGAC 
ATTGCAGAAT 
CGCCAGGCCA 
TTCAGTGGTG 
GGCTATGTCA 
CAGATGGCCT 
GATGACTTGC 
GAGGTGGGCA 
CTTACCCTCA 
GACCTGGACC 
CAGCAGGGAG 
CAGGTTCTGC 
CGAGGAGGCC 
GTGGACAAGG 
ATCTTCCCCG 
GCCTGCATCA 
GGTTTCACAG 
CTGTTCCTGG 
CGAGAGGATT 
CTCTCGCCGA 
CACGGCCTCA 
ATCTTGCTGG 
GGGGAGCAGA 
CAGAATGTGG 
GCTGAGTACT 
TTTGCCGTGA 
GCCAGTCTGT 
ATCCAGTTTG 
TCCTTTCGGC 
GAGGCAGTGC 
GACCTGGGAC 
AGCCAGGGTG 
GTGACCAGAG 
GAGTTGGATC 
TCTGCTTCCT 
TGTGAGCTCG 
TGGGCCAAGA 
TACAAAGCCC 
CAGGTGGCCA 
ATCATCATCC 
TACAAGCTTG 
CTCAAGCCTC. 



11 

I . 

GGACGCCAGA 
CGCTSSTGCC 
TAGACGCGGA 
TGGAGTTTTA 
ATACCAGCCA' 
GCCCCACACA 
CACTGTCCAG 
CAACAGTTCG 
CAGAGAAGGA 
CCCGAATTCT 
ACTGCCAAGG 
CAGGAAGCTA 
CTTATTACCC 
GTTCCATCTA- 
ATGACACAGA 
CCATCCTTAA 
CCTACTTTGG 
TGGTGGGGGC' 
GGGTCTACGT- 
CTGGCCATGA 
AGGATGGCTA 
TAGTGTTTGT 
AGCCCCTGTG 
GAGACCTGGA 
CTGTGGTATA 
CCATGTTCAA 
ACCTTAGCTT 
TGGAACTTCA 
CCTCCAGGCA 
GCAGAGAGAT 
TTCACATCGC 
GGCCAGCCCT 
ACTGTGGAGA 
ACCATGTGTA 
GTGAGGGTGG 
CAGGACTCGT 
ACCAGAGCGG 
GGGGTGGCCT 
ACTTCCAGAT, 
TCTCCGTGGA v 
TATTCCCAGT 
CTCCTGTCCA 
TGCTGGAACT 
TTACGGGACT 
CCGAGGGTTC 
CGGGACCTCAr 
GGCCCCTGCA 
CTTTCTTGCA- 
TGAAGATGCC 
CAGCTGTGCA 
TAGCCATCCT 
GATTCTTCAA 
CAGCCACCTC 



21 • 

I . 

GTCCCCTCTC 
GCTGCTGTTG 
GGCCCCAGCA' 
CCGGCCGGGA 
GCCAGGAGTG 
GTGCACCCCC 
CTCAGAGGGA 
AGCCCATGGC 
GCCACTGAGC 
GGAGTATGCA 
AGGCTTCAGT. 
TTTCTGGCAA 
CGAGTACCTG 
TGATGACAGC 
AGACTTTGTT 
TGGCTCAGAC 
CTATGCAGTG 
ACCCCTGCTC 
CTACCTGCAG' 
TGAGTTTGGC 
CAATGATGTG 
ATTTCCTGGG 
GGCAGCCAGC 
TGGCAATGGA 
CAGGGGCCGC 
CCCAGAGGAG 
CTGCCTCAAT 
GCTGGACTGG 
GGCAACCCTG 
GAAGATCTAC 
TCTCAACTTC 
ACATTATCAG 
AGACAACATC 
CCTGGGTGAC 
CGCCTATGAG 
CAGACACCCA 
CCTGCTGGTG 
TCGGTTTACA 
CCTCAGCAAG 
GGCTCAGGCC 
AAGCGACTGG 
CCATGTCTAT 
CAGCTGTCCC 
CAACTGCACC 
CCTGCACCAC 
GATCCTGAAA 
CCAACAAGAG 
GCGGGAGCAC 
CTACCGAATC 
ATGGACCAAG 
GTTTGGCCTC 
ACGCTCCCTC 
TGATGCCTGA 



31 41 

I I 

CACGCCGTGC AGCTGCGCTG 
CTGCTSSTGC CGCCGCCACC 
GTACTCTCGG: GGCCCCCGGG 
ACAGACGGGG TCAGTGTGCT 
CTGCAGGGTG GTGCTGTCTA 
ATTGAATTTG ACAGCAAAGG 
GAGGAGCCTG; TGGAGTACAA 
TCCTCCATCT TGGCATGCGC 
GACCCCGTGG GCACCTGCTA 
CCCTGCCGCT. CAGATTTCAG 
GCCGAGTTCA CCAAGACTGG 
GGCCAGATCG TGTCTGCCAC 
ATCAACCTGG TTCAGGGGCA 
TACCTAGGAT ACTCTGTGGC. 
GCTGGTGTGC CCAAAGGGAA- 
ATTCGATCCC TCTACAACTT 
GCCGCCACAG ACGTCAATGG 
ATGGATGGGA CCCCTGACGG 
CACCCAGCCG GCATAGAGCC 
CGATTTGGCA GCTCCTTGAC 
GCCATCGGGG CTCCCTTTGG 
GGCCCAGGAG GGCTGGGCTC 
CACACCCCAG ACTTCTTTGG 
TATCCTGATC . TGATTGTGGG 
CCCATCGTGT' CCGCTAGTGC 
CGGAGCTGCA GCTTAGAGGG 
GCTTCTGGAA AACACGTTGC 
CAGAAGCAGA AGGGAGGGGT 
ACCCAGACCC TGCTCATCCA 
CTCAGGAACG AGTCAGAATT 
TCCTTGGACC CCCAAGCCCC 
AGCAAGAGCC GGATAGAGGA 
TGTGTGCCTG ACCTGCAGCT 
AAGAATGCCC TGAACCTCAC 
GCTGAGCTTC GGGTCACCGC 
GGGAACTTCT CCAGCCTGAG 
TGTGACCTGG GCAACCCCAT 
GTCCCTCATC TCCGGGACAC 
AATCTCAACA ACTCGCAAAG 
CAGGTCACCC TGAACGGTGT 
CATCCCCGAG ACCAGCCTCA 
GAGCTCATCA ACCAAGGCCC 
CAGGCTCTGG AAGGTCAGCA 
ACCAATCACC CCATTAACCC 
CAGCAAAAAC; GGGAAGCTCC 
TGCCCGGAGG CTGAGTGTTT 
AGCCAAAGTC TGCAGTTGCA 
CAGCCATTTA GCCTGCAGTG 
CTGCCTCGGC AGCTGCCCCA 
GCAGAAGGCA GCTATGGCGT 
CTGCTCCTAG GTCTACTCAT 
CCATATGGCA CCGCCATGGA 



ACTTCTTCAr 
GTCCCTCCTG 
GGTGTAGOGC 
CAAATAACAG 
TGCTGCCAAG 
AAAACTCGTG. 
TTTCATTCGG 
ATGAAATGAT 
TG 



51 
I 

FAQFVQEATY 

SDCCSQSEEG 

PFLYAPTILI* 

GTRTFQAITV.' 

QQDTLSNKIT - 

LASFVHEYSR' 

LAKRSCGLFQ;. 

ACGEGAADII- 

FIFHKDLCQA 

CFAEEGQKLI 



51 



1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



GGGCCCCCGG 

CAGGGTCGGG 

CTCCTTCTTC 

GGTGGGAGCA 

CCTCTGTCCT- 

CTCTCGGCTC 

GTCCTTGCAG 

TCCACTGTAC 

CCTCTCCACA: 

CTGGGCAGCA 

CCGTGTGGTT 

TCAGGAGCAG 

GCTGCAGACT-- 

TGTTGGTGAA 

CCTCACTTAC 

CTCAGGGGAA 

GGACGGGCTG 

GCGGCCTCAG 

CACGCCCACC 

CCCCCTGGGG- 

TGGGGAGACC 

TAAGCCTTCC 

CTCTGCCCTT 

GTCCTTTGGT ' 

CTCCCTCACC 

GAACCCTGTG - 

TGACTCCATT- 

ACGGCGGGCA 

GAATGGGGCT 

TCGAGACAAA . 

AGTGGACAGC 

CAAGGCTCAG 

GGAAGTGTTT 

TTTCCATGCC 

CCCTCCAGAG 

CTGTGACTAC 

GAAGGCAGGA 

TAAGAAAACC 

CGACGTGGTT 

CTCCAAGCCT 

GAAGGAGGAG 

CAGCTCCATT 

GCTCCTATAT 

AAAGGGCCTG 

AAGCCGCAGC 

CAGGCTGCGC 

TTTCCGAGTC- 

TGAGGCTGTG 

AAAAGAGCGT 

CCCACTGTGG 

CTACATCCTC 

AAAAGCTCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280' 

2340 

2400 

2460 

2520 

2580 

2640 ' 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 
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Seq ID NO:, 387 Protein sequence" 
Protein Accession #» NP_002196.1 

5.1* .11- 21 31 41 ? X 

I I I I I !" 

MGSRTPESPL HAVQLRWGPR RRPPLLPLLL LLLPPPPRVG GFNLDAEAPA VLSGPPGSFF 60 

GFSVEPYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 120 

LESSLSSSEG EEFVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEXEPLS DPVGTCYLST 160 

10 DNFTRILEYA PCRSDFSWAA? GQGYCQGGPS AEFTKTGRW LGGPGSYFWQ: GQILSATQEQ : 240 

IAESYYPEYL 'INLVQGQLOT. RQASSIYDDS YLGYSVAVGE FSGDDTEDFV- AGVPKGNLTY 300 

GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360 

EVGRVYVYLQ . HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 

QQGWFVFPG. GPGGLGSKPS' QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG' YPDLIVGSFG, 4B0 

15 ■ VEKAWYRGR' PIVSASASLT IFPAMFNPEE RSCSIiEGNPV ACINLSFCUT ASGKHVADSI:. 540 

GFTVELQLDW. QKQKGGVRRA- LFLASRQATL TQTLLIQNGA REDCREMKIY LRN3SEFRDK~- 600 

LSPIHIALNP SLDPQAPVDS HGLRPALHYQ SK5RIEDKAQ ILLDCGEDN* CVPDUJLEVF. 660 

GEQNHVYLGD. KNALNLTFHA- QNVGEGGAYE" AELRVTAPPE AEYSGLVRHP- GNFSSLSCDY" 720 

FAVNQSRIiLV- CDLGNPMKAG ' ASLWGGLRFT VPHLRDTKKT- IQFDFQILSK NLNNSQSDW- • 780 

20 SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DI/5PAVHHVY* ELINQGPSSI- 840 

SQGVLELSCP QALEGQQLLY VTRVTGLNCT- TNHPINPKGL' ELDPBGSIiHH QQKREAPSRS: 900 

SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV VTAXTFLQREH QPFSLQCEAV 960 

YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGL LLLGliilYIL 1020 
YKLGFFKRSL PYGTAMEKAQ LKPPATSDA 



25 



65 



80 



Seq' ID NO: 388 DNA sequence. 
Nucleic Acid Accession #: NM_002425 
Coding sequence: 26.. 1453 



30 1 11 21 31 41 51 

AAAGAAGGTA- AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 

AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG. GAGGACTCCA ACAAGGATCT 120 

TGCCCAGCAA TACCTAGAAA- ' AGTACTACAA CCTCGAAAAG. GATGTGAAAC AGTTTAGAAG 180 

35 AAAGGACAGT AATCTCATTG TTAAAAAAAT" CCAAGGAATG" CAGAAGTTCC TTGGGTTGGA 240 

GGTGACAGGG, AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGTr 300 

TCCTGACGTT GGTCACTTCA GCTCCTTTCC TCGCATGCCG AAGTGGAGGA AAACCCACCT 360 

TACATACAGG' ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT;' 420 

TCAGAAAGCT CTGAAAGTCT, GGGAAGAGGT. GACTCCACTC ACATTCTCCA* GGCTGTATGA 480 

40 AGGAGAGGCT GATATAATGA TCTCTTTOGC: AGTTAAAGAA CATGGAGACT; TTTACTCTTT. 540 

TGATGGCCCA' GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA ! 600 

TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA' TCAGGCACCA* ATTTATTCCT 660 

CGTTGCTGCT- CATGAACTTG GCCACTCCCT * GGGGCTCTTT CACTCAGCCA- ACACTGAAGC 720 

TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 

45 TGATGTGAATr GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAAC CCCT ' 840 

GGTGCCCACA AAATCTGTTC CTTCGGGATC' TGAGATGCCA GCCAAGTGTG ATCCTGCTTT ■'■ 900 

GTCCTTCGAT G CCAT CAGCA --- CTCTGAGGGG AGAATATCTG, TTCTTT AAAG : ACAGATATTT • 960 

TTGGCGAAGA TCCCACTGGA- ACCCTGAACC TGAATTTCAT- TTGATTTCTG CATTTTGGCC' 1020 

CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA" CCGTTTTTAT 1080 

50 TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG. GTTATCCAAG^ 1140 

AGGCATCCAT ACCCTGGGTT- TTCCTCCAAC CATAAGGAAA ATTGATGCAG' CTGTTTCTGA 1200 

CAAGGAAAAG AAGAAAACAT ' ACTTCTTTGC AGCGGACAAA TACTGGAGAT- TTGATGAAAA 1260 

TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT- GATGACTTTC CAGGAGTTGA 1320 

GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 

5 5 ACAGTTTGAG - TTTGACCCCA ATGCCAGGAT. GGTGACACAC ATATTAAAGA GTAACAGCTG. 1440 

GTTACATTGC -TAGGCGAGAT AGGGGGAAGA CAGATATGGG:. TGTTTTTAAT AAATCTAATA" 1500 

ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT" 1560 

GAAGAAGATG AGCCTTGCAG- ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 

ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680 

60 ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740 
CTT 



Seq ID. NO: 389 Protein sequence 
Protein Accession #: np_002416 



1 11 21 31 41 51 

I I I I I J 

MHLAFLVLLC LPVCSAYPLS GAAKEEDSNK DLAQQYLEKY YNLEKDVKQF RRKDSNLIVK 60 

KIOGMQKFLG LEVTGKLDTD TLEVMRKPRC GVPDVGHFSS FPGMPKWRKT HLTYRIVNYT 120 

70 PDLPRDAVDS AIEXALKVWE EVTPLTFSRL YEGSADIMIS FAVKEHGDFY SFDGPGHSIA 180 

HAYPPGPGLY GDIHFDDDEK WTEDASGTNL FLVAAHFXGH SLGLFHSANT EALMYPLYNS 240 

FTELAQFRLS QDDVNGIQSL YGPPPASTEE PLVPTKSVPS GSEMPAKCDP ALSFDAISTL 300 

RGEYLFFKDR YFWRRSHWNP EPEFHLISAF WPSLPSYLDA AYEVNSRDTV FIFKGNEFWA 360 

IRGNEVQAGY PRGIHTLGFP PTIRKIDAAV SDKEKKKTYF FAADKYWRFD ENSQSMEQGF 420 

75 PRLIADDFPG VEPKVDAVLQ AFGFFYFFSG SSQFEPDPNA RMVTHILKSN SWLHC 



Seq ID NO: 390 DNA sequence 

Nucleic Acid Accession #: NM_002421.2 

Coding sequence: 1..1409 



11 21 31 41 51 

| I I I I I 

ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60 

CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 120 

85 TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 180 

GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240 

GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300 
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GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 360 

TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 420 

TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 480 

ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT S40 

CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG" ATGCTCATTT TGATGAAGAT 600 

GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTT GCGGC TCATGAACTC 660 

GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGTA CCCTAGCTAC 720 

ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 7 SO 

GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 840 

AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA 900 

TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960 

TGGCCACAAC TGCCAAATGG . GCTTGAAGCT GCTTACGAAT TTGCCGACAG AGATGAAGTC 1020 

CGGTTTTTCA AAGGGAATAA. GTACTGGGCT- GTTCAGGGAC AGAATGTGCT- ACACGGATAC 1080 

CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT ' 1140 

CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTG GAGG TAT 1200 

GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260 

GGAATTGGCC ACAAAGTTGA- TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320 

' GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380 
AATAGCTGGT TCAACTGCAG GAAAAATTAG. 

Seq ID NO: 3 91 Protein sequence 
Protein Accession #: *" NP_002412.1 

1 11 21 31 |1 51 

MHSFPPLLLL LFWGWSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 60 

VEKLKQMQEF FGLKVTGKPD AETLKVMKQP : RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 120 

YTPDLPRADV DHAIEKAFQL WSNVTPLTFT KVSEGQADIM: ISFVRGDHRD NSPFDGPGGN- 180 

LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHBL GHSLGLSHST DIGALMYPSY 240 

TFSGDVQLAQ' DDIDGIQAIY GRSQNPVQPI GPQTPKACDS' KLTFDAITTI RGEVMFFKDR 300 

FYMRTNPFYP EVKLNFISVF. WPQLPNGLEA AYEFADRDBV- RFFKGNKYWA VQGQNVLHGY 3 60 

PKDIYSSFGF, PRTVKHIDAA LSSENTGKTY- FFVANKYWRY 1 DEYKRSMDPG YPKMIAHDFP 420 
GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 

Seq ID NO: 392 DNA sequence 

Nucleic Acid Accession #: NM_002421.2 

Coding sequence: 1..1409 

1 ' 11 .21 31 41 . 51 

ATGCACAGCT TTCCTCCACT GCTGCTGCTG' CTGTTCTGGG GTGTGGTGTC ACACAGCTTC .60 

CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA; 120 

TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 1 B 0 

GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG. GAAACCAGAT 24 0 

GCTGAAACCC TGAAGGTCAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT ■ 300 

GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 360 

TACACGCCAG ATTTGCCAAG. AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 420 

TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG. AGGGTCAAGC AGACATCATG 480 

ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT. TTGATGGACC TGGAGGAAAT 54 0 

CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG - ATGCTCATTT TGATGAAGAT 600 

GAAAGGTGGA- CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC: TCATGCCCTC 660 

GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGTA- CCCTAGCTAC 720 

ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 780 

GGACGTTCCC AAAATCCTGT'- CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 840 

AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA- 900 

TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960 

TGGCCACAAC' TGCCAAATGG GCTTGAAGCT GCTTACGAAT TTGCCGACAG AGATGAAGTC 1020 

CGGTTTTTCA- AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 1080 

CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140 

CTTTCTGAGG. AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 1200 

GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260 

GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320 

GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380 
AATAGCTGGT TCAACTGCAG GAAAAATTAG 



Seq ID NO:. 393 Protein sequence 
Protein Accession 8: NP_002412.1 

1 11 21 ' 31 41 51 

I I ' • I I 1 I 

MHSFPPLUil* LFWGWSHSF PATLETQBQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 60 
VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 120 
YTPDLPRADV DHAIEKAFQL WSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 180 
LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHAL GHSLGLSHST DIGALMYPSY 240 
TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300 
FYMRTNPFYP EVELNPISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 360 
PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 420 
GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 



Seq ID NO: 3 94 DNA sequence 

Nucleic Acid Accession #: NM_014331.2 

Coding sequence: 1. .1506 

1 11 21 ' . J 31 41 SI 

I V I I I i 
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ATGGTCAGAA AGCCTGTTGT GTCCACCATC " TCCAAAGGAG . GTTACCTGCA GGGAAATGTT 6 0 

AACGGGAGGC TGCCTTCCCT- GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120 

AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA: 180 

GGAATCTTCA TCTCTCCTAA GGGCGTGCTC CAGAACACGG GCAGCGTGGG CATGTCTCTG 240 

ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTG AATTG 300 

GGAACAACTA' TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTOCA 360 

TTACCAGCTT • TTGTACGAGT CTGGGTGGAA CTCCTCATAA" TACGCCCTGC AGCTACTGCT 420 

GTGATATCCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480 

CCTGAACTTG ' CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540 

AGCATGAGTG TCAGCTGGAG CGCCCGGATC CAGATTTTCT- TAACCTTTTG CAAGCTCACA 600 

GCAATTCTGA TAATTATAGT CCCTGGAGTT. ATGCAGCTAA TTAAAGGTCA AACGCAGAAC " 660 

tttaaagacg: cgttttcagg. AAGAGATTCA AGTATTACGC" GGTTGCCACT GGCTTTTTAT 720 

TATGGAATGT. ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAGAA 780 

AACCCTGAAA AAACCATTCC CCTTGCAATA TGTATATCCAV TGGCCATTGT CACCATTGGC 840 

TATGTGCTGA CAAATGTGGC CTACTTTAOG ACCATTAATG- CTGAGGAGCT GCTGCTTTCA 900 

AATGCAGTGG. CAGTGACCTT TTCTGAGCGG CTACTGGGAA ATTTCTCATT AGCAGTTCOG 960 

ATCTTTGTTG' CCCTCTCCTG- CTTTGGCTCC' ATGAACGGTG GTGTGTTTGC TGTCTCCAGG *" 1020 

TTATTCTATG TTGCGTCTCG AGAGGGTCAC CTTCCAGAAA TCCTCTCCAT- GATTCATGTC-. 1080 

CGCAAGCACA- CTCCTCTACC AGCTGTTATT GTTTTGCACC . CTTTGACAAT GATAATGCTC 1140 

TTCTCTOGAG. ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 1200 

GGGCTGGCAG' TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT; 1260 

TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 1320 

CTTTCCCTCT ATTCGGACCC ATTTAGTACA- GGGATTGGCT TCGTCATCAC TCTGACTGGA. 1380 

GTCCCTGCGT. ATTATCTCTT TATTATATGG GACAAGAAAC CCAGGTGGTT- TAGAATAATG 1440 

TCAGAGAAAA TAACCAGAAC ATTACAAATA ATACTGGAAG. TTGTACCAGA AGAAGATAAG IS 00 

TTATGAACTA ATGGACTTGA GATCTTGGCA ATCTGCCCAA - GGGGAGACAC AAAATAGGGA 1560- 

TTTTTACTTC ATTTTCTGAA AGTCTAGAGA' ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620 

CAGTTATTTT.- TATTCATATA • TTTTAGCATA TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680 

AACTCTATGT AGTTATAGAA- AGTGAATATG CAGTTATTCT. ATGAGTCGCA CAATTCTTGA 1740 

GTCTCTGATA CCTACCTATT; GGGGTTAGGA GAAAAGACTA GACAATTACT ATGTGGTCAT' 1800 

TCTCTACAAC ATATGTTAGC ACGGCAAAGA ACCTTCAAAT TGAAGACTGA GATTTTTCTG 1860 

T ATATATGGG . TTTTGTAAAG: ATGGTTTTAC ACACTACAGA TGTCTATACT. GTGAAAAGTG 1920 

TTTTCAATTC TGAAAAAAAG CATACATCAT GATTATGGCA AAGAGGAGAG AAAGAAATTT . 1980 

ATTTTACATT. GACATTGCAT- TGCTTCCCCT TAGATACCAA TTTAGATAAC AAACACTCAT 204 0 

GCTTTAATGG ATTATACCCA'- GAGCACTTTG - AACAAAGGTC AGTGGGGATT GTTGAATACA ' 2100 

TTAAAGAAGA. GTTTCTAGGG; GCTACTGTTT ATGAGACACA TCCAGGAGTT ATGTTTAAGT " 2160 

AAAAATCCTT- GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 

TTATCTGTCA- TTTTTTTTTT TCACATCAGT TTGATCAGGA" AAGTGTATAA" CACATCTTAG 2280 

AGCAAGAGTT' AGTTTGGTAT TAAATCCTCA TTAGAACAAC CACCTGTTTC ACTAATAACT 2340 

TACCCCTGAT GAGTCTATCT AAACATATGC ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400 

TGAGAGAAAT AACCAACAAA- GAAGATGTTC AAAATAATAG TCCCATATCT- GTAATCATAT. 2460 

CTACATGCAA TGTTAGTAAT: TCTGAAGTTT. TTTAAATTTA TGGCTATTTT TACACGATGA 2520 

TGAATTTTGA CAGTTTGTGC ATTTTCTTTA TACATTTTAT ATTCTTCTGT TAAAATATCT 2580 

CTTCAGATGA- AACTGTCCAG" ATTAATTAGG AAAAGGCATA TATTAACATA AAAATTGCAA 2640 

AAGAAATGTC GCTGTAAATA- AGATTTACAA CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700 

ATCTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 2760 

CTGATAAGAA GAAAATTGAA ATGAGAATCT GTGGATAAGT GTTTGTGTTC AGAAGATGTT 2820 
GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT GGCTTACATC TGTAATCCCA '2880 

GCACTTTGGG AGGCTGAGGG. GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2940 

CAACATGGAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGGGCAT GGTGGCACAT 3000 

GCTGGTAATC TCAGCTATTG- AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 3060 

GAGGTTGCAG TGAGCCAAGA TTGCACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACT 3120 
CCATCTCCAA AAAAAAAAAA AAAA 

Seq ID NO: 395 Protein aequence 
Protein Accession ft: NP_0S5146.1 

1 . 11 21 31 41 51 

III I I I 

MVRKPWSTI SKGGYLQGNV NGRLPSLGNK EPPGQEKVQL KRKVTLLRGV SIIIGTIIGA . 60 

GIFISPKGVL QNTGSVGMSI* TIWTVCGVLS LFGALSYAEL GTTIKKSGGH YTYILEVFGP 120 

LPAFVRVWVE LLIIRPAATA VISIiAFGRYI LEPFFIQCEI PELAIKLITA VGITWMVLN 180 

SMSVSWSARi; QI FLTFCKLT AILIIIVPGV MQLIKGQTQN- FKDAFSGRDS SITRLPLAFY 240- 

YGMYAYAGWF YLNFVTEEVE NPEKTIPLAI CISMAITIGV YVLTKVAYFT TINAEELLLS 300 

NAVAVTFSER LLGNFSLAVP IFVALSCFGS MNGGVFAVSR LFYVASREGH liPEILSMIHV 360 

RKHTPLPAVI VLHPLTMIML FSGDLDSLLN FLSFARWLFI GIAVAGLIYL RYKCPDMHRP 420 

FKVPLFIPAL FSFTCLFMVA LSLYSDPFST GIGFVITLTG VPAYYLFIIW DKKPRWFRIH 480 
SEKITRTLQI ILEWPEEDK L 



Seq ID NO: 396 DNA sequence 

Nucleic Acid Accession #: NM_006528 

Coding sequence: 57.^764 

1 11 21 31 41 S1 

I I I I I I 

GCCGCCAGCG GCTTTCTCGG ACGCCTTGCC CAGCGGGCCG CCCGACCCCC TGCACCATGG 60 

ACCCCGCTCG CCCCCTGGGG, CTGTCGATTC TGCTGCTTTT CCTGACGGAG GCTGCACTGG 120 

GCGATGCTGC TCAGGAGCCA ACAGGAAATA ACGCGGAGAT CTGTCTCCTG CCCCTAGACT 180 

ACGGACCCTG CCGGGCCCTA CTTCTCCGTT ACTACTACGA CAGGTACACG CAGAGCTGCC 240 

GCCAGTTCCT GTACGGGGGC TGCGAGGGCA ACGCCAACAA TTTCTACACC TGGGAGGCTT 300 

GCGACGATGC TTGCTGGAGG ATAGAAAAAG TTCCCAAAGT TTGCCGGCTG CAAGTGAGTG . .. 360 

TGGACGACCA GTGTGAGGGG TCCACAGAAA AGTATTTCTT TAATCTAAGT TCCATGACAT 420 

GTGAAAAATT CTTTTCCGGT GGGTGTCACC GGAACCGGAT TGAGAACAGG TTTCCAGATG .480 

AAGCTACTTG TATGGGCTTC TGCGCACCAA AGAAAATTCC ATCATTTTGC TACAGTCCAA 540 

AAGATGAGGG ACTGTGCTCT GCCAATGTGA CTCGCTATTA TTTTAATCCA AGATACAGAA 600 

CCTGTGATGC TTTCACCTAT ACTGGCTGTG GAGGGAATGA CAATAACTTT GTTAGCAGGG 660 
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WO 02/086443. 

AGGATTGCAA ACGTGCATGT GCAAAAGCTT 
GCTTTGCCAG TAGAATCCGG AAAATTCGGA 
ATCTTGTTTG TCTTTATGGC TTATTTGCCT 
GCATGAGGAA ACAAATCATT GGTGATTTAT 
TTCAAAAATT TGGATTTTTT TATATATAAC 
TTTAATTTAT GGTTCAACTG TTTGTGAGAC 
AAATATGACT CACTCATTTC TTGGGGTCGT 
AAACAACAT A AGACAATATA ATCATGTGCT 

Seq ID NO: 39? Protein sequence 
Protein Accession 8: NP_006519 



TGAAAAAGAA 
AGAAGCAATT 
TTATGGTTGT 
TCACCAGTTT- 
TAGCTGCTAT 
GAATTCTTGC 
ATTCCTGATT 
TTTAACA7AT 



AAAGAAGATG 
TTAAACATTC 
ATCTGAAGAA 
TTATTAATAC 
TCAAATGTGA 
AATGCATAAG 
TCAGAAGAGG 
TTGAGAATAA 



CCAAAGCTTC 
TTAATATGTC 
TAATATGACA ' 
AAGTCACTTT 
GTCTACCATT 
ATATAAAAGC 
ATCATAACTG 
AAAGGACTAG 



11 



21 



31 



51 



ATGGCTTCAC 
CCCGAGTTTG 
TTCGTGATGG 
AAAGGATACT 
TTGGTGTTCC 
ACGTCCAGCT 
GCTACGCTGC 
TTCAGGTACA 
GTCACCTCCG. 
GTGAACGTGC 
CAGCCCGAGA 
CAGTCCAGCA 
ATGTGCTGGA 
ACGCGGCCTC 
ACCATCATCT 
ATTCGGAGGA 
GCGTACATGA* 
CCGCTCCTGT 
TGCCGCCTGT 
ACCACCGACA 
TCTGCAAGGA 
TCTAAGTCCC 
AATTCTGCTG 



11 

I 

CCAGCCTCCC. 
AGGTGGCCAC 
GCCTTCTGGG 
TGCAGAAGGA 
TCATCGGCAT 
ACACCCTGTC 
TGCACGTGCT 
AGGCTGTGTC 
CCCTGGTGGC 
CCAGCCACCG 
CCTCCAATAT 
TCTTCGGCGC 
ACATGATGCA 
CGCAGCTGAG 
TCCTGAGGCT 
TCATGGCTGC 
TCCTCCTCCC 
ACACGGTGTC 
CGCTGCAGCA 
GCGCCCGCTT 
GAACTGAGAA 
AGTCATTGAG 
CAGAGAATGG 



21 
I 

GGGCAGTGAC 
CTGGATCAAA 
GAACAGCGTC 
GGTGACAGAC 
GCCCATGGAG 
CTGCAAGCTG 
GACGCTCAGC 
GGGACCTTGC 
ACTGCCCTTG 
GGGTCTCACT 
GTCCATCTGT 
CTTCGTGGTC 
GGTGCTCATG 
GAAGTCCGAG 
GATTGTTGTG 
GGCCAAACCC 
CTTCTCGGAG 
CTCGCAGCAG 
CGCCAACCAC 
TGTGCAGCGC 
GATTTTCTTA 
TCTCGAGTCA 
TTTTCAGGAG 



31 
I 

TGCTCCCAAA 
ATCACCCTTA 
ACCATTCGGG 
CACATGGTGA 
TTCTACAGCA 
CACACTTTCC 
TTTGAGCGCT 
CAGGTGAAGC 
CTGTTTGCCA 
TGCAACCGCT 
ACCAACCTCT 
TACCTCGTGG 
AAAAGCCAGA 
AGCGAAGAGA 
ACATTGGCCG- 
AAGCACGACT. 
ACGTTTrTCT- 
TTTCGGCGGG 
GAGAAGCGCC 
CCGTTGCTCT 
AGCACTTTTC 
CTAGAGCCCA 
CATGAAGTTT 



41 

! 

TCATTGATCA 
"TTCTGGTGTA 
TCACCCAGGT 
GTTTGGCTTG 
TCATCTGGAA 
TCTTCGAGGC 
ACATCGCCAT 
TGCTGATTGG 
TGGGTACTGA 
CCAGCACCCG 
CCAGCCGCTG 
TCCTGCTCTC 
AGGGCTCGCT 
GCAGGACCGC 
TATGCTGGAT 
GGACGAGGTC 
ACCTCAGCTC 
TGTTCGTGCA 
TGCGCGTACA 
TCGCGTCCCG 
AGAGCGAGGC 
ACTCAGGCGC 
GA 



51 
I 

CAGTCATGTC 
CCTGATCATC 
GCTGCAGAAG 
CTCGGACATC 
TCCCCTGACC 
CTGCAGCTAC 
CTGTCACCCC 
CTTCGTCTGG 
GTACCCCCTG. 
CCACCACGAG 
GACCGTGTTC 
CGTAGCCTTC 
GGCCGGGGGC 
CAGGAGGCAG 
GCCCAACCAG 
CTACTTCCGG . 
GGTCATCAAC 
GGTGCTGTGC 
TGCGCACTCC 
GCGCCAGTCC 
CGAGCCCCAG 
GAAACCAGCG 



Seq ID NOt 399 Protein sequence 
Protein Accession #: NP_001499.1 



MASPSLPGSD 
KGYLQKEVTD 
ATLLHVLTLS 
VNVPSHRGLT 
MCWNMMQVLM 
IRRIMAAAKP 
CRLSLQHAKH 
SKSQSLSLES 



11 

! 

CSQIIDHSHV 
HMVSLACSDI 
FERYIAICHP 
CNRSSTRHHB 
KSQKGSLAGG 
KHDWTRSYFR 
EKRLRVHAHS 
LEPNSGAKPA 



21 
I 

PBFBVATWIK 
LVFLIGMPME 
FRYKAVSGPC 
QPBTSNMSIC 
TRPPQLRKSE 
AYMILLPFSB 
TTDSARPVQR 
NSAAENGFQE 



31 

1 

ITLIIiVYLII 
FYSIIWNPLT 
QVKLLIGFVW 
TNLSSRWTVF 
SEESRTARRQ 
TFFYLSSVIN 
PLLPASRRQS 
HEV 



41 
I 

PVMGLLGNSV 
TSSYTLSCKL 
VTSAIiVALPL 
QSSIPGAFW 
TIIFLRLIW 
PLLYTVSSQQ 
SARRTEKIFIi 



51 
I 

TIRVTQVLQK 
HTFLFEACSY 
LFAMGTEYPL 
YLWLLSVAF 
TLAVCWMPNQ 
FRRVFVQVLC 
STFQSEAEPQ 



Seq ID NO: 400 DNA sequence 

Nucleic Acid Accession #: NM_006475.1 

Coding sequence: 28 . .2538 



AACAGAACTG 
TTGCTGCTTA 
AGTCGTATCA 
ACCAAAAAGA 
AAAACGACTG 
TGCCCAGCAG 
ACAACGCAGC 

ttcacttact, 
ggtttggaga' 
aagagaatgt 

TTGGGGCTTT 
ATCCATGGGA 
CAAATTGGTA 
GCAGCTGCCA 
TTTGCTCCCA 
. GGAGACAAAG 
TCTGAGTCTA 



11 
I 

CAACGGAGAG 
TTGTTAACCC 
GGGGTCGGGA 
AATACTTCAG 
TTTTATATGA 
TTTTGCCCAT 
GCTATTCTGA 
TTGCACCGAG 
GCAAOGTGAA 
TGACCAAGGA 
TCATTAACCA 
ACCAGATTGC 
CCTCAATTCA 
TCACATCGGA 
CCAATGAGGC 
TGGCTTCCGA 
TTATGGGAGG 



21 
I 

ACTCAAGATG 
TATAAACGCC 
CCAAGGCCCA 
CACTTGTAAG 
ATGTTGCCCT 
TGACCATGTT 
CGCCTCAAAA 
TAATGAGGCT- 
TGTTGAATTA 
CTTAAAAAAT 
TTATCCTAAT 
AACAAATGGT- 
AGACTTCATT 
CATATTGGAG 
TTTTGAGAAA 
AGCTCTTATG 
AGCAGTCTTT 



31 

I 

ATTCCCTTTT 
AACAATCATT 
AATGTCTGTG 
AACTGGTATA 
GGTTATATGA 
TATGGCACTC 
CTGAGGGAGG 
TGGGACAACT 
CTGAATGCTT 
GGCATGATTA 
GGGGTTGTCA 
GTTGTCCATG 
GAAGCAGAAG 
GCCCTTGGAA 
CTTCCACGAG 
AAGTACCACA 
GAGAOGCTGG 



41 

I' 

TACCCATGTT 
ATGACAAGAT 
CCCTTCAACA 
AAAAGTCCAT 
GAATGGAAGG 
TGGGCATCGT 
AGATCGAGGG 
TGGATTCTGA 
TACATAGTCA 
TTCCTTCAAT 
CTGTTAATTG 
TCATTGACCG 
ATGACCTTTC 
GAGACGGTCA 
GTGTCCTAGA 
TCTTAAATAC 
AAGGAAATAC 



51 
1 

TTCTCTACTA 
CTTGGCTCAT 
GATTTTGGGC 
CTGTGGACAG 
AATGAAAGGC 
GGGAGCCACC 
AAAGGGATCC 
TATCOGTAGA 
CATGATTAAT 
GTATAACAAT 
TGCTCGAATC 
TGTGCTTACA 
ATCTTTTAGA 
CTTCACACTC 
AAGGTTCATG 
TCTCCAGTGT 
AATTGAGATA 



720 
* 780 
840 
900 
960 
1020. 
1080 
1140 



PCTAJS02/12476 



■ 41 

! I VI I I I 

MDPARPLGLS ' ILLLFLTEAA LGDAAQEPTG NNAEICLLPL DYGPCRALLI* RYYYDRYTQS 
CRQFLYGGCE GNANNFYTWE ACDDACWRIE, KVPKVCRLQV: SVDDQCEGST EKYFFNLSSM 
TCEKFPSGGC HRNRIENRFP DEATCMGFCA PKKIPSFCYS PKDEGLCSAN VTRYYFNPRY 
RTCDAFTYTG CGGNDNNFVS REDCKRACAK ALKKKKKMPK LRFASRIRKI RKKQF 

Seq ID NO: 398 DNA sequence 

Nucleic Acid Accession #: NM_001508.1 

Coding sequence: 1..1361 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



60 
120 
1B0 
240 
300 
360 
420 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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WO 02/086443 r... 

GGATGTGACG GTGACAGTAT AACAG TAAAT GGAATCAAAA TGGTGAACAA AAAGGATATT 
GTGACAAATA ATGGTGTGAT CCATTTGATT GATCAGGTCC TAATTCCTGA TTCTGCCAAA 
CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 
GGCTTGGCAT CTGCTCTGAG ' GCCAGATGGA GAATACACTT TGCTGGCACC TGTGAATAAT 
GCATTTTCTG ATGATACTCT CAGCATGGTT. CAGCGCCTCC TTAAATTAAT TCTGCAGAAT 
CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA ACGGGCAAAT ACTGGAAACC 
ATCGGAGGCA AACAGCTCAG AGTCTTCGTA TATCGTACAG CTGTCTGCAT" TGAAAATTCA 
TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT CCGC GAGATC 
ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 
TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGGAGACTGG 
ACATTATTTG TGCCAACCAA TGATGCTTTT AAGGGAATGA CTAGTGAAGA AAAAGAAATT 
CTGATACGGG ACAAAAATGC TCTTCAAAAC, ATCATTCTTT ATCACCTGAC ACCAGGAGTT 
TTCATTGGAA- AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGGAAGC 
AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA- ATGAATTGAA ATCAAAAGAA 
TCTGACATCA TGACAACAAA TGGTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA 
GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 
caaattaagt; TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT ctatacaact 
AAAATTATAA 1 ' CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT ' 
ATTATCAAAA* CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 
AGACTGATTA* AAGAAGGTGA AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 
AAATACACCA AAATCATTGA- TGGAGTGCCT GTGGAAATAA CTGAAAAAGA" GACACGAGAA 
GAACGAATCA TTACAGGTCC TGAAATAAAA - TACACTAGGA TTTCTACTGG AGGTGGAGAA 
ACAGAAGAAA' CTCTGAAGAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 
GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAGAC 
ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 
GAAGGT CGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA*- TGTTTATACA ACCCTAAGTC 
AATAACCTGA- CCTTAGAAAA TTGTGAGAGC CAAGTTGACT. TCAGGAACTG AAACATCAGC" 
ACAAAGAAGC AATCATCAAA TAATTCTGAA- CACAAATTTA ATATTTTTTT TTCTGAATGA 
GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA GAAAATATAA 
CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG AATCCATTAG 
AGAAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATOG AAGAGTTGTG AACTGTTATC 
CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 
TATCTCTCCA^ TGGGAAGCTA AGTTATAAAA ATAGGTGCTT' GGTGTACAAA ACTTTTTATA' 
TCAAAAGGCT; TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG TTATTTTTTA 
CAACTAATTT: TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA' TATTTTTTAA- 
TCTCAAACGT TTCAATAAAA CCATTTTTCA GATATAAAGA GAATTACTTC AAATTGAGTA 
ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 



PCT/US02/12476 



Seq ID NO: 401 Protein sequence 
Protein Accession #: NP_006466.1 



MIPFLPMFSL 
KNWYKKSICG 
KLREEIEGKG 
NGMI IPSMYK 
IEAEDDLSSF 
MKYHILNTLQ 
IDQVLIPDSA^ 
VQRLLKLILQ 
RNGAIHIFRB- 



TLLVNELKSK 
PKEIPVTVYT 
TEVIHGBPII 
QEEVTKVTKP 



11 . 
I 

LLLtilVNPIN 
QKTTVLYECC 
SFTYFAPSNE 
NLGLFINHYP- 
RAAAITSDIL 
CSBSIMGGAV 
KQVIELAGKQ 
NHILKVKVGL 
IIKPAEKSLH 
ILIRDKNALQ 
ESDIMTTNGV 
TKIITKWEP 
KKYTKIIDGV 
IEGGDGHLFE 



21 

1 

ANNHYDKILA 
PGYMRMEGMK 
AWDNLDSDIR 
NGWTVNCAR 
EALGRDGHFT 
FETLEGNTIE 
QTTFTDLVAQ: 
NELYNGQILE 
EKLKQDKRFS 
NIILYHLTPG. 
IHWDKLLY? 
KIKVIEGSLQ 
PVEITEKETR ; 
DEEIKRLLQG 



31 
I 

HSRIRGRDQG 
GCPAVLPIDH 
RGLESNVNVE 
IIHGNQIATN 
LFAPTNEAFE 
IGCDGDSITV 
IiGLASALRPD 
TIGGKQLRVF 
TFLSLLEAAD 
VFIGKGFEPG 
ADTPVGNDQL 
PIIKTEGPTL 
EERIITGPEI 
DTPVRKLQAN 



Seq ID NO: 402 DNA sequence 

Nucleic Acid. Accession 8; NM_002416 

Coding sequence: 40.. 417 



1 
I 

ATCCAATACA 
TTCCTCTTGG 
AAGGGTCGCT 
GACCTTAAAC 
AAGAATGGAG 
AAGTGGGAGA 
AAGAAAGTTC 
ACCACTTCAC 
TTCCAAAGGA 
ATTACTCTGA 
TTGTTAAAGG 
TTAAGGCCAT 
CTCACAACAG 
TATCTGAGGC 
TGAAATTGAG 
CTACAGGCCT 
ACTGGAGATC 
TTCCATCTTG 
AAGTCAGCTC 
CATATGCTCT 
AGTGCTTTCT 
AAATAAACCT 
CACATGGGTG 



11 
I 

GGAGTGACTT 
GCATCATCTT 
GTTCCTGCAT 
AATTTGCCCC 
TTCAAACATG 
AACAGGTCAG 
TGAAAGTTCG 
CAATAAGTAT 
GGATGGCATA 
AATTGTAACT 
CTATGATTGT 
GATTTTAGCA 
CTGCCTGGAA 
ACATGTCAGC 
CTGGACCTCA 
CACACACAAT" 
ACCAGTGTGT 
CCCGCTCAGG 
TTCTCCATCC 
GATTTATCTG 
TCTCCCAATT 
TTTTGGACAC 
AACACTCAAT 




CAGCACCAAC 
AAGCCCTTCC 
TCTAAACCCA 
CCAAAAGAAA 
AAAATCTCAA 
TCTGTGTTAA 
TAATACAAAG 
AAAGTTAGAA 
CTTTGTTCTT 
ATACCCATGT 
GAGCAGCCCT 
AAGTCCTAAG 
CCAAGCTGCT 
GTGTCTGAGA 
GGCTTTCAGA 
CTGACCACTT 
TACCACAATG 
AGTCAACTCC 
CATCCTCACT 
ACAAATTATC 
GGTTAACTAA 



TCTATCACTA 
ATTGGAGTGC 
CAAGGGACTA 
TGCGAGAAAA 
GATTCAGCAG 
AAGCAAAAGA- 
CGTTCTCGTC 
AAATGTTCTA 
GCTTATTAAT 
AGTTGATTTT-' 
CTACCACCCA 
CTACACAGAT- 
AGGCTTCCAC 
CCTGTTAGCA 
GTGGCCATCA 
GATTCATGCT 
GCCTCCTTTC 
TATTTCTTTT 
CAGTGCCTTT 
TT7CTCATCT 
CAGTCCAGCT 
TTAAAACTCC 
TTCTTGGGTG 



41 
I 

PNVCALQQIL 
VYGTLGIVGA 
LLNALHSHMI 
GWHVIDRVL 
KLPRGVLERF 
NGIKMVNKKD 
GEYTLLAPVN 
VYRTAVCIEN 
LKELLTQPGD 
VTNILKTTQG 
LEILNKLIKY 
TKVKIEGEPE 
KYTRISTGGG 
KKVQGSRRRL 



41 
I 

TGAAGAAAAG 
AAGGAACCCC 
TCCACCTACA 
TTGAAATCAT 
ATGTGAAGGA 
ATGGGAAAAA 
AAAAGAAGAC 
TTTTAATTAT 
TTGACTAGAA 
AAGAATCCAA 
CCAGTTGAAT 
GTTCACCCAA 
GTACTGCAGC 
TGCTGGTGAG 
ACCTCTGTAT 
GATTGTTATT 
TGGCTTTGGA 
TGTTCCCCTT 
CTTCTCTCCA 
TGTCCCCAAC 
TAGTTCAAGT 
TGTTTCACTT 
TTTATCCTAT 



51 

I 

GTKKKYFSTC 

TTTQRYSDAS 

NKRMLTKDLK 

TQIGTSIQDF ' 

MGDKVASEAL 

IVTNNGVIHL 

NAFSDDTLSM 

SCMEKGSKQG 

WTLFVPTNDA- 

SKIFLKEVND 

IQIKFVRGST 

FRLIKEGETI 

ETEETLKKLL 

REGRSQ 



51 
I 

TGGTGTTCTT 
AGTAGTGAGA 
ATCCTTGAAA 
TGCTACACTG 
ACTGATTAAA 
ACATCAAAAA 
TACATAAGAG 
ACCGCTATCA 
AATTTAAAAC 
ACGTTAAGAA 
TTCATCATGC 
CCACATCCCA 
CTCCAGAGAG 
CCAAGCAGTT 
TTGAATCAGC 
GGGTATCACC 
AGCCATGTGA 
TGCTTCATTC 
GTGCACCTGT . 
ACCCCACAGA 
CCTGCCTCTT 
GGTTCAGTAC 
CTCTCCAACC 



1080 
1140 : 
1200. 
1260 : " 
-1320 ' 
1380 : 
1440 
1500 
1S60 
1620 
1680 
1740 , 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580: 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
. 960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



334 



.5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

AGATTGTCAG CTCCTTGAGG GCAAGAGCCA 
- CTAATAATAC. TGTGGAACTA GGTTTTAATA 
TGGCAACCAG ACCATTGTCT CAGAGCAGGT 
CTAGCCTCTG GTAACCTCTT ACTTATTATC 
GATGCAACAT- CCTTGTCTTT TTATGACAGG 
GCACGTGGTA AAACACTTGC; GGATATTCTG 
AAAATCATAT AATCTTACAA TGAAAAGGAC 
CCAACCATAC AAAAATTCCT. TTTCCCGAAG 
TCTAAGATCT. AACAAGATAG CCACCGAGAT 
AGTTTTATTG TCCGTTTACT. TGTTTCAGAG 
TCTCCCATGA AGAAAGGGAA CGGTGAAGTA 
TAGTGGAAGC ATGATTGGTG CCCAGTTAGC 
GGAGGTTCAG TGAATTGTGT. AGGAGAGGTT 
CTTTCCCAAA TTGAATCACT GCTCACACTG 
TCCCACCCGA ACGTCTTATC TAATCATGAA s 
AAAAATCTAA" GTGTTTCATA AATTTGAGAG 
GTAGACAGTA* TATAACTAAC AACCAAAGAC 
TCATTTATCAv TATATATACA TACATGCATA 
AAACAGTATT GACTTGTATA CCITGTAATT 
TATCAATAAA- TAGACCATTA ATCAG 

Seg ID NO: 403 Protein sequence 
Protein Accession #*. NP_002407 

11 



PCT/US02/12476 . 



CAGTATATTT 
ATTTTTTAAT 
GCTGGCTCTT 
TTCAGGACAC 
ATGTTTGCTC 
GACTGTTTTT 
TTTATAGATC 
GAAAAGGGCT 
CCTTATCGAA 
TTTGTATTGT 
CTAAGCGCTA 
CTCTGCAGGA 
GTCTGTGGCC 
CTGATGATTT 
ACTCCCTAGT. 
TCTGTGACCC 
TACATATTGT 
CACTCTCAAA 
TGAAATATTT 



CCCTGTTTCT 
TGATGTTGTT 
TCCTGGCTAC 
TCACTACAGG 
AGCTTCTCCA 
AAAAAATATA 
AGCCAGTGAC 
TTCTCAATAA 
ACTCATTTTA 
GATT ATCAAT * 
GAGGAAGCAG 
TGTGGAAACC 
AGAATTTAAA 
AGAGTGCTGT' 
TCCTTCATGT 
ACTTACCTTGr. 
CACTGACACA 
GCAAATAATT 
TCTTTGTTAA 



TCCACAGTGC 
ATGGGCAGGA 
TCCATGTTGG 
GACCAGGGAT 
ACAATAAGAA 
CAGTTTACCG 
CAACCTTTTC 
GCCTCAGCTT 
GGCAAATATG 
TACCACACCA 
CCAAGTCGGT 
TCCTTCCAGG 
CCTATACTCA 
CCGGTGGAGA 
AACTTCCCTG*. 
CATCTCACAG ' 
CACGTTATAA 
TTTCACTTCA 
AATAGAATGG 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1660 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



1 11 . " 21 31 41 51 

I II I 1 I 

MKKSGVLFLL GIILLVLIGV QGTP.WRKGR CSCISTNQGT IKLQSLKDLK QFAPSPSCEK 
IEIIATLKNG VQTCLNPDSA DVKELIKKWE KQVSQKKKQK NGKKHQKKKV LKVRKSQRSR. 
QKKTT 

i ■ 

Seq ID NO: 404 DNA sequence 

Nucleic Acid Accession ft: NM_006670 

Coding sequence i 85 .. 1347 



60 
120 



1 

I 

CCGGCTCGCG 
AGCTCCGGGG 
GACGGGCGTC: 
TCTCCCACCT- 
TCCGCCCAGC 
CGCACAGTCA f 
GTGCGCAACC 
CGCCGGCCGC 
GTGCGCGCGG 
CCACTGGCCG 
AGTCCCCTTG 
CGGAGCTTCG 
OGCCGCTTGG. 
CTGCCCAGCC 
TCCTTCCGCA 
CTTCACAATG 
AACAATCCCT 
GAGGTAGTGC 
GTCCTCTTGG 
CAAACCTCTT 
GTTTTGTATT. 
AGGGATCACA 
AACCTCAGTT 
CATGAGATGT 
TAGATACAAC 
TTTCTCGGTG 
TTCTTTTTCT 
rGGGCTTCTT. 
ACAGATAGCA 
TATCAGTTTT 
CTGCAGACGT 
AGAGCATGCT 
TTCTTTGACA 
. TTTTAATAAA 
ATTCTTAAAA 



11 
I 

CCCTCCGGGC 
AAACGCGAGC 
TGCGGCTGGC 
CCTCGGCATC 
CCCCGCTGCC 
AGTGCGTTAA- 
TCTTCCTTAC 
CGCTGGCGGA 
GCGCCTTCGA 
ACCTCAGTCC 
TGGAACTGAT 
AGGGCATGGT 
AGCTGGCCAG 
TCAGGCACCT 
ACCTGACACA 
GCACCCTGGC 
GGGTCTGCGA 
AGGGCAAAGA 
AACTCAACAG. 
ATGTCTTCCT 
TGAACCGCAA 
TGGAAGGGTA 
CTAACTCGGA* 
AGACTTAAGC 
GGACTTTGAC 
TGTTCTGTTA 
TGGAACTCCT 
GCTGTCTGTC 
TTCAACAAAA 
ATTCTCATGT 
TAGCAGGCTC 
TACATTTTAC 
AAGTAAATTA 
CTGCATCGAG 
GAA* 



21 
I 

CCAGCCTCCC 
CGCGATGCCT 
GCGACTAGCG 

ggaccagtgc 
ccgcaatctg: 
cggcaaccag 

GCTGGCCGCG 
GCATCTGCCC 
CTTCGCTTTC 
CCTGAACCAC 
GGTGGCGGCC 
CAACCACTTC. 
GGACTTAAGT 
TCTAGAAAGC 
TGAGTTGCAA 
CTGCCACATG 
CCGGCTCACC 
TGCTGACCTG 
GGGTATTGTT 
GGGGATAAAA 
TCATTACAGA 
TGTCTGAGAA 
TTTATCCCTA 
TAAAAGCAGT 
ATGTAAGACG 
CAACACGTAT 
TCTCTCTCAG 
GCTGCCTCAA 
ACCTAAGTTG 
TTCAAAATAA 
TGTTCTGCAT 
CTTTTTTGAT 
ATCCAACCGA 



31 
I 

GAGCCTTCGG 
GGGGGGTGCT 
CTGGTACTCC 
TCCTCGGCGC 
CCCGOGCTGT 
ACCGAGGTGC 
CTGGCOGTGC 
CTCAACCTCA 
AGCCTGCGCC 
TCGGGCAGCA 
ATCGTGCCCC 
CTGCTGGCGG 
CTTTACCTGC 
AATAATTCGC 
CTCCACCTGG 
GGTCTACCCC 
GCAGACATGG 
TGTGCATATC 
GACTGTGACC 
TTAGCCCTGA 
AAGTGGATGC 
TATGAAATCA 
ATATTAGAGG 
CTAGGCTTGC 
GAAGGGGATT 
ATGAACAGTT 
GGAGGGATTT 
TACAGTTCAA 
CTTTTTCGAG 
TGGAGAAAAT 
CTCCATGGTG 
ATTACAAAAA 
TGCAGTTTAT 
CTGAATTGTT 



41 



AGOGGGCGCC 
CCCGGGGCCC 
TGGGCTGGGT 
CGTTCCTGGC 
GOGAGTGCTC 
CCACGGACCT. 
TCCCTGCCGG 
,.- GCGGCAGCCG 
AGCTCGACCT 
if ATGCCAGCGT 
: CTGAAGATGA 
!• GCCGTGCACT 
! CGCGGGATGT 
■TGGTGAGCCT 
AGGACAATGC 
ACATTAGGGT 
TGACCTGGCT 
CGGAAAAAAr 
CGATTCTTCC 
TAGGCGCTAT 
ATAACATCAG 
ATGCGGACCC 
ACAGACCAAG 
TCCACTTTCA 
TGCTTCCTTG 
GTGTATAGTG 
TTCAGGTTTC 
GGTGTAGCAA 
AAAAATACTT. 
AATTGCATCC 
CACAGGAGCA 
ATAACTTGCA 
ATGAAAATGT 
AAAAAAAAAA 



51 
I 

GTCCCAGCCC 

CGCCGCCGGG 

CTCCTCGTCT,; 

TTCCGCCGTG- 

CGAGGCAGCG-, 

GCCCGCCTAC. 

CGCCTTCGCC 

CCTGGACGAG 

CAGCCACAAC 

CTCGGCCCCC 

GCGGCAGAAC 

GCAGGGGCTC 

GCTGGCCCAA 

GACCTACGTG 

CCTCAAGGTC 

TTTCCTGGAC 

CAAGGAAACA 

GAGGAATCGG 

CCCATCCCTG- 

TTTCCTCCTG: 

AGATGCCTGC 

CAGATTAACA. 

GACAACTCTG- 

TCCTCCACTA 

TTATGTAAAG 

TTTTAOCCTC 

AGCATGAACA 

GTGTACCCAC 

TATTCATAAA- 

TATAAACTGC 

CCTGCATCCA 

ACTTCATAAC 

ACTGATTTTT 

AAAAATAAAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500. 
1560 
1620 
1680 
1740 
1800 
1860 . 
1920 
1980 
2040 



Seq ID NO: 405 Protein sequence 
Protein Accession #: NP 006661 . 



I 

MPGGCSRGPA 
QCPALCECSE 
AALNLSGSRL 
NHIVPPEDER 
LSNNSLVSLT 
HMADMVTWLK 
IVLALIGAIF 



11 
I 

AGDGRLRliAR 
AARTVKCVNR 
DEVRAGAFEH. 
QNRSFEGMW 
YVSFRNLTHL 
ETEWQGKDR 
LLVLYLNRKG 



21 

•I 

LALVLLGWVS 
NLTEVPTDLP 
LPSLRQLDLS 
AALLAGRALQ 
ESLHLEDNAL 
LTCAYPEKMR 
IKKWMHNIRD 



31 - 
I 

SSSPTSSASS 
AYVRNLFLTG 
HNPLADLSPF 
GLRRLELASN 
KVLHNGTLAE 
NRVLLELNSA 
ACRDHMEGYH 



41 
I 

FSSSAPFLAS 
NQLAVLPAGA 
AFSGSNASVS 
HFLYLPRDVL 
LQGLPHIRVF 
DLDCDPILPP 
YRVEINADPR 



51 
I 

AVSAQPPLPD 
FARRPPIAEL 
APSPLVELIL 
AQLPSLRHLD 
LDNNPWVCDC 
SLQTSWFLG 
LTNLSSNSDV 



60 
120 
180 
240 
300 
360 



Seq ID NO: 406 DNA sequence. 

Nucleic Acid Accession .#: Eos sequence 



335 



WO 02/086443 

Coding sequence: 1. 



PCT/US02/12476 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



1 
1 

ATGCCTGGGG 
CTAGCGCTGG 
TTCTCCTCCT 
CAGTGCCCCG 
AATCTGACCG' 
AACCAGCTGG 
AGCCTCAGGC 
CGCAACCTGA 
AATGGCACCC 
CCCTGGGTCT, 
GTGCAGGGCA? 
TTGGAACTCA 
TCTTATGTCT: 
TATTTGAACC 
CACATGGAAG 
AGTTCTAACT 



11 

I . . . 

GGTGCTCCOG 

TACTCCTGGG. 

CGGCGCCGTT 

CGCTGTGCGA 

AGGTGCCCAC 

CCAGCAACCA 

ACCTGGACTT. 

CACATCTAGA 

TGGCTGAGTT 

GCGACTGCCA 

AAGACCGGCT 

ACAGTGCTGA- 

TCCTGGGTAT 

GCAAGGGGAT 

GGTATCATTA 

CGGATGTCCT 



21 

i * 

GGGCCCCGCC 
CTGGGTCTCC 
CCTGGCTTCC 
GTGCTCCGAG 
GGACCTGCCC 
CTTCCTTTAC 
AAGTAATAAT 
AAGCCTCCAC 
GCAAGGTCTA 
CATGGCAGAC 
CACCTGTGCA* 
CCTGGACTGT 
TGTTTTAGCC 
AAAAAAGTGG 
CAGATATGAA 
CGAGTGA 



31 
I 

GCCGGGGACG 
TCGTCTTCTC 
GCCGTGTCCG 
GCAGCGCGCA 
GCCTACGTGC 
CTGCCGOGGG 
TCGCTGGTGA 
CTGGAGGACA- 
CCCCACATTA 
ATGGTGACCT 
TATCCGGAAA 
GACCCGATTC 
CTGATAGGCG 
ATGCATAACA- 
ATCAATGCGG 



41 
I 

GGOGTCTGCG 
CCACCTCCTC 
CCCAGCCCCC 
CAGTCAAGTG 
GCAACCTCTT 
ATGTGCTGGC 
GCCTGACCTA 
ATGCCCTCAA 
GGGTTTTCCT 
GGCTCAAGGA- 
AAATGAGGAA 
TTCCCCCATC 
CTATTTTCCT 
TCAGAGATGC 
ACCCCAGATT 



51 
I ' 

GCTGGOGCGA' 
GGCATCCTCC; 
GCTGCOGGAC 
CGTTAACCGC 
CCTTACOGGC 
CCAACTGCCC 
CGTGTCCTTC 
GGTCCTTCAC 
GGACAACAAT. 
AACAGAGGTAv 
TCGGGTCCTC 
CCTGCAAACC 
CCTGGTTTTG 
CTGCAGGGAT 
AACAAACCTC 



Seq ID NO: 407 Protein sequence 
Protein Accession ft: Eos sequence 

l 11 



51 



21 31 41 

| I I I I I 

MPGGCSRGPA AGDGRLRLAR. LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD. 
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 
SLRHLDLSNN SLVSLTYVSF. RNLTHLESLH LEDNALKVLH NGTLAELQGL PHIRVFLDNN 
PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 
SYVFLGIVLA LIGAIFLLVI* YLNRKGIKKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 
SSNSDVLE 

Seq ID. NO: 408 DNA sequence 

Nucleic Acid Accession #: NM_000095.1 

Coding sequence: 26.. 2299 



1 
1 

CAGCACCCAG 
CCTGGCTGCC 
GCAGATGCTT 

gcggcagcag 
cgggatgcag: 
gcccggcttc 
cccctgcccc 
cgcccacccc 
ggcttgcccg 
ggccaacaag- 
ccccaactcc 
cttcgtgggc 
ctcgcccagc; 
gtgcgtgtgt. 
agacggcttc. 
cgtgactgtg: 
ogatccggat 
gaacccagac 

GTCCCAGAAG 
CGACGACATC 
CTCAGACCAG 
GAAGAGCAAC 
CGATCAAGAC 
TAACAGTGCC 
CGACAATGAC 
GGAGGACGCG 
GGTGGTAGAC 
GGCCTTCCAG 
GGTGCTCAAC 
GGGTTACACT 
GGATGACGAC 
CATGTGGAAG 
GCCTGGCATC 
CGCTCTGTGG 
AAACGTGGGT 
GGGCTACATC 
CTTGGACACA 
CATCTGGGCC 
TCAGCTGCGG. 
GCGGCTGGAT 
AAGGGCTCAG 



11 
I 

CTCCCCGCCA 
CTCGGGGCGT 
CGGGAACTGC 
GTCAGGGAGA 
CAGTCAGTAC 
TGCTTCCCCG 
GCGGGCTTCA 
TGCTTCCCCG 
CCGGGGTACA 
CAGGTTTGCA- 
GTGTGCATCA 
GACCAGGCGT' 
GAGTGCCACG 
CGCGTTGGCT 
CCGGACGAGA 
CCCAACTCAG 
GCCGACGGGG. 
CAGCGCAACA 
AACGACGACC 
GACGGCGACC 
AAGGACAGTG 
CCGGATCAGG 
CAGGATGGAG 
CAGGAGGACT 
GGAGTCCCTG 
GACAGGGACG 
AAGATCGACG. 
ACAGTCGTGC 
CAGGGAAGGG 
GCCTTCAATG 
TATGCGGGCT. 
CAGATGGAGC 
CAACTCAAGG 
CATACAGGAG 
TGGAAGGACA 
AGGGTGCGAT 
ACCATGCGGG 
AACCTGCGTT 
CAAGCCTAGG 
GGGGGCTCTG 
AGAGGACAAA 



21.. 
I 

CCGCCATGGT 
CCGGACAGGG 
AGGAAACCAA 
TCACGTTCCT 
GCACCGGCCT 
GCGTGGCCTG 
CGGGCAACGG 
GAGTCCGCTG 
GCGGCCCCAC 
CGGACATCAA 
ACACCCGGGG 
CCGGCTGCCA 
AGCATGCAGA 
GGGCCGGCAA 
AGCTGCGCTG 
GGCAGGAGGA' 
ACGGGGTCCC 
CGGACGAGGA 
AAAAGGACAC 
' GGATCCGCAA 
ATGGCGATGG 
CGGATGTGGA 
ACGGACATCA 
CAGACCACGA 
ACAGTCGGGA 
GCGTGGGCGA 
TGTGTCCGGA 
TGGACCCGGA 
AGATCGTGCA 
GCGTGGACTT 
TCATCTTTGG 
AAACGTATTG 
CTGTGAAGTC 
ACACAGAGTC 
AGAAGTCCTA 
TCTATGAGGG 
GTGGCCGCCT 
ACCGCTGCAA 
GACCAGGGTG 
CACCCAGCCC 
ATAAAGTGTG 



31 
I 

CCCCGACACC 
CCAGAGCCCG 
CGCGGCGCTG 
GAAAAACACG 
ACCCAGCGTG 
CATCCAGACG 
CTCGCACTGC 
TATCAACACC 
CCACCAGGGC 
CGAGTGTGAG' 
CTCCTTCCAG 
GCGCGGCGCA 
CTGCGTCCTA 
CGGGATCCTC 
CCCGGAGCCG 
TGTGGACCGC 
CAATGAAAAG 
CAAGTGGGGC 
AGACCAGGAC 
CCAGGCCGAC 
TATAGGGGAT 
CCACGACTTT 
GGACTCTCGG 
TGGCCAGGGT 
CAACTGCCGC 
CGTGTGCCAG 
GAACGCTGAA 
GGGTGACGOG 
GACAATGAAC 
CGAGGGCACG 
CTACCAGGAC 
GCAGGCGAAC 
TTCCACAGGC 
CCAGGTGCGG 
TCGTTGGTTC 
CCCTGAGCTG 
GGGGGTCTTC 
TGACACCATC 
AGGACCCGCC 
AAGGGGTGGC 
TGTGCAGGG 



41 

I 

GCCTGOGTTC. 
TTGGGCTCAG 
CAGGACGTGC 
GTGATGGAGT 
CGGCCCCTGC 
GAGAGCGGCG 
ACCGACGTCA 
AGCCCGGGGT 
GTGGGGCTGG 
ACCGGGCAAC 
TGCGGCCCGT 
CAGCGCTTCT 
GAGCGCGATG 
TGTGGTCGCa 
CAGTGCCGTA 
GATGGCATCG 
GACAACTGCC 
GATGCGTGCG 
GGCCGGGGCG 
AACTGCCCTA 
GCCTGTGACA. 
GTGGGAGATG 
GACAACTGTC 
GATGCCTGCG 
CTGGTGCCTA 
GACGACTTTG 
GTCACGCTCA 
CAGATTGACC 
AGCGACCCAG 
TTCCATGTGA 
AGCTCCAGCT 
CCCTTCCGTG 
CCCGGGGAAC 
CTGCTGTGGA- 
CTGCAGCACC 
GTGGCCGACA 
TGCTTCTCCC 
CCAGAGGACT 
GGATGACAGC 
CGTCCTGAGG 



51 
I 

ttctgctcac 
acctgggccc. 
gggactggct: 
gtgacgcgtg; 
tccactgcgc 
gccgctgcgg 

ACGAGTGCAA 
TCCGCTGCGA 
CTTTCGCCAA' 
ATAACTGCGT 
GCCAGCCCGG 
GCCCCGACGG 
GCTCGCGGTC 
ACACTGACCT 
AGGACAACTG: 
GAGACGCCTG:. 
CGCTGGTGCG 
ACAACTGCCG 
ATGCGTGCGA 
GGGTACCCAA 
ACTGTCCCCA 
CTTGTGACAG 
CCACGGTGCC 
ACGACGACGA 
ACCCCGGCCA 
ATGCAGACAA 
CCGACTTCAG 
CCAACTGGGT 
GCCTGGCTGT 
ACACGGTCAC. 
TCTACGTGGT 
CTGTGGCCGA 
AGCTGCGGAA 
AGGACCCGCG 
GGCCCCAAGT 
GCAACGTGGT 
AGGAGAACAT 
ATGAGACCCA 
CACCCTCACC 
GGGAAGTGAG 



60 
120 
180 
240 
300 * 
360 
420 
480 
540 
600 
660 
720 
780 
'840 
900 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 . 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



Seq ID NO: 409 Protein sequence 
Protein Accession ft-. NP_000086.1 

1 11 21 31 41 51 

I | I I I I 

KVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 



60 



336 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

FLKNTVMECD ACGMQQSVRT GLPSVRPLLH 
NGSHCTDVNE CNAHPCFPRV RCINTSPGFR 
INBCETGQHN. CVPNSVCINT RGSPQCGPCQ 
ADCVLERDGS RSCVCRVGWA GNGILCGRDT 
EDVDRDGIGD ACDPDADGDG VPNEKDNCPIi 
DTDQDGR6DA- .CDDDID6DRI' RNQADNCPRV 
VDHDFVGDAC DSDQDQDGDG* HQDSRDNCPT 
RDNCRLVFNP. GQEDADRDGV GDVCQDDFDA 
PEGDAQIDPN WWLNQGREI. VQTMNSDPGI* 
PGYQDSSSFY WMWKQMEQT YWQANPFRAV 
ESQVRLLWKD PRNVGWKDKK SYRKFLQHRP 
RLGVFCFSQE NIIWANLRYR CNBTIPEDYE 



Seq ID NO: 410 DNA sequence 

Nucleic Acid Accession 8: NMJ>01565.1 

Coding sequence: * 67 . .363 



CAPGFCFPGV 
CEACPPGYSG 
PGFVGDQASG. 
DLDGFPDEKL. 
VRNPDQRNTD 
PNSDQKDSDG 
VPNSAQEDSD 
DKWDKIDVC 
AVGYTAFNGV 
AEPGIQLKAV 
QVGYIRVRFY- 



ACIQTESGGR 
PTHQGVGLAF 
CQRGAQRFCP 
RCPEPQCRKD 
EDKKGDACDN 
DGIGDACDNC 
HDGOGDACDD 
PENAEVTLTD 
DFEGTFHVNT- 
KSSTGPGEQL 
EGPELVADSN 



CGPCPAGFTG 
AKANKQVCTD 



NCVTVPNSGQ 
CRSQKNDDQK 
PQKSNPDQAD 
DDDNDGVPDS 
FRAFQTWLD 
VTDDDYAGFI 
RNALWHTGDT 
WLDTTMRGG. 



GAGACATTCC 
AGCACCATGA 
ATTCAAGGAG 
CCTGTTAATC 
CGTGTTGAGA 
TCGAAGGCCA 
TAAAACCAGA 
CCTCTCCCAT 
GTTACACTAA 
GGTTAATGTT 
GCTCTACTGA 
ACCTTTCCCA 
TCAGAATCTC 
ACTTCATGGA 
CATACAATTC 
CTTATTTAAT 
TTTCAGTGTA 
TTTTAAAAAT 
TTTTCAAATA 



11 

I 

TCAATTGCTT 
ATCAAACTGC 
TACCTCTCTC 
CAAGGTCTTT 
TCATTGCTAC 
TCAAGAATTT 
GGGGAGCAAA 
CACTTCCCTA 
AAGGTGACCA 
CATCATCCTA 
GGTGCTATGT. 
TCTTCCAAGG 
AAATAACTAA 
CTTCCACTGC 
CAAACACATA 
GAAAGACTGT 
CATGGAATAA 
ACAGATAGAT. 
AAAATGAGGT 



21 
I 

AGACATATTC 
GATTCTGATT 
TAGAACCGTA 
AGAAAAACTT 
AATGAAAAAG 
ACTGAAAGCA 
ATCGATGCAG" 
CATGGAGTAT 
ATGATGGTCA 
AGCTATTCAG 
TCTTAGTGGA- 
GTACTAAGGA 
AAGGTATGCA 
CATCCTCCCA 
CAGGAAGGTA 
ACAAAGTATA 
CATGTAATTA 
ATATGCTCTG . 
ACTCTCCTGG 



31 

! 

TGAGCCTACA 
TGCTGCCTTA' 
CGCTGTACCT. 
GAAATTATTC 
AAGGGTGAGA 
GTTAGCAAGG ■ 
TGCTTCCAAG 
ATGTCAAGCC 
CCAAATCAGC 
TAATAACTCT 
TGTTCTGACC 
ATCTTTCTGC: 
ATCAAATCTG 
AGGGGCCCAA 
GAAATATCTG 
AGTCTTAGAT 
AGTACTATGT 
CATGTTACAT 
AAATATTAAG 



41 
I 

GCAGAGGAAC 
TCTTTCTGAC 
GCATCAGCAT 
CTGCAAGCCA- 
AGAGATGTCT 
AAATGTCTAA 
GATGGACCAC 
ATAATTGTTC 
TGCTACTACT 
ACCCTGGCAC 
CTGCTTCAAA 
TTTGGGGTTT 
CTTTTTAAAG 
ATTCTTTCAG 
AAAATGTATG 
GTATATATTT 
ATCAATGAGT 
AAGATAAATG 



51 
I 

CTCCAGTCTC 
TCTAAGTGGC 
TAGTAATCAA 
ATTTTGTCCA 
GAATCCAGAA 
AAGATCTCCT 
ACAGAGGCTG 
TTAGTTTGCA 
CCTGTAGGAA 
TATAATGTAA-- 
TATTTCCCTC 
ATCAGAATTC 
AATGCTCTTT 
TGGCTACCTA 
TGTAAGTATT 
CCrATATTGT 
AACAGGAAAA 
TGCTGAATGG 



Seq ID NO: 411 Protein sequence 
Protein Accession NP_001556.1 



11 



21 31 41 51 

1 ■ I 1 I I I 

MNQTAILICC LIFLTLSGIQ GVPIiSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 
EIIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP 

Seq ID NO: 412 DNA sequence 

Nucleic Acid Accession #: XM_057014 

Coding sequence : 143 . . 874 



GGGAGGGAGA 
CGCGGCGGAG 

cgctgcccgg 
ccgcggcctc 
ccccaagggg 
aatgtgctta: 
cattccgggt 
tctgagggaa 

ATTGAATTAT. 
AAATAGTGCT 
CTGTCAGCGT 
AGCTATAATT 
CACTTCTTCT 
CTGGGTTGGC 
TTCTCGCATC 
TTTTTTTATT 
CATCTGAATG 
TTTAAATCTA 
TGGTTAGAAT 
GGTCTTTTGT 
TGTACAATTT 
CAACCTTAAA 



11 
I 

GAGGCGCGCG 
CCAGACGCTG 
CAGCCGGGAG 
CTGCTGCTCC. 
AAGCAAAAGG 
CAAGGGCCAG. 
ACACCTGGGA 
AGCTTTGAGG: 
GGCA7AGATC 
CTAAGAGTTT 
TGGTATTTCA- 
TATTTGGACC 
GTGGAAGGAC 
ACTTGTTCAG 
ATTATTGAAG 
ATGCCTTGGA- 
AAAAGCAAAG. 
GCATTATTCA 
ACTTTCTTCA" 
TTTTTCTCTT 
GTAAATGTTA 
AAAAAAAAAA 



21 
I 

GGTGAAAGGC 
ACCACGTTCC- 
CCATGCGACC 
TGCTGCTGCA 
CGCAGCTCCG- 
CAGGAGTGCC 
TCCCAGGTCG 
AGTCCTGGAC 
TTGGGAAAAT 
TGTTCAGTGG. 
CATTCAATGG 
AAGGAAGCCC 
TTTGTGAAGG 
ATTACCCAAA 
AACTACCAAA 
ATGGTTCACT 
CTAAATATGT 
TTTTGCTTCA 
TAGTCACATT 
AGTATAGCAT 
AGAATTTTTT 
AAAA 



31 
1 

GCATTGATGC 
TCTCCTCGGT, 
CCAGGGCCCC 
GCTGCCCGCG 
GCAGAGGGAG 
TGGTCGAGAC 
GGATGGATTC 
ACCCAACTAC 
TGCGGAGTGT 
CTCACTTCGG 
AGCTGAATGT 
TGAAATGAAT 
AATTGGTGCT 
AGGAGATGCT 
ATAAATGCTT 
TAAATGACAT 
TTACAGACCA 
ATCAAAAGTG 
CTCTCAACCT 
TTTTAAAAAA 
TTATATCTGT 



41 

I 

AGCCTGCGGC 
CTCCTCCGCC 
GCCGCCTCCC 
CCGTCGAGCG 
GTGGTGGACC 
GGGAGCCCTG 
AAAGGAGAAA 
AAGCAGTGTT 
ACATTTACAA 
CTAAAATGCA 
TCAGGACCTC 
TCAACAATTA 
GGATTAGTGG 
TCTACTGGAT 
TAATTTTCAT 
TTTAAATAAG 
AAGTGTGATT 
GTTTCAATAT 
ATAATTTGGA 
ATATAAAAGC 
TAAATAAAAA 



51 
I 

GGCCTCGGAG 
TCCAGCTCCG 
CGCAGCGGCT 
CCTCTGAGAT. 
TGTATAATGGv 
GGGCCAATGG 
AGGGGGAATG 
CATGGAGTTC 
AGATGCGTTC 
GAAATGCATG 
TTCCCATTGA 
ATATTCATCG 
ATGTTGCTAT 
GGAATTCAGT 
TTGCTACCTC. 
TTTATGTATA 
TCACACTGTT 
TTTTTTTAGT 
ATATTGTTGT ' 
TACCAATCTT 
TTATTTCCAA 



Seq ID NO:' 413 Protein sequence 
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GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGH NSVSRIIIEE 

LPK 



120 
•180- 
240 
300 
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CTCGTGCCGA ATTCGGCACG AGACCGCGTG ' TTCGCGCCTG GTAGAGATTT CTCGAAGACA 60 

CCAGTGGGCC. CGTGTGGAAC CAAACCTGCG * CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 120 

GCGGAGACGA. AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180 

TCTCTGTCAC AAATCCCCTT. CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240 

AAATTAGTCC GAATTGGGAA TCTGGCATTA * ATGTTGACTT. GGCAATTTCC ACACGGCAAT 300 

ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 360 

TCAGAAAATT ACTTCAAAAT ATAGGCATAG. ATAAGATTAA AAGAATCCAT ATACACCATG 420 

ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGOGTCAC TCAGACCATG 480 

AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCCCAC CATAATCATG- 540 

CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GAT AGTTCAG 600 

GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG: ACCAGAACAT GCCAGTGGTA 660 

GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720 

TCTCTGAAGG AACTCACTTT. CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 780 

CCAAAGATGT AAGCAGCTCC ' ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC. 840 

TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900 

GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG • 960 

GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020 

TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTGATACAAG TGAAAAGAAG GCTGAAATCC 1080 

CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA • 1140 

TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200 

AATTTCTCCT GAGTTTCCTT GTGGCACTGG COGTTGGGAC TTTGAGTGGT GATGCTTTTT 1260 

TACACCTTCT- TCCACATTCT CATGCAAGTC ' ACCACCATAG TCATAGCCAT GAAGAACCAG 1320 

CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC AT AGAAGAAA 1380 , 

GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440 

TTCTTGTTGA ACATGTCCTC. ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC " 1500 

AGAAGAAACC TGAAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 

CTCAACTTTC AACAAATGAG GAGAAAGTAG' ATACAGATGA TCGAACTGAA GGGTATTTAC 1620 

GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG . 1680 

AAGAGGTCAT GAT AG CT CAT • GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 

GGTGCAAGAA- TAAATGCCAT- TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1800 

TTCACCACCA" TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 1860 

CTCACAGTCA CAGCCAGCGC TACTCTCGGG • AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920. 

TGGCCTGGAT GGTGATAATG" GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 

GTGCTGCTTT-- TACTGAAGGC : TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC • 2040 

ATGAGTTGCC TCATGAATTA GGTGACTTTG- CTGTTCTACT AAAGGCTGGC ATGACCGTTA • 2100 

AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160 

GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 

GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280 

GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG - 2340 

GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT. CGTATAAATT 2400 

TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 

AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520 

TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580 

TATTCTATCT TGGAGATAAA ATCTGTATGT. GCAATTCACC GGTATTACCA GTTTATTATG 2640 

TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700 

TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760 

TGTTTAGGAA TAAGAATGTG CATGAAGCCT. AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 

AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880 

AAAAATCACA AAATTTGTTG TAAATTAGAG. GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940 

CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000 

GTAGTGAGCA CTCTCATATA" CTAATTAGTG. TACATTTAAC TTTGTATAAT ACAGAAATCT 3060 

AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA' GAAATTGGAA TTTCAAAATG 3120 

TTCGTGCGGG TTATATACCA" GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180 

TATTGCCAAG TTATATATCA ■ CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 

CAAAATTATC AGAGTAGTAA AACTTTGATA- TATATGAGGA TATTAAAACT ACACTAAGTA 3300 

TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360 

GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATGTTTCTT TTTTACACAA TAAATTCCTT ' ATATCAGCTT G 

Seq ID NO: -415 Protein sequence 
Protein Accession #: XP_084007 • 

1 " 11- 21 31 41 51; 

I | ' I I I I 

MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60 

FYRYGENNSL SVBGFRKLLQ NIGIDKIKRI HIHHEHDHHS DHEHHSDHER HSDKEHHSDH 120 

■ EHHSDHDHES ■ HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 180. 

SVSASEVTST VYNTVSEGTH PI»ETIETPRP GiCLPPKDVSS STPPSVTSKS RVSRIAGRKT 240 

NBSVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGKGIQV PLNATEFNYL CPAIINQIDA 300 

RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 360 

LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420 

TWKGLTALGG LYFMFLVEHV. LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYBSQLSTN- 480 

EEKVDTDDRT EGYLRADSQE PSHFDSQQPA" VLEEEEVMIA HAHPQEVYNB YVPRGCKNKC 540 

HSHFHDTIiGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600 

MGDGLHNFSD GIiAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660 

NALSAMLAYL GMATGIFIGH YAENVSMWIP ALTAGLFMYV ALVDMVPSML HNDASDHGCS 720 
RHGYFFLQNA GMLLGFGIML LISIFEHKIV FRINP 

Seq ID NO: 416 DNA sequence 

Nucleic Acid Accession ft: NM_015419.1 

Coding sequence :. 1 8487 
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ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCOGTGGTGC PGATCCTGCT TTGGGGCCAT 60 

CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT ACGTCCCCAG CGAGGTCCAC 120 

TGCACGTTCC GATCCCTGGC TTCCGTGCCC GCTGGCATTG CTAGACACGT GGAAAGAATC 180 

AATTTGGGGT TTAATAGCAT ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG 240 

TTGGAGCTAC TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA ■ 300 

GACCTCAGCT/. CTCTTCAGGT. TTTCAAGTrC AGCTACAACA- AGCTGAGAGT GATCACAGGA 360 

CAGACCCTCC AGGGTCTCTC TAACTTAATG . AGGCTGCACA TTGACCACAA CAAGATCGAG 420 

TTTATCCACCr CTCAAGCTTT CAACGGCTTA ACGTCTCTGA GGCTACTCCA TTTGGAAGGA- 480 

AATCTCCTCC. ACCAGCTGCA CCCCAGCACC TTCTCCACGT TCACATTTTT- GGATTATTTC 540 

AGACTCTCCA* CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 600 

AGCATGCTTC" GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA TCCGTGGACC- 660 

■ TGCGATTGTG - AGATGAGATG GTTTTTGGAA TGGGATGCAA AATCCAGAGG AATTCTGAAG 720 

TGTAAAAAGG- ACAAAGCTTA TGAAGGOGGT CAGTTGTGTG. CAATGTGCTT CAGTCCAAAG 780 
AAGTTGTACA- AACATGAGAT ACACAAGCTG. AAGGACATGA CTTGTCTGAA GCCTTCAATA " 840 

GAGTCCCCTC TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 900 

GATGGTGGCA- GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG CATCTCTTTG 960 

AATATGACCG ACGAGCACGG GAACATGGTG. AACTTGGTCT GTGACATCAA GAAACCAATG 1020 

GATGTGTACAr AGATTCACTT GAACCAAA CG GATCCTCCAG. ATATTGACAT AAATGCAACA - 1080 

GTTGCCTTGG ACTTTGAGTG TCCAATGACC CGAGAAAACT. ATGAAAAGCT ATGGAAATTG " 1140 

ATAGCATACT ACAGTGAAGT- TCCCGTGAAG^ CTACACAGAG AGCTC ATGC T CAGCAAAGAC 1200 

CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA* CTACACAGGT 1260 

GTGAGAGCCC - AGATTCTTGC AGAACCAGAA TGGGTCATGC AGCCATCCAT AGATATCCAG 1320 

CTGAACCGAC GTCAGAGTAC GGCCAAGAAG GTGCTACTTT CCTACTACAC CCAGTATTCT 13 80- 

CAAACAATAT CCACCAAAGA TACAAGGCAG GCTCGGGGCA- GAAGCTGGGT AATGATTGAG 1440' 

CCTAGTGGAG CTGTGCAAAG" AGATCAGACT GTCCTGGAAG.' GGGGTCCATG CCAGTTGAGC 1500 

TGCAACGTGA AAGCTTCTGA- GAGTCCATCT- ATCTTCTGGG TGCTTCCAGA TGGCTCCATC. 1560 

CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA" TTCTCAGCAG- TGGCTGGCTG 1620 

AGGATCAAGT CCATGGAGCC ATCTGACTCA GGCTTGTACC AGTGCATTGC. TCAAGTGAGG 1680 

GATGAAATGG: ACCGCATGGT ATATAGGGTA CTTGTGCAGT- CTCCCTCCAC TCAGCCAGCC 1740 

G AG AAAGACA CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT; GCCTTGCAAT 1800 

GCTTTAGCAA-. TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG GATAATTAAT; I860 

GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG, GAACTCTTTC CATCCCAAAG: 1920 * 

GTCCAAGTCA GTGATAGTGG TTACTACAGA TGTGTGGCTG TCAACCAG CA' " AGGGGCAGAC 1980 

CATTTTACGG TGGGAATCAC AGTGACCAAG AAAGGGTCTG: GCTTGCCATC. CAAAAGAGGC 2040 

AGACGCCCAG: GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA GGATGAAGGG- 2100 

GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC TTCTGCATCC . AAAGGACCAA 2160 

GAGGTGTTCC; TCAAAACAAA GGATGATGCC' ATCAATGGAG; ACAAGAAAGC CAAGAAAGGG 2220 

AGAAGAAAGC TGAAACTCTG GAAGCATTCG. GAAAAAGAAC CAGAGACCAA- TGTTGCAGAA 2280 

GGTCGCAGAG TGTTTGAATC TAGACGAAGG. ATAAACATGG CAAACAAACA GATTAATCCG 2340 

GAGCGCTGGG. CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA GGGCACAGAA* 2400 

GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA GCCTAGAAGT CACACCACCT . 2460 

TTTCCTGCTG TTTCTCCCCC CTCAGCATCT CCTGTGCAGA CAGTAACCAG TGCTGAAGAA 2520 

TCCTCAGCAG ATGTACCTCT ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA • 2580 

GCCAGCATGG GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC, TGAAGTAACA 2640 

AGCACACCTC TGGAGGAAGT. TGTTGATGAC CTTTCTGAGA AGACTGAGGA GATAACTTCC 2700 

ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA TATCTGAGCC TTATGAACCA 2760 

TCTCCTACTC TGCACACATT AGACACAGTC TATGAAAAGC CCACCCATGA AGAGACGGCA 2820 

ACAGAGGGTT GGTCTGCAGC AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGA GTAT 2880 

GAGCCTCCAT. TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 2 940 

GATTTGGAGA CTAAGTCACA ACCAGATGAG; GATAAGATGA AAGAAGACAC CTTTGCACAC 3000 

CTTACTCCAA CCCCCACCAT. CTGGGTTAAT .GACTCCAGTA CATCACAGTT ATTTGAGGAT 3060 

TCTACTATAG GGGAACCAGG TGTCCCAGGC CAATCACATC TACAAGGACT GACAGACAAC 3120 

ATCCACCTTG TGAAAAGTAG TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG 3180 

AAAGAGATGT CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 3240 

AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA CTCCACACTG 3300 

GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG AAACCACAGT TGGTACCCTC 3360 

CTAGACAAAG ACACCACAAC AGTAACAACA ACACCAAGGC AAAAAGTTGC TCOGTCATCC 3420 

ACCATGAGCA CTCACCCTTC TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC 3480 

AAATTCCGCC ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT- 3540 

TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA GAGTTCTCTG 3600 

GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA AACAGTTGGA AATGGAGAAG 3 660 

AATGCAGAAC CCACATCCAA GGGAACACCA CGGAGAAAAC ACGGGAAGAG GCCAAACAAA 3720 

CATCGATATA CCCCTTCTAC AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT 3780 

CCAGAAAATA AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 3840 
ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC AACCACCAGA .3900 

AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC CAGTCACATA TAAACCCACA 3 960 

TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG TTGACAAACA TAAAAGTGAC 4020 

ATTTTAGTCA CTGGTGAATC AATTACTAAT GCCATACCAA CTTCTCGCTC CTTGGTCTCC 4080 

ACTATGGGAG AATTTAAGGA AGAATCCTCT. CCTGTAGGCT TTCCAGGAAC TCCAACCTGG- 4140 

AATCCCTCAA GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 4200 

GGGGAAAATC TTACAGACCC TCCCCTTCTT- AAAGAGCTTG AGGATGTGGA TTTCACTTCC 4260 

GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC AGGAAGAAGC TGGTTCTTCC 4320 

ACAACTCTCT CAAGCATAAA AGTGGAGGTG GCTTCAAGTC AGGCAGAAAC CACCACCCTT 4380 

GATCAAGATC ATCTTGAAAC CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT 4440 

CACACCCCTA CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 4500 

ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG AATATCTCAA- 4560 

GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG TGGGGAATCC AGAAACAGAA 4620 

GCAACCCCAG. TCAACAATGA AGGAACACAG CATATGTCAG GGCCAAATGA ATTATCAACA 4680 
CCCTCTTCCG ACCGGGATGC ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA - 4740 

TTTGGTAGTA GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 4800 

GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC AACAGTGAGG 4860 

CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG TAACTTCCCA GTCACCTCGT 4920 

CACTGGACCA ACAAACCGGA AATAACTACA TATCCTTCTG GGGCTTTGCC AGAGAACAAA 4980 

CAGTTTACAA CTCCAAGATT ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA 5040 
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CCCAGCATTC CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 5100 
GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG- TTGGAAAGCC TCCCAGTCCA. 5160 
AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA CCAACAAGAC TCTTTCTTTT, 5220 
CCACAGTTGG GAGTCACCCG GAGACCCCAG ATACCCACTT CTCCTGCCCC AGTAATGAGA -„ 5280 
GAGAGAAAAG TTATTCCAGG TTCCTACAAC . AGGATACATT CCCATAGCAC CTTCCATCTG , 5340 
GACTTTGGCC CTCCGGCACC TCC GTTGTT G CACACTCCGC AGACCACGGG ATCACCCTCA' 54 00 
ACTAACTTAC AGAATATCCC TATGGTCTCT- TCCACCCAGA GTTCTATCTC CTTTATAACA 5460 
TCTTCTGTCC AGTCCTCAGG- AAGCTTCCAC CAGAGCAGCT CAAAGTTCTT- TGCAGGAGGA 5520 
CCTCCTGCAT CCAAATTCTG GTCTCTTGGG GAAAAGCCCC AAATCCTCAC CAAGTCCCCA 5580 
CAGACTGTGT. CCGTCACCGC TGAGACAGAC ACTGTGTTCC; CCTGTGAGGC AACAGGAAAA 5640. 
CCAAAGCCTT TCGTTACTTG GACAAAGGTT- TCCACAGGAG' CTCTTATGAC TCCGAATACC 5700 
AGGATACAAC. GGTTTGAGGT TCTCAAGAAC GGTACCTTAGi- TGATACGGAA GGTTCAAGTA 5760. 
CAAGATCGAG. GCCAGTATAT- GTGCACCGCC AGCAACCTGC- ACGGCCTGGA CAGGATGGTG 5820 
GTCTTGCTTT CGGTCACCGT GCAGCAACCT- CAAATCCTAG;. CCTCCCACTA CCAGGACGTC 5880 
ACTGTCTACC TGGGAGACAC CATTGCAATG; GAGTGTCTGG CCAAAGGGAC: CCCAGCCCCC* 5940 
CAAATTTCCT , GGATCTTCCC TGACAGGAGG. GTGTGGCAAA* CTGTGTCCCC CGTGGAGAGC ' 6000 
CGCATCACCC TGCACGAAAA' CCGGACCCTT TCCATCAAGG? AGGCGTCCTT. CTCAGACAGA^ 6060 
GGCGTCTATA AGTGCGTGGC : " CAGCAATGCA* GCCGGGGCGG ACAGCCTGGC CATCCGCCTG:. 6120 
CACGTGGCGG CACTGCCCCC CGTTATCCAC: CAGGAGAAGC TGGAGAACAT CTCGCTGCCC- 6180 
CCGGGGCTCA GCATTCACAT TCACTGCACT GCCAAGGCTG CGCCCCTGCC ' CAGCGTGCGC 6240 . 
TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT- TCCTCCAOGG GAACTTGTTT"- 6300 
GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC CCAAGGACAG CGGGCGCTAT ' 6360 
GAGTGCGTGG CCGCCAACCT • GGTAGGCTCC GOGCGCAGGA CGGTGCAGCT GAACGTGCAGv 6420 
CGTGCAGCAG CCAACGCGCG , CATCACGGGC ACCTCCCCGC GGAGGACGGA CGTCAGGTAC' 6480 
GGAGGAACCC TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 6540 
AGGCTGCCGT. CCAAGAGGAT- GATCGACGCG CTCTTCAGTT TTGATAGCAG AATCAAGGTG" 6600 
TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGACGGACA' AAGATGCCGG AGATTACCTG" 6660 
TGCGTAGCTC ; GAAATAAGGT TGGTGATGAC TACGTGGTGC TCAAAGTGGA TGTGGTGATG 6720 
AAACCGGCCA AGATTG AACA - CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC. 6780 
CTGAAAGTGG ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA" 6840 
GACGGGAGTC TGGTGAACTC- CTTCATGCAG TCGGATGACA GCGGTGGACG CACCAAGCGC 6900 
TATGTCGTCT TCAACAATGG - GACACTCT AC . TTTAACGAAG TGGGGATGAG GGAGGAAGGA 6960 
GACTACACCT GCTTTGCTGA* AAATCAGGTC GGGAAGGACG AGATGAGAGT CAGAGTCAAG 7020 
GTGGTGACAGr CGCCCGCCAC: CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT 7080 
GGAGACGTGG TCACTGTAG C ■ CTGTGAGGCC ; AAAGGAGAAC CCATGCCCAA GGTGACTTGG 7 140 
TTGTCCCCAA - CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT- ATACCAAGAT 7200 
GGCACTCTCC; TTATTCAGAA AGCCCAGCGT? TCTGACAGCG GCAACTACAC CTGCCTGGTC 7260 
AGGAACAGCG- CGGGAGAGGA- TAGGAAGACG GTGTGGATTC ACGTCAACGT CCAGCCACCC 7320 
AAGATCAACG. GTAACCCCAA- CCCCATCACC ACCGTGCGGG- AGATAGCAGC CGGGGGCAGT. 7380 
CGGAAACTGA • TTGACTGCAA*: AGCTGAAGGC ATCCCCACCC CGAGGGTGTT ATGGGCTTTT: 7440 
CCCGAGGGTG.TGGTTCTGCC AGCTCCATAC TATGGAAACC' GGATCACTGT CCATGGCAAC 7500 
GGTTCCCTGG : ACATCAGGAG TTTGAGGAAG AGCGACTCCG TCCAGCTGGT ATGCATGGCA 7560. 
CGCAACGAGG GAGGGGAGGC GAGGTTGATC GTGCAGCTCA CTGTCCTGGA GCCCATGGAG 7620 
AAACCCATCT TCCACGACCC GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC 7680 
AGCCTCAACT GCTCTGCCGC GGGGACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT - 7740 
GGCACCGATC TGCAGAGTGG- ACAGCAGCTG CAGCGCTTCT ACCACAAGGC TG ACGGCATG C 7800 
. CTACACATTA GCGGTCTCTC. CTCGGTGGAC GCTGGGGCCT ACCGCTGCGT GGCCCGCAAT ' 7860 
GCCGCTGGCC ACACGGAGAG GCTGGTCTCC. CTGAAGGTGG GACTGAAGCC AGAAGCAAAC 7920 
AAGCAGTATC ATAACCTGGT CAGCATCATC AATGGTGAGA CCCTGAAGCT CCCCTGCACC. 7980 
CCTCCCGGGG CTGGGCAGGG: ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 8040 
GGCCCCCAAA CCCTGGGACG CGTTTCTCTT CTGGACAATG GCACCCTCAC GGTTCGTGAG" ' 8100 
GCCTCGGTGT TTGACAGGGG; TACCTATGTA' TGCAGGATGG AGACGGAGTA CGGCCCTTCG 8160 
GTCACCAGCA . TCCCCGTGAT; , TGTGATCGCC , TATCCTCCCC GGATCACCAG CGAGCCCACC 8220 
CCGGTCATCT ACACCCGGCC. CGGGAACACC GTGAAACTGA- ACTGCATGGC TATGGGGATT 8280 
CCCAAAGCTG: ACATCACGTG, GGAGTTACCG. GATAAGTCGC; ATCTGAAGGC AGGGGTTCAG 8340 
GCTCGTCTGT ATGGAAACAG; ATTTCTTCAC CCCCAGGGAT CACTGACCAT CCAGCATGCC 8400 
ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGCAAAAA ■ ACATTCTCGG CAGTGACTCC 8460 
AAAACAACTT - ACATCCACGT." CTTCTGAAAT. GTGGATTCCA GAATGATTGC TTAGGAACTG 8520 
ACAACAAAGC GGGGTTTGTA" AGGGAAGCCA" GGTTGGGGAA'" TAGGAGCTCT TAAATAATGT 8580 
GTCACAGTGC ATGGTGGCCT CTGGTGGGTT TCAAGTTGAG- GTTGATCTTG ATCTACAATT 8640 
GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT' CAGCCTTGCT GAGACACTTT 8700 
CTTTTGTG'rT: TACATCATGC: CAGGGGCTTC ATTCAGGGTG TCTGTGCTCT GACTGCAATT 8760 
TTTCTTCTTT f TGCAAATGCC. ACTCGACTGC-. CTTCATAAGC GTCCATAGGA TATCTGAGGA 8820 
ACATTCATCA AAAATAAGCC ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA^ 8880 
TCACCTAGTT AACCTGCTGC AGTTTTTACA . TGATAGACTT TGTTCCAGAT TGACAAGTCA 8940 
TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT GCCCAATAAG GATTTAGAAC 9000 
CAGAGTGACT'- GATATATATA- TATATATTTTf AATTCAGAGT. TACATACATA- CAGCTACCAT 9060 
TTTATATGAA AAAAGAAAAA CATTTCTTCC -TGGAACTCAC TTTTTATATA' ATGTTTTATA 9120 
TATATATTTT TTCCTTTCAA ATCAGACGAT GAGACTAGAA- GGAGAAATAC TTTCTGTCTT 9180 . 
ATTAAAATTA ATAAATTATT GGTCTTTACA' .AGACTTGGAT ACATTACAGC AGACATGGAA 9240 
ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC- TGTTATATTA 9300 
CCTTCTCCAG GAACCCTCCA- GTGGGGAAGG CTGCGATATT AGATTTCCTT GTATGCAAAG 9360 ■ 
TTTTTGTTGA AAGCTGTGCT ' CAGAGGAGGT. GAGAGGAGAG GAAGGAGAAA ACTGCATCAT " 9420 
AACTTTACAG AATTGAATCT AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT 9480 
CTGGCTTGTC CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9540 
CCATGAATAA TACACGACCT * GTTATTTCCA ' TGACTGCTTT ACTGTATTTT TAAGGTCAAT 9600 
ATACTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA 

Seq ID NO: 417 Protein sequence , 
Protein Accession #: NP_056234.l 

1 il" - . 21 31 41 51 

I I I I I I 

MPKRAHWGAL SWLILLWGH PRVALACPHP . CACYVPSEVH CTFRSLASVP AGIARHVERI 60 
NLGFNSIQAL SETSPAGLTK LELLMIHGNE IPSIPDGALR DLSSLQVFKP SYNKLRVITG " 120. 
QTLQGXtSNLM RLHIDHNKIE PIHPQAFNGL TSLRLLHLEG NLLHQLHPST FSTFTFLDYF 180 
RLSTIRHLYL AENMVRTLPA SMLRNMPLLE NLYLQGNPWT CDCEMRWFLE WDAKSRGILK 240 
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CKKDKAYEGG QLCAMCFSFK KLYKHEIHKL KDMTCLKPSI 
DGGSQLILEK FQLPQWsisL NMTDEHGNMV NLVCDIKKPM 
VALDFECPMT RENYEKLWKL IAYYSEVPVK LHRELMLSKD 
VHAQILAEPS WVMQPSIDIQ LNRRQSTAKX VLLSYYTQYS 
PSGAVQRDQT VLEGGPCQLS CNVKASESPS- IFWVLPDGSI- 
RIKSMEPSDS GLYQCIAQVR DEWDRMVYRV LVQSPSTQPA 
ALAIPEAHLS WILPNRRIIN- DLANTSHVYM LPNGTLSIPX 
HFTVGITVTK KGSGLPSKRG RRPGAKALSR- VREDIVEDEG 
EVFLKTKDDA*- INGDKKAKKG RRKLKLWXHS EKEPETNVAE 
ERWADILAKV RGKNLPKGTE VPPLIKTTSP PSLSLEVTPP- 
SSADVPLLGE EEHVLGTISS ASMGLEHNHN GVILVEPEVT 
TEGDIiKGTAA PTLISEPYEp- SPTLHTLDTV- YEKPTHEETA 
EPPLDAVSLA ESEPMQYFDP DLETKSQPDE DKMKEDTFAH 
STIGEPGVPG QSHLQGLTDN- IHLVKSSLST QDTLLIKKGM 
RSSESEGQES KSITLPDSTL GIMSSMSPVK KPAETTVGTI* 
TMSTHPSRRR PNGRRRLRPN KFRHRHKQTP PTTPAPSETF 
VPTAWVDNTVr NTPKQLEMEK NAEPTSKGTP RRKHGKRPNX 
PENKHRNTVT PSSETILLPR TVSLKTEGPY DSLDYMTTTR* 
SDGKEIKDDV ATNVDKHKSD- ILVTGESITN AIPTSRSLVS-. 
NPSRTAQPGR LQTDIPVTTS GENLTDPPLL KELEDVDFTS: 
TTIiSSIKVEV ASSQAETTTIit DQDHLETTVA ILLSETRPQN 
MSLGQTTTTK PALFSPRISQ;- ASRDSKENVF LNYVGNPETE 
PSSDRDAFNIi: STKIiELEKQV' FGSRSLPRGP DSQRQDGRVH : 
LPEMSTQSAS RYFVTSQSPR HWTNKPEITT YPSGALPENK 
PSIPSKFTDR RTDQFNGYSK VFGNNN I PEA* RNPVGKPPSP 
pqlgvtrrpq; IPTSPAPVMR ERKVIPGSYN: RIHSHSTFHL 
TNLQNIPMVS- STQSSI SFIT SSVQSSGSF# QSSSKFPAGGV 
QTVSVTAETD' TVFPCEATGK PKPFVTWTKV STGALMTPNT 
QDRGQYMCTA SNLHGLDRMV^ VLLSVTVQQP QILASHYQDV: 
QISWIFPDRR VWQTVSPVES RITLHENRTL SIKEASFSDR' 
HVAALPPVIH- QEKLENISLP PGLSIHIHCT AKAAPLPSVRV 
VFPNGTLYIR NLAPKDSGRY ECVAANLVGS ARRTVQLNVQ 
GGTLKLDCS A - SGDPWPRILW RLPSKRMIDA LFSFDSRIKV- 
CVARNKVGDD YWLKVDWM KPAKIEHKEB NDHKVFYGGD 
DGSLVNSFMQ SDDSGGRTKR YWFNNGTLY- FNEVGMREEG 
WTAPATIRN KTYIiAVQVPY- GDWTVACEA KGEPMPKVTW 
GTLIjIQKAQR' SDSGNYTCLV' RNSAGEDRKT VWIHVNVQPR" 
RKLIDCKAEG- IPTPRVLWAF- PEGWLPAPY YGNRITVHGN- 
RNEGGEARLI- VQLTVLEPME- KPIFHDPISE KITAMAGHT I' 
GTDLQSGQQL QRFYHKADGM- LHISGLSSVD AGAYRCVARN 
KQYHNLVSII' NGETLKLPCT - PPGAGQGRFS WTLPNGMHLE 
ASVFDRGTYV CRMETEYGPS VTSIPVIVIA YPPRITSEPT 
PKADITWELP DKSHLKAGVQ ARLYGNRFLH PQGSLTIQHA 
KTTYIHVF • 

Seq ID NO: 418 DNA sequence 

Nucleic Acid Accession 8: Eos sequence 

Coding sequences 1..5001 
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ESPLRQNRSR 
DVYKIHLNQT 
PRVSYQYRQD 
QTISTKDTRQ 
LKAPMDDPDS 
EKDTVTIGKN 
VQVSDSGYYR 
GSGKGDEENT 
GRRVFESRRR 
FPAVSPPSAS 
STPLEEWDD 
TEGWSAADVG 
LTPTPTIWVN 
KEMSQTLQGG 
IiDKDTTTVTT 
STOPTQAPDI 
HRYTPSTVSS 
KIYSSYPKVQ 
TMGEFKEESS. 
EFLSSLTVST 
HTPTAARMKE 
ATPVNNEGTQ 
ASHQI/TRVPA 
QFTTPRLSST 
RIPHYSNGRL 
DFGPPAPPLL 
PPASKFWSLG 
RIQRFEVLKN" 
TVYLGDTIAM 
GVYKCVASNA 
WVLGDGTQIR 
RAAANARITG 
FANGTLWKS 
LKVDCVATGL 
DYTCFAENQV 
LSPTNKVIPT 
KINGNPNPIT 
GSLDIRSLRK 
SLNCSAAGTP 
AAGHTERLVS 
GPQTLGRVSL 
PVIYTRPGNT 
TQRDAGFYKC 



SIEEEQEQEB 
DPPDIDINAT 
ADEEALYYTG 
ARGRSWVMIE 
KFSILSSGWL 
PGESVTIiPCN- 
CVAVNQQGAD.. 
SRRLLHPKDQ 
INMANKQINP 
. PVQTVTSAEE 

lsekteeits- 
sspeptssey. 
dsstsqlfed: 
nmlegdpths 
tprqkvapss 
kis%qvessl. 
rasgskpsps: 
etlpvtykpt 
pvgfpgtptw - 

PFHQEEAGSS ' 

PASSSPSTIb- 

HMSGPNELST 

KPILPTATVR - 

TIPLPLHMSK 

PFFTNKTLSF 

HTPQTTGSPS- 

EKPQILTKSP' 

GTLVIRKVQV- 

ECLAKGTPAP- 

AGADSLAIRL 

PSQFLHGNLF 

TSPRRTDVRY 

VTDKDAGDYL 

PNPEISWSLP 

GKDEMRVRVK ' 

SSEKYQIYQD 

TVREIAAGGS 

SDSVQLVCMA- 

TPSLVWVLPN ; 

LKVGIjKPEAN • 

IiDNGTLTVRE 

VKLNCMAMGI 

MAKNILGSDS 



1 
I 

ATGCCAGGCA 
ACCTCTCAAG. 
CAGTCTGTGC" 
TCAAGACAGT 
CAGATCGCTA 
GCAGTCCGTA 
ACACCAGAAT: 
AAACCTACAG' 
GTCTGTCTGC 

tttcagaata' 
cctatcctgg: 

TTTTCAAAAT. 
TCGCTATGCC 
ATCGATATCC 
CCATGTTTTC 
ATGTGCTATG 
GCCAGTAAGG 
CCTTCCTCAC 
CCCCAAGGGA 
GGTGGGGCGC 
TCGACAGAAA 
TCAGACACCC 
GTTGCTCCCG 
GTAGATAAGC 
TCGGCCTCTC 
GCCAGCTTGA 
CTCCACCCCA 
CCGTCCAGCG 
GCGCGGAGGA 
TCAAGACTTT 
AGCTCTCCAC 
GCGCCATCAC 
GTCTCTGAGG 
GGCGGAAGGC 
TTCCATTTGC 
ATTGGGCGGG 



11 

I 

CAAAACTAAC 
AGGACGAATT 
TTGTGTCCTG 

acaccgtgcg: 
acaggcgtgt 
tttcacaggg 

CTGCCCCTAC 
TTGTCGCTGC 
TGGACACAGG 
CATTCTTTCA 
AG ACACT AGT * 
CCGGGCCACA 
AACAAGAATG 
AAACCAAACA 
TTTTCTACTT 
AAGACCCANN 
CGGATGTTCA 
CTTCTCCCAG 
GAAATGCCAA 
CCCGAAAACC 
TCACTGGGGA 
AAGACCAGAA 
G^GGACTGC 
CTGGCTTTTC 
CTGCCCACCA 
ATGACAACGA 
AGGGCGCCTT 
TTCTCCGCGA 
CCCCCCATTC 
CTCCACCCCA 
TTTCCAAGGG 
GGTCCACCAT 
GAGCGGAGGC 
AGGCGGAGGC 
TCAGACACAA 
GACCTCGGCT 



21 
I 

CCGAACAGGC 
GGATGTACCT 
GGTGGATCCT 
CTATCGAGAG 
GCTGATTGAG 
TGAAAGAGAT 
CACAGCTCCT 
ATCTTGGGAT 
ACTGTTTTCA 
TACGCCCCGG 
TCTGCCCTGG 
AACAGGAGAG 
CAGCTGTACC 
AGTTAATAAA 
CCTCACATTT 
TGTTTCTTCT 
GCAGAACACG 
AGCTCCAGCT 
GGACCTTCTT. 
CCAGCTTCGC 
GGAGGAGCTG 
ACGGACCCTG 
AGTGAGGGCC 
CCTGGCCACG 
CGCGTCCACC 
CTTGGTGGAC 
CGCCCAGCCC 
CAGAAGCTCT 
AGGGGCCGCA 
TGGGGGATCA 
CGGGAAGGAT 
GTCCTCCTCC 
TTCTGATGGT 
CACGGCCCAG 
ACCCTTTGCT 
GCAGCCCTCC 



31 
I 

GCCCCAGCAG 
GACGACATCA 
GTTCTGGAAA 
AAGGGGGAAT 
AACCTGATTC 
GGCAAATGGA 
GAAAACTTGA 
GCGCTACCAG 
GTTTCCTCCT 
CTCTCAAACC 
TGGATGGTCT 
GCCTGGGACC 
CAGAAGGATT 
GATCCACAAC 
ATGCTGGATA- 
TTGACAGGCA 
GAGGACAATG 
TCCTCCCAAC 
CTTGACTTGA 
GCCAAGAAGG 
GGTTCCCGGG 
AGGCCGCCAA 
CGGATGCCAG 
CAGCCCCGCC 
CAGGGCACCT 
TCAGACGAAG 
CGGCCAGCCC 
GTGCACCCCG 
GAGGAAGATT 
TCTCGGCTGC 
GGTGAGGACG 
GTCTCTTCTC 
GAAAGCCACG 
ACGCTGCGGG 
GCCAACGGGA 
AGCTCCCCAC 



41 
I 

ACTACAGAGT 
GCGTCCGGGT 
AACAGAAGAA 
TGGCCAGGTG 
CAGACACTGT 
GTACGTCAGT. 
ACGTCTGGCC 
AGACTGAGGG 
TCCAACCATC 
ATTTGGAGCA 
GCAGCCTGGG 
TCACTCCAAA 
TCAGTTGTCT 
TGGAAGGGAG 
TTGGCGGCTT 
ATTCTTTAAA 
GGAAACCCGA 
ACCCCTCTGT 
AGAACAAAAT 
CAGAGGAGCT 
AGGACTCGCC 
GTAGACACGG 
CGCTGCCCCG 
CAGGGGCGCC 
CTCATCGTCC 
ATGAGCGCGC 
TGTCCCCCAG 
GCGCAAAGCC 
CCAGTGCCTC 
TGCCCACCCA 
CCCCAGCCAC 
ATCTCTCGTC 
GTGACGGCGA 
CCCGGCCTGC 
GGTCTCCAAG 
AGTCGACTGT 



SI 
I 

GATATTGAAG 
TATGTCATCT 
AGTTGTTGCA 
GGATTATAAG 
GTATGAATTT 
CTTCCAAAGA 
AGTCAATGGC 
GAAAGTGAAA 
TGCCAAATCA 
AAGTCCCTCA 
GAACGCTATC 
GCCTTCATTG 
AGCTTACCTG 
TGTTTTTGGA 
TTCCTTCATT 
ATCTGTTGCA 
AAAACCTGAG 
GCCTGCTTCT 
ATTGGCTAAT. 
GGATCTTCAG 
CATGTCACCC 
CCACTCGGTG 
AAGGGAAGGC 
CCCCTCGGCT 
TTCCCTGCCT 
TGTGGGCTCC . 
CCGCCAGTCC 
AGCCTCGCCG 
AGCCCCACCC 
GCCACACCTG 
CAACTCCAAT 
CAGGACGCAG 
TAGGGAAGAC 
CTCTGGACAC 
CAGGTTCAGC 
GCCCTCCCGA 
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60 
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GCCCACCCCA GGGTTCCCTC TCACTCTGAT TCCCACCCTA AGCTTAGCTC AGGTATCCAT 2220 
GGAGACGAGG AGGATGAGAA GCCGCTTCCT GCCACCGTTG TCAATGACCA CGTGCCTTCC 2280 
TCCTCCAGGC AGCCCATCTC CCGGGGCTGG GAGGACTTAA GGAGAAGCCC GCAGAGAGGG 2340 
GCCAGCCTGC. ATCGGAAGGA ACCCATCCCA GAGAACCCCA AATCCACAGG. GGCAGATACA r 2400 
CATCCTCAGG GCAAGTACTC. CTCCCTGGCC .TCCAAGGCTC AGGATGTTCA ACAGAGCACA ■ 2460 
GACGCGGACA> CGGAGGGTCA TTCTCCCAAA - GCACAGCCAG GGTCCACAGA CCGCCACGCG 2520 
TCCCCTGCTC GTCCTCCCGC AGCACGGTCA CAGCAGCATC CCAGTGTTCC CAGAAGGATG 2580 
ACACCCGGCC GGGCCCCAGA ACAGCAGCCC CCTCCTCCCG TCGCCACGTC CCAGCACCAC .2640 . 
COGGGACCCC AGAGCAGAGA CGCGGGTCGG TCACCTTCCC AGCCCAGGCT CTCACTGACC 2700 
CAGGCCGGGC GGOCCCGCCC; CACGTCGCAG GGCGGCTCCC ACTCCTCCTC- GGACCCTTAC 2760 
ACGGCGAGCT CCAGAGGGAT. GCTCCCCACG GCCCTCCAGA ACCAGGACGA GGATGCCCAG. 2820. 
GGCAGCTACC ACGACGACAG CACAGAAGTC GAGGCCCAGG ATGTGOGGGC CCCCGCGCAC 2880 
GCCGCGCGCG CCAAGGAGGC AGCTGCGTCC CTTCCCAAGC ACCAGCAGGT GGAGTCTCCC;. 2940. 
acaggogcag: gggcaggtgg, cgaccacagg. tcccagcgcg. GACATGCGGC CTCCCCCGCC . 3000 
AGGCCCAGCC .GACCOGGCGG. CCCCCAGTCC CGCGCCCGGG. TCCCCAGCAG GGCAGCGCCG - 3060 
GGGAAGTCGG-* AGCCTCCTTC CAAGCGGCCC CTGTCCTCCA : AGTCCCAGCA GTCGGTCTCA . 3120 
GCCGAGGACGr AGGAGGAGGA-. GGACGCGGGG ; TTTTTTAAAG GCGGGAAAGA ' AGACCTTCTG- 3180 ■ 
TCTTCCTCTG: TGCCAAAGTG GCCCTCTTCC TCCACTCCCA GGGGCGGCAA AGACGCCGAT . 3240 
GGGAGCCTCG CCAAGGAAGA GAGGGAGCCT GCCATCGCGC TTGCCCCTCG CGGAGGGAGC 3300 
CTGGCTCCTG: TGAAGCGACC TCTCCCCCCA' CCTCCAGGCA GCTCCCCCAG GGCCTCCCAC 3360 
GTCCCTTCCC GACCGCCGCC TCGCAGCGCT- GCCACCGTGA GCCCCGTCGC GGGCACCCAC 3420 
CCCTGGCCGC GGTACACCAC GCGCGCCCCV CCTGGCCACT TCTCCACCAC CCCGATGCTG 3480 
TCCTTGCGCC AGAGGATGAT GCATGCCAGA TTCCGTAACC CTCTCTCCCG ACAGCCTGCC 3540 
AGACCCTCTT, ACAGACAAGG TTATAATGGC AGACCAAATG TAGAAGGGAA AGTCCTTCCT * 3600 
. GGTAGTAATG GAAAACCGAA TGGACAGAGA ATTATCAATG GCCCTCAAGG AACAAAGTGG . 3660 
GTTGTGGACC- TTGATCGTGG GTTAGTATTG AATGCAGAAG GAAGGTACCT. CCAAGATTCA 3720= 
CATGGAAATC CTCTTCGGAT TAAACTAGGA GGAGATGGTC GAACCATTGT AGATCTGGAA 3780. 
GGGACCCCCG TGGTGAGTCC TGACGGCCTC CCACTCTTTG. GGCAGGGGCG ACATGGCACA 3840 
CCTCTGGCCA^ ATGCCCAAGA- TAAGCCAATT TTGAGTCTTG GAGGAAAGCC GCTGG TGGGC" 3900 
TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA CCATGCAGCC CACCACTACT 3960 
ACGACGCCCC TGCCTACCAC TACAACCCCG . AGGCCCACCA CTGCCACCAC CATGCAGCCC 4020 
ACCACTACTA CGACGCCCCT GCCTACCACT ACACCGAGGC CCACCACTGC CACCACCCGC; 4080 
CGCACGACCA CCAGGCGTCC AACAACCACA GTCCGAACCA CTACGCGGAC AACCACCACC . 4140 
ACCACCCCCA AACCCACCAC TCCCATCCCC ACCTGTCCCC CTGGGACCTT GGAACGGCAC ' 4200 
GACGATGATG- GCAACCTGAT AATGAGCTCC AATGGGATCC CAGAGTGCTA CGCTGAAGAA 4260 
GATGAGTTCT. CAGGCTTGGA' GACTGACACT. GCAGTACCTA CGGAAGAGGC CTACGTTATA '4320, 
TATGATGAAG. ATTATGAATT. TGAGACGTCA- AGGCCACCAA CCACCACTGA GCCTTCGACC 4380 
ACTGCTACCA CACCGAGGGT GATCCCAGAG GAAGGCGCCA- TCAGTTCCTT TCCTGAAGAA 4440 
GAATTTGATC TGGCTGGAAG GAAACGATTT GTTGCTCCTT ACGTGACGTA CCTAAATAAA • ■ 4500 
GACCCATCAG: CCCCGTGCTC TCTGACTGAT. GCACTGGATC ACTTCCAAGT. GGACAGCCTG 4560 
GATGAAATCA TCCCCAATGA CCTGAAGAAG AGTGATCTGC CTCCCCAGCA TGCTCCCCGC 4620 
AACATCACCG, TGGTGGCCGT GGAAGGTTGC CACTCATTTG TCATTGTGGA TTGGGACAAA 4680 
GCCACCCCAG GAGATTTGGT CACAGGTTAT TTGGTTTACA GTGCATCCTA TGAAGATTTC 4740 
ATCAGGAACA AGTTTTCCAC TCAAGCTTCA- TCAGTAACTC ACTTGCCCAT TGAGAACCTA 4800 
AAGCCCAACA CGAGGTATTA TTTTAAAGTG CAAGCACAAA ATCCTCATGG CTACGGACCT 4860 
ArCAGCCCTT CGGTCTCATT TGTCACCGAA- TCAGATAATC CTCTGCTTGT TGTGAGGCCC 4920 
CCAGGCGGTG AGCTATCTGG • ATCCCATTCG CTTTCAAACA TGATCCCAGC TACACGGACT 4980 
GCCATGGACG GCAATATGTG AAGCGCACGT GGTATCGAAA GTTCGTGGGA GTTGTTCTTT. 5040 
GTAATTCACT GAGGTATAAA- ATCTACCTCA GTGACAACCT GAAAGATACA TTCTACAGCA 5100 
TTGGAGACAG CTGGGGAAGA GGTGAAGACC ATTGCCAATT TGTGGATTCA CACCTTGATG 5160 
GAAGAACAGG GCCTCAGTCC TATGTAGAAG CCCTCCCTAC TATTCAAGGC TACTATCGCC 5220 
AGTATCGTCA GGAGCCTGTC AGGTTTGGGA ACATCGGCTT CGGAACCCCC TACTACTATG 5280 
TGGGCTGGTA' CGAGTGTGGG. GTCTCCATCC CTGGAAAGTG GTAATCACAG. GACCGTCATG 5340 
CTGCAAGCTT GCCCTGCCCA GCCCCACCAA CTAAGTCGCA CTAGGGGCTG TGAGCAAAGA 5400 
CAGCCAGCAT GCTCAGCCCC GCTGCCCTAG GTGCCAGGAA GGTCACAGAT GGACACTGGC 5460 
CATTCTGGTC ATCTCAGTCT GGAACTCAGT . CCCACTTCTT GGCCTGGACA" ATGAACAGGA 5520. 
TTCAGTTTTG' CTGTTAACTT. TGCTTCTCTA CTTTTTTTTG TTTGTTTGTA ATAGCACATC 5580 
CCAGAGACAT CAGAAACCAG CAACTGATTC AGTGTGATTT CCCAGACTTT TTAGGCATGA 5640 
AATTCGGACA CTTCAGTATT TCCAGGAATA GCATATGCAC GCTGTTCTTG CTTCATGGAA 5700 
- rGCTACATGC TTTCTGTTTT TCTCATTTTG GATTTCTCCA AAACTAACTG AATTTAAGCT 5760 
TCAGGTCCCT .TTGTATGCAG.. TAGAAAGGAA TTATTAAAAA CACCACCAAA GAAAATAAAT 5820 
ATATCCTACT TGAAATTTAC TCTATGGACT TACCCACTGC TAGAATAAAT GTATCAAATC 5880 
TTATTTGTAA ATTCTCAATT TTGATATATA TATGTATATA TGCATATACA TATCCACACT 5940 
TGTCTGCAAG AATATTGATT AAAATTGCTA AATTTGTACT TGTTCACCAA AAAAAAAAAA 6000 
AAAAAAA 

Seq ID NO: 419 Protein sequence 
Protein Accession #: Eos sequence 

1 .11 21 . 31 41 51 

I I I I I I 

MPGTKLTRTG APADYRVILK TSQEDELDVP. DDISVRVMSS QSVLVSWVDP VLEKQKKWA 60 
SRQYTVRYRE KGELARWDYK QIANRRVLIE NLIPDTVYEF AVRISQGERD GKWSTSVFQR 120 
rPESAPTTAP ENLNVWPVNG KPTWAASWD ALPETEGKVK VCLLDTGLFS VSSPQPSAKS 180 
PQNTFFHTPR LSNHLEQSPS PILETLLLPW WMVCSLGNAI FSKSGPQTGE AWDLTPKPSL 240 
SLCQQECSCT QKDFSCLAYL IDIQTKQVNK DPQLEGSVFG PCFLPYFLTF. MLDIGGFSFI 300 
MCYEDPVSSL TGNSLKSVAA SKADVQQNTE DNGKPEKPEP SSPSPRAPAS SQHPSVPASP 360 
QGRNAKDLLL DLKNKILANG GAPRKPQLRA" KKAEELDLQS TEITGEEELG SREDSPMSPS, 420 
DTQDQKRTLR PPSRHGHSW- APGRTAVRAR" MPALPRREGV DKPGPSLATQ PRPGAPPSAS^ 4 80 
ASPAHHASTQ GTSHRPSLPA SLNDNDLVDS DEDERAVGSL EPKGAFAQPR PALSPSRQSP 540 
SSVLRDRSSV HPGAKPASPA RRTPHSGAAE EDSSASAPPS RLSPPHGGSS RLLPTQPHLS 600 
SPLSKGGKDG EDAPATNSNA PSRSTMSSSV SSHLSSRTQV SEGAEASDGE SHGDGDREDG 660 
GRQAEATAQT LRARPASGHF HLLRHKPFAA. KGRSPSRFSI GRGPRLQPSS SPQSTVPSRA 720 
HPRVPSHSDS HPKLSSGIHG DEEDEKPXjPA TWNDHVPSS SRQPISRGWE DLRRSPQRGA 780' 
SLHRKEPIPE NPKSTGADTH PCJGKYSSLAS KAQDVQQSTD ADTEGHSPKA QPGSTDRHAS . 840 
PARPPAARSQ QHPSVPRRMT PGRAPEQQPP PPVATSQHHP GPQSROAGRS PSQPRLSLTQ 900 
AGRPRPTSQG RSHSSSDPYT ASSRGMLPTA LQNQDEDAQG SYDDDSTEVE AQDVRAPAHA 960 
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ARAKEAAASL 
KSEPPSKRPU 
SLAKEEREPA 
WPRYTTRAPP 
SNGKPNGQRI 
TPWSPDGLP 
TPLPTTTTPR 
TPKPTTPIPT 
DEDYEFETSR- 
PSAPCSLTDA 
TPGDLVTGYIi 
SPSVSFVTES 



PKHQQVESPT 
SSKSQQSVSA- 
IALAPRGGSL 
GHFSTTPMLS 

- INGPQGTKWV. 
LFGQGRHGTP- 

. PTTATTMQPT, 
CPPGTLSRHD' 
PPTTTEPSTT 
LDHFQVDSLD 
VYSASYEDPI 
DNPLLWRPP 



GAGAGGDHRS 



APVKRPLPPP 
LRQRNMHARE 
VDLDRGliVLN 
LANAQDKPIL 
TTTTPLPTTT 
DDGNLIMSSN 
ATTPRVIPEE 
EIIPNDLKKS 
RNKFSTQASS 



QRGHAASPAR 
FKGGKED ttLS 
PGSSPRASHV 
RNPLSRQPAR 
AEGRYLQDSH 
SJjGGKPIiVGL 
PRPTTATTRR ' 
GIPECYAEED 
GAISSFPEEE-' 
DLPPQHAPRN 
VTHLPIENLK 
SNMIPATRTA 



Seq ID NO t- 420 DNA "sequence 

Nucleic Acid Accession #: NMJJ22743 

Coding sequence: 128.. 1237. 



1 
I 

GTGGATTTTA 
TCTCTGCTGC 
AAAGCTGATG 
AAAAGCTTGG 
TCCTCCAGAC 
TTCAGAATCA 
TGAAGATAAG 
AGAAATACAG 
AGTGATCTGC 
ATATCCCAGT 
TGGGCCCCAC 
CTGCTACCTG 
. CTGCTTTGAA 
TGGTGATGAG 
GGCACACTGG 
TGAAOGGCTT 
CTGCATCAAC 
ATACAGGATT 
CAAACTGCAG 
TGATATTATG 
TTTAGAAGAA 
CGGCGTGTGT 
GTGAACCTCT 
GGTTTGCAAA 
ATTTGGTTGA 



11 
I 

GAGATACCTC 
AACACCCCTA 
CGATGCTCTC 
CCAGACCACA 
TCCGTTCGAC 
GAGAAGCTTT 
AAAGAGGGCC. 
GATGCCTCTC 
AACTCTTTCA 
ATCTCTTTGC 
CTCTTACTGC 
GATATGCTGA 
TGTGACTGTT 
CAAGTATGGA 
AAGTGGGAGC 
CCCGATATCA- 
CTCGGCCTGT 
TTTTTCCCAG 
CTACATCAAG 
AGAGTGACAC 
TGCGACGCCA 
CTTTGTTGAA 
CTTATTGGAA 
CCACAAGAAT 
GGATGCCAAA 



21 
I 

CCCTCCTTCT 

ctgttctccg 
agtgccgcgt: 
agcgggaatg 
ttcttggcag 

ACTCATTTTA 
TCAGGCAACT 
AGCTGCCACC 
CCATCTGTAA 
TCAATCACAG 
GAGCAGTCC6 
TGACCAGTGA 
TCCGTTGCCA 
AGGAAGTTCA 
AGGTTCTGGC 
ACATCTACCA 
TGGAGGAAGC 
GAAGCCATCC 
GCATGTTTCC 
ATGGCAGAGA 
ACATCAGAGC 
TGCCTTATTG 
ATTCTGTTCC 
CATTAGTTGT 
AAAAAAAAAA 



Seq ID NO: 421 Protein sequence 
Protein Accession ft: NP_073S80 



MRCSQCRVAK 
SEKXYSPYDL 
CNSFTICNAE 
LDMLMTSEER 
WKWEQVLAMC 
IPPPGSHPVR 
E CD AN IRAS 



11 
I 

YCSAKCQKKA 
ESNINKLTED 
MQEVGVGLYP 
RKQLRDQYCF 
QAIISSNSER^ 
GVQVMKVGKLr 



21 

I -. 

WPDHXRECKC 
KKEGLRQLVM 
SISIiLNHSCD 
ECDCFRCQTQ 
LPDINIYQLK 
QLHQGMFPQA 



31 
I 

GCTCAGCTGC. 
TGTATTGGCT 
CGCCAAATAC 
CAAATGCCTT 
AGTTGTCTTC 
TGATCTGGAG 
CGTAATGACA 
TGCCTTTGAC 
TGCGGAGATG 
CTGTGACCCC 
AGACATCGAG 
GGAGCGCCGG 
AACCCAGGAC 
AGAATCCCTG 
CATGTGCCAG: 
GCTGAAGGTG 
CTTGTTCTAT 
CGTGAGAGGG 
CCAAGCAATG 
ACACAGCCTG 
ATCCTAAGGG 
AGGTCACACA 
GTGTTTGTGT 
AGAGAAGCAC 
AAAAAAA 



31 
I 

LKSCXPRYPP 
TFQHFMREEI 
PNCSIVPNGP 
DKDADMLTGD 
VLDCAMDACI' 
MKNLRLAFDI 



PSRPGGPQSR 
SSVPRWPSSS 
PSRPPPRSAA 
PSYRQGYNGR 
GNPIiRIKLGG 
EVIKKTTHPP 
TTTRRPTTTV 
EFSGLETDTA 
FDLAGRKRFV 
ITWAVEGCH 
PNTRYYFKVQ 
KDGNM 



41 
I 

CTTGCAGTAA 
TTTCTGGGCA 
TGTAGTGCTA 
AAAAGCTGCA 
AAACTTATGG 
TCAAATATTA 
TTTCAACATT 
CTTTTTGAAG 
CAGGAAGTTG 
AACTGTTCGA 
GTGGGAGAGG 
AAGCAGCTGA 
AAGGATGCTG 
AAAAAAATTG 
GCGATCATAA 
CTCGACTGCG 
GGTACTCGGA 
GTTCAAGTGA 
AAGAATCTGA 
ATTGAAGATT 
AACGCAGTCA 
CTCTATGCTT 
AGGTAAATAA 
GATTATAATA 



ARVPSRAAPG. 

TPRGGKDADG 

TVSPVAGTHP 

PNVEGKVLPG " 

DGRTIVDLEG 

TTTMQPTTTT 

RTTTRTTTTT 

VPTEEAYVIY 

APYVTYLNKD 

SFVIVDWDKA. 

AQNPHGYGPI 



51 
I 

TTAAACTCTT 

GCAGGAAGGA 

AGTGTCAGAA 

AACCCAGATA 

ATGGAGCACC 

ACAAACTGAC 

TCATGAGAGA 

CCTTTGCAAA 

GTGTTGGCCT 

TTGTGTTCAA 

AGCTCACCAT 

GGGACCAGTA" 

ATATGCTAAC 

AAGAACTGAA 

GCAGCAATTC 

CCATGGATGC. 

CCATGGAGCC. 

TGAAAGTTGG 

GACTGGCTTT 

TGATTCTACT 

GAGGGAAATA 

TGTTAGCTGT 

AGGCAGACAT 

AATTCAAAAC 



41 51 

i I 
DSVRLLGRW FKLMDGAPSE 
QDASQLPPAP DLFEAFAKVI 
HLLLRAVRDI EVGEELTICY 
EQVWKEVQES LKKIEELKAH 
NLGLLBEALP YGTRTMEPYR- 
MRVTHGREHS LIEDLILLLB 



Seq ID NO: 422 DNA sequence. . 
Nucleic Acid Accession #t NM_003014.2 
Coding sequence i 238. .648 



GGCGGGTTCG 
CGGAGCTCCG 
AAACTCTCCT 
GGCAGGAAGA 
TTGCTCTCCA 
GCGCCCTGCG 
ATGCCCAACC 
GAGGAGCTGG 
GCGCCCATTT 
CAACGCGCGC 
AGCCTGGCCT 
ATCGTCACGG 
CAGGAAAGGC 
AAGGTGAAGC. 
AAAATAAAAG 
GAGATCTTCA 
TCTTGCCAGT. 
CGTTCAAGGA 
AAAAGATCCA 
AAGAAAACAG 
GCTCCCAAAC 
.AACCCGAAAA 
GATGAGGCTG 



11 
I 

CGCCCCGAAG 
CGGCCGGACC 
GCGCCCCAGA 
GAAGGCGCTT 
TCCTAGTGGC 
AGGCGGTGCG 
ACCTGCACCA 
TGGACGTGAA 
GCACCCTGGA 
GCGACGACTG; 
GCGACGAGCT 
ACCTCCCGGA 
CTCTTGATGT 
. CAACTTTGGC 
CTGTGCAGAG 
AGTCCTCATC 
GTCCACACAT 
TGATGCTTCT 
TACAGTGGGA 
CCGGGCGCAC 
CAGCCAGTCC 
GAGTGTGAGC 
GGCATTGCCT 



21 
1 

GCTGAGAGCT 
CCGCGGCCGC 
AGATTTCTTC 
TCTGTCTGCC 
GCTGTGCCTG 
CATCCCTATG 
CAGCACGCAG 
CTGCAGCGCC 
GTTCCTGCAC 
CGAGCCCCTC 
GCCTGTCTAT 
GGATGTTAAG 
TGACTGTAAA 
AACGTATCTC 
GAGTGGCTGC 
ACCCATCCCT 
CCTGCCCCAT 
TGAAAATTGC 
AGAGAGGCTG 
CAGTCGTAGT 
CAAGAAGAAC 
TAACTAGTTT 
GGGACAGCCT 



31 
I 

GGOGCTGCTC 
GCTTTGCTGC 
CTCGGCGAAG 
GGGGTCGCAG 
TGGCTGCACC 
TGCCGGCACA 
GAGAACGCCA 
GTGCTGCGCT 
GACCCTATCA 
ATGAAGATGT 
GACCGTGGCG 
TGGATAGACA 
CGCCTAAGCC 
AGCAAAAACT 
AATGAGGTCA 
CGAACTCAAG 
CAAGATGTTC 
TTAGTTGAAA- 
CAGGAACAGC 
AATCCCCCCA 
ATTAAAACTA 
CCAAAGCGGA 
ATGTAAGGCC 



41 

I 

GTGCCCTGTG 
CGACTGGAGT 
GGACAGCGAA 
CGCGAGAGGG 
TGGCGCTGGG 
TGCCCTGGAA 
TCCTGGCCAT 
TCTTCTTCTG 
AGCCGTGCAA 
ACAACCACAG 
TGTGCATTTC 
TCACACCAGA 
CCGATCGGTG 
ACAGCTATGT 
CAACGGTGGT 
TCCCGCTCAT 
TCATCATGTG 
AATGGAGAGA 
GGAGAACAGT 
AACCAAAGGG 
GGAGTGCCCA 
GACTTCCGAC 
ATGTGCCCCT 



51 
I 

TGCCAGACGG 
TTGGGGGAAG 
AGATGAGGGT 
CAGTGCCATG 
CGTGCGCGGC 
CATCACGCGG 
CGAGCAGTAC 
TGCCATGTAC. 
GTCGGTGTGC 
CTGGCCCGAA 
GCCTGAAGCC 
CATGATGGTA 
CAAGTGTAAA 
TATTCATGCC 
GGATGTAAAA 
TACAAATTCT 
TTACGAGTGG 
TCAGCTTAGT 
TCAGGACAAG 
AAAGCCTCCT 
GAAGAGAACA 
TTCCTTACAG 
TGCCCTAACA 



1020 

1080: 

1140 

1200 

1260. 

1320 

1380 

1440- 

1500 

1560 

1620 



60 
120 
180 
240 
300 
.' 360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
, B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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ACTCACTGCA 
GTTTTTCTTT 
GAGTTAAAGC 
CTAGAAGAGT 
AAATGCCATA: 
TATCTGTTGT 
AAGGAACAGT 
TTTTTGTGAT' 
TGTGTTTTTT 

aataataaag 
gttacctgat: 
acagtgagtt: 
attttatacc 
aataatttga- 

TTAAATATTT- 
AAAGTTGAGT. 
AAAAAGAACT. 
ATTTATTTTA r 
AGGCATTCAA 
ACTACACAGA 
GCACTTATAAi 
CTGCCTCCTT- 
TCTCATTTCT 
TATTGGATAC. 



GTGCTCTTCA 
GTAAGCCATC 
TGGTGGAAAA 
AGGGAAAATA 
TTTCAAACAA 
TGCAATGTTA 
AGTGGAATGA 
GAAAGGGGAT 
TACCAATGAC- 
AAAAATAAAT 
TTCCATGATC 
TGTCTGTACC 
CACAAGAGAG 
CAAGCTTAAA 
TCTTTGCCTA 
TCCACCTCTG 
TATTTGCAGC 
AAAAACAATT" 
TAAATGCACA 
GGTAATCACT 
AATGATTTGA 
TGCTTGGCCC 
AACAGCTGTG 
TTAGGTGGTT 



TAGACACATC 
ACAAGCCATA 
GGCTTATTGC 
ATGCTTGTTA 
AACACGTAAT 
GTGATGTTTT 
ATGTTAAAAG 
TTTTTGAAAA 
TTCAGTTTCT . 
AAAAAGGAGA- 
ATGATGCTTC 
ATTAGGAGTT 
G7ATGTCACT 
AATGGCCTTC 
AATACATGTG 
AAATGAGAAT 
ATTTTATCAA 

ttattggcct: 

ACGCCCAAAG 
ATTAGTATTT 
ACAAATAAAA 
TTTATTGAGA 
TTATATTCCA 
TCTTCACTGA 



TTGCAGCATT 
GTGGTAGGTT 
ATTGCATTCA 
CAATTCGACC 
TTTTTTACAG 
AAAATGTGAT 
ATCTTTATGT 
ATTAGAGAAG 
GTTTTTAGCT 
GGCAGACAAT 
TTGTCAACAC 
AGGTACTAAT 
CATCTTACTT- 
ATGTGAGTGC 
AGAGGAGTTA 
TACTTGACAG 
CAAATTTCAT 
TTTGCTAACA 
GAAATAAAAT 
TGGCATATTA 
CTAGGAACCT 
TAAGTTTTCC 
TAGTATGCAT 
CAATACTGAA 



TTTCTTAAGG 
TGCCCTTTGG 
GAGTAACCTG 
TAATATGTGC 
TATGTTTTAT 
GAAAATATAA 
GTTTATGGTC 
TAGCATATGG 
AGAAACTTAA 
GTCTGGATTC 
CCTCTTAAGC 
TAGTTGGCTA 
CCCAGGACAT 
CAAATTTTGT 
AATATAAATG 
TTGGGATACT 
AATTGTGGAC 
CAGTAAGCAT 
CCTATCTAAT- 
TTCTCCAGGT 
GTATACATGT 
TGTCAAGAAA 
TACTCAACAA 
TAAACATCTC 



CTATGCTTCA 

TACAGAAGGT 

TGTGCATACT " 

ATTGTAAAAT 

TACCTTTTGA 

TGTTTTTAAG 

TGCAGAAGGA 

AAAATT AT AA 

AAACAAAAAT 

ctgttttttg ■ 
agcaccagaa 
atgctcaagt' 
ccaccctgag 

TTTTCTTCAT 

tacagagagg 
ttaatcagaa 

AATTGGAGGC 
GTATTTTATA" 
CCTACTCTCC 
GTTTGCTTAT" 
GTTTCATAAC 
GCAGAAACCA' 
ACTGTTGTGC 
ACCGGAATTC 



Seq ID NO:: 423 Protein sequence 
Protein Accession 8:. NP_003 005.1. 



1 
I 

MPLSILVALC 
YEELVDVNCS 
ESLACDELPV. 
KKVKPTLATY: 
SSCQCPHZhP 
KKKTAGRTSR 



11 
I 

LWLHLALGVR 
AVLRFFFCAM 
YDRGVCISPE 
LSKNYSYVIH 
HQDVLIMCYE 
SNPPKPKGKP 



21 
I 

GAPCEAVRIP 
YAPICTLEFL 
AIVTDLPEDV' 
AKIKAVQRSG 
WRSRMMLLEN- 
PAPKPASPKK 



31 
I 

MCRHMPWNIT 
HDPIKPCKSV 
KWIDITPDKM 
CNEVTTWDV 
CLVEKWRDQL 
NIKTRSAQKR 



41 51 • 

I I 
RMPNHLHHST QENAILAIEQ 
CQRAHDDCEP LMKMYNHSWP- 
VQERPLDVDC KRLSPDRCKC 
KEIFKSSSPI PRTQVPLITN: 
5KRSIQWEER LQEQRRTVQD 
TNPKRV 



Seq ID NO: 424 DNA sequence 
Nucleic Acid Accession tt: BC010423 
Coding sequence: 248.. 1780 



l 
I 

CACAGCGTGG 
AGCTACGGCT 

caagtgcgag 
tctgcagccg 
ttcaaccatg 
gctgctactg 
cgtggtaact 
cggcgagcaa 
actagcgcta 
ggagcagccg 
gcaggcggat: 
ggcgcggctg 
actagaagag 
ccccagcgtg' 

CTCCCGCTCT 
GCAGCCACTG 
CATCCTCCAC 
GTGGCACATT 
CTCATACAAC 
CACTTTGGGC 
CAATGAGTTC 
CTCTGGGAAG 
ACTCTTGTTC 
GGCCCAGCAG 
CCGGAGGCTG 
GAGAGCCGAG 
AGAGCCCGAG 
TGAACTGCTG 
CAAACAGGCC 
CAATGGCATC 
CTAGGCCTGG 
ACACCCCCAT, 
AACCCTTCTG 
CACTGTGTGT 
TGACTGTCCG 
AAGTGAACTG 
GTTTGGOGTG 
CAGACCCCAG 
CAGACCCAGG 
TCTCCTACCA 
GAGGCTTGAA 
ACATATTTTC 
ACTTTTAATT 
TTTTATTTTT 



11 
I 

GAAGCAGCTC 
GGGTGTGTAG 
AGGCAAGAAC 
GCTCCCAGGG 
CCCCTGTCCC 
CTGGCATCAT. 
GTGGTGCTGG 
GTGGGGCAAG 
CTGCACTCCA 
CCGCCCCCAC 
GAGGGCGAGT, 
CGGCTCCGAG 
GGCCAGGGCC 
ACCTGGGACA 
GCTGCCGTCA 
ACTTGTGTGG 
GTGTCCTTCC 
GGCAGAGAAG 
TGGACACGGC 
TTTCCCCCAC 
TCCTCAAGGG 
CAGGTGGACC 
TGCCTTCTGG 
ATGACCCAGA 
CATTCCCATC 
GGCCACCGTG 
GGCCGCAGTT 
TCTCCAGGCT 
ATGAACCATT 
TACATCAATG 
CTCCTTCTGT 
TTCTTGCGGA 
TTCATCGGGA 
GTGCATGTGT 
TGGAGGGGTG 
TGGTGTATGT 
TGTGTCATGT 
AGCAGTATTA 
TGTGCGGGCA 
CTTCGGAGCC 
CTGTTACAGA 
TGTAAATATA 
TTTTTCTTTT 
ATT 



rrrr 



21 
I 

TGGGGGAGCT 
AACGGGGCCG 
TCTGCAGCTT 
AGATCTCGGT 
TGGGAGCCGA 
TTACAGGCCG. 
GCCAGGACGC 
TGGCATGGGC 
AATACGGGCT 
GCAACCCCCT 
ACGAGTGCCG 
TGCTGGTGCC: 
TGACCCTGGC 
CGGAGGTCAA : 
CCTCAGAGTT 
TGTCCCATCC 
TTGCTGAGGC 
GAGCTATGCT. 
TGGATGGGCC 
TGACCACTGA 
ATTCTCAGGT 
TAGTGTCAGC 
TGGTGGTGGT 
AATATGAGGA 
ACACGGACCC 
ATAGTCTCAA 
ACTCCACGCT 
CTGGGCGGGC 
TTGTTCAGGA 
GGCGGGGACA 
TGACATGGGA 
AGATGCTCCC 
GGGCTCCACC 
GCCTGTGTGA 
ACTGTGTCCG. 
GCCACGGGAT 
GGCTGTGTGT 
ATGATGCAGA 
TAGCTGGAGC 
ATGGGGGCAA 
AGCCCTCTGC 
CATGCGCCGG 
TTTTTTCTTG 
AGAGTTTGAG 



31 
I 

CGGAGCTCCC 
GGGCTGGGGC 
CCTGCCTTCT 
GGAACTTCAG 
GATGTGGGGG 
GTGCCCCGCG 
AAAACTGCCC 
TCGGGTGGAC 
TCATGTGAGC 
GGACGGCTCA 
GGTCAGCACC 
TCCCCTGCCC 
AGCCTCCTGC 
AGGCACAACG 
CCACTTGGTG 
TGGCCTGCTC 
CTCTGTGAGG. 
CAAGTGCCTG 
TCTGCCCAGT 
GCACAGCGGC 
CACTGTGGAT 
CTCGGTGGTG 
GGTGCTCATG 
GGAGCTGACC 
CAGGAGCCAG 
GGACAACAGT 
GACCACGGTG 
CGAGGAGGAG 
GAATGGGACC 
CCTGGTCTGA 
GATTTTAGCT 
CATCCCACTG 
AATTGAGTCT. 
GTGTTGACTG 
TGGTGTGTAT 
TTGAGTGGTT 
GACCTCTGCC 
GGTTGGAGGA 
TGGAATCTGC 
GTGTGAAGCA 
CCTCTGGTGG 
GAGCTTCTTG 
CCCTTTCCAT 
TCCAGCCTGG 



41 
I 

GATCACGGCT 
TGGGTCCCCT 
GGGTCAGTTC 
AAACGCTGGG 
CCTGAGGCCT 
GGTGAGCTGG 
TGCTTCTACC 
GCGGGCGAAG 
CCGGCTTACG 
GTGCTCCTGC 
TTCCCCGCCG 
TCACTGAATC 
ACAGCTGAGG 
TCCAGCCGTT 
CCTAGCCGCA 
CAGGACCAAA 
GGCCTTGAAG 
AGTGAAGGGC 
GGGGTACGAG 
ATCTACGTCT 
GTTCTTGACC 
GTGGTGGGTG 
TCCCGATACC 
CTGACCAGGG 
CCGGAGGAGA 
AGCTGCTCTG 
AGGGAGATAG 
GAAGATCAGG 
CTACGGGCCA 
CCCAGGCCTG 
CATCTTGGGG 
ACTGCTTGAC 
CTCCCACCAT 
ACTGTGTGTG 
TATGCTGTCA 
GCGTGGGCAA 
TGAAAAAGCA 
GAGAGGTGGA 
CTCCGGTGTG 
GCCAGTCCCT 
CCTCTGGGCC 
CAGGAATACT 
TAGTTGTATT 
ACGATATAGC 



51 
1 

TCTTGGGGGT 

AGTGGAGACC 

CTTATTCAAG - 

CAGTCTGCCT 

GGCTGCTGCT 

AGACCTCAGA 

GAGGGGACTC 

GCGCCCAGGA 

AGGGCCGCGT 

GCAACGCAGT 

GCAGCTTCCA 

CTGGTCCAGC 

GCAGCCCAGC. 

CCTTCAAGCA 

GCATGAATGG 

GGATCACCCA 

ACCAAAATCT 

AGCCCCCTCC; 

TGGATGGGGA 

GCCATGTCAG 

CCCAGGAAGA 

TGATCGCCGC 

ATCGGCGCAA 

AGAACTCCAT 

GTGTAGGGCT 

TGATGAGTGA 

AAACACAGAC 

ATGAAGGCAT 

AGCCCACGGG 

CCTCCCTTCC 

GCCTCCTTAA 

CTTTACCTCC 

GCATGCAGGT 

TGTGGAGGGG 

TATCAGAGTC 

CACTGTCAGG' 

GGTATTTTCT 

GACTGTGGCT 

AGGGAACCTG 

GGGTCAGCCA 

TGCTGCATGT 

GCTCCGAATC 

TTTTATTTAT ' 

CAGACCCTGT 



1440 

1500 

1560 

1620 

16B0 

1740 

1800 

1860 

1920 

1980 

2040 ■ 

2100 

2160 
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2400 

2460 
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2580 
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2760 



60 
120 
180 
240 
300 



60 
120 
-180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
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WO 02/086443 ... 

CTGTAAAAAA ACCAAAACCC AAAAAAAAAA AAAAAAAAAA 



Seq id NO: 425 Protein sequence 
Protein Accession fc:AAH10423 

1 11 ■ 21 31 41 .51 

I I I I I I 

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDW TWLGQDAKL PCFYRGDSGE 60 
QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 120 
DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSLNPGPALE' EGQGLTLAAS CTAEGSPAPS 180 
VTWDTEVKGT. TSSRSFKHSR" SAAVTSBFHL VPSRSMNGQP LTCWSHPGL LQDQRITHIL 240 
HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300 
GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VWGVIAALL ■ 360 
FCLLVWWL MSRYHRRKAQ. QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 420 
EGHPDSLKDN SSCSVMSEEP* EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGIKQ 400 
AMNHFVQENG. TLRAKPTGNG: IYINGRGHLV 

Seq ID NO: 426 DNA sequence 

Nucleic Acid Accession #: NM_003474.2 

Coding sequence: 3 7.. 3 03 6. 

1 11 21 31 41 51 

I II. ! I I 

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT. TTGCTCATTT ATTGCAACGG 60 
TCAAGGCTGG CTTGTGCCAG. AACGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 120 
CTTTTTTAAA AATGAAAGGC TAGAAGAGCT. CAGCGGCGGC GCGGGCCGTG CGCGAGGGCT: 180 
CCGGAGCTGA CTCGCCGAGG CAGGAAATCC CTCCGGTCGC GACGCCCGGC CCCGCTCGGC 240 
GCCCGCGTGG GATGGTGCAG: CGCTOGCCGC CGGGCCCGAG AGCTGCTGCA CTGAAGGCCG 300 
GCGACGATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCCTCCT GCTCGCCCTG 360 
GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CGAAGGAAGA 420 
GCTGATGAAG: TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 4 B0 
TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGGA AAGCAAAGAA 540 
CTGATCATAA ATCTGGAAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC GGAAACCCAC 600 

• TATCTGCAAG ACGGTA CTGA' TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC- 660 
TGTTACTACC ATGGACATGT:' ACGGGGATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT; 720 
TCTGGTCTCA GGGGACTTAT TGTGTTTGAA • AATGAAAGCT ATGTCTTAGA- ACCAATGAAA- 780 
AGTGCAACCA ACAGATACAA" ACTCTTCCCA GCGAAGAAGC TGAAAAGCGT CCGGGGATCA 840 
TGTGGATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 
CAGACATGGG CAAGAAGGCA-- TAAAAGAGAG > ACCCTCAAGG CAACTAAGTA TGTGGAGCTGi 960 
GTGATCGTGG CAGACAACCG. AGAGTTTCAG AGGCAAGGAA AAGATCTGGA AAAAGTTAAG: 1020 
CAGCGATTAA- TAGAGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT GAACATTCGG 1080 
ATCGTGTTGG TAGGCGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTCAGGAC 1140 
CCATTCACCA GCCTCCATGA" ATTTCTGGAC TGGAGGAAGA TGAAGCTTCT ACCTCGCAAA 1200 
TCCCATGACA ATGCGCAGCT TGTCAGTGGG. GTTTATTTCC: AAGGGACCAC CATCGGCATG 1260 
GCCCCAATCA TGAGCATGTG. CACGGCAGAC CAGTCTGGGG GAATTGTCAT GGACCATTCA 1320 
GACAATCCCC TTGGTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGGATG* 1380. 
AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGG. CGGTTGAGAA AGGAGGCTGC 1440 
ATCATGAACG CTTCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CAGCAGGAAG. 1500 
GACTTGGAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 
AGGGAGTCTT TCGGGGGCCA GAAGTGTGGG AACAGATTTG TGGAAGAAGG AGAGGAGTGT 1620 
GACTGTGGGG AGCCAGAGGA' ATGTATGAAT: OGCTGCTGCA A7GCCACCAC' CTGTACCCTG 1680 
AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTGCCAGCT- GAAGCCTGCA 1740 
GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG' CACAGGGGCC ; 1800 
AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGATGGGC ACTCATGTCA GGATGTGGAC 1860 

^ GGCTACTGCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGA 1920 
CCAGGTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980 
TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT GAGAGATGCT> 2040 
AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGTCATTGG TACCAATGCC 2100 
GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTGTG CCGGGGGACC 2160 
CACGTGTACT TGGGCGATGA CATGCCGGAC CCAGGGCTTG TGCTTGCAGG' CACAAAGTGT- 2220 
GCAGATGGAA AAATCTGCCT GAATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 
GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTGC 2340' 
GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT- TTGGAGGAAG CACAGACAGC 2400 
GGCCCCATCC GGCAAGCAGA TAACCAAGGT TTAACCATAG GAATTCTGGT GACCATCCTG 2460 
TGTCTTCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACCTTGAT ACGACTGCTG 2520 
TTTACAAATA AGAAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCACCC 2580 
CGTGGCTTCC AACCCTGTCA. GGCTCACCTC GGCCACCTTG GAAAAGGCCT GATGAGGAAG 2640 
CCGCCAGATT CCTACCCACC GAAGGACAAT CCCAGGAGAT TGCTGCAGTG TCAGAATGTT 2700 
GACATCAGCA GACCCCTCAA CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2760 
CTTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAGACC CCTGCCAGCC 2820 
AAGCCTGCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCCCCTCA GAAGCCTCTG 2880 
CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 
CAATGGGAGA CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACCAA 3000 
GTGCCCAGAT CCACCCACAC CGCCTATATT AAGTGAGAAG CCGACACCTT TTTTCAACAG 3060 
TGAAGACAGA AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 3120 
GGATTTTTTT- TAATGTTTAA AACATCATTA CTATAAGAAC. TTTGAGCTAC TGCCGTCAGT 3180 
GCTGTGCTGT GCTATGGTGC. TCTGTCTACT- TGCACAGGTA CTTGTAAATT ATTAATTTAT 3240 
GCAGAATGTT GATTACAGTG CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 3300 
CCATGGCAGG AAGGCTTGTT GTGCTTTTAG TATTTTAGTG. AACTTGAAAT ATCCTGCTTG 3360 
ATGGGATTCT GGACAGGATG TGTTTGCTTT CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420 
CCAACTACCC CCAGCTGTGC 7TATGGTACC AGATGCAGCT. CAAGAGATCC CAAGTAGAAT 3480 
CTCAGTTGAT TTTCTGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 
TGTGTTTGGC TTTCAGGGAG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600 
ACACCTGGGA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TGAGAACCTG. GGTTGCAGAC 3660 
AGGAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA ACACTAGACA AGCCAGAACT 3720 
TGACCCTGAG CTGACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAGTGT CACTCAAGGC 3780 
GGTGCTTGAT AGAAATGCCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGCCA. 3840 
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CCAGTAGGTT 
CTGCAAACCG 
CAATGATCCT 
TGAACCATTA 
AATTAGGCAG 
TATAGTTCAT 
CATCCTCTTT 
ACCTATTTCT 
CAACTTGCTT 
CTCTTCACTC 
AATGGCATGA 
CTGGACTGGT 
TTTATGAGAA 
TGTACCAAGA 
TCCCCACTGT. 
AAACACACAC 
TTATTCTATA- 
AGATACATAC' 
TATATACTAT- 
AGATGCCCAA 
ACCAAAAAAA 



ATTTAGCTTG 
CCACCTCCCT 
GTATTCAGAC 
ACCAGATCTA 
ACTCTTTATG. 
GTCTGCTATC 
TTCCAACTTG- 
TAAACACTTG 
ATCAACTTCC 
TTCAAATGCC 
GAAATACAAA' 
TTTCACATTA' 
AGCCTTCTTT 1 
ATCTTGGTTT 
ATCTAGGCAA 
AAAAGGGAAC 
GTTATTAAGT 
AGAATTACTG 
TAAAAAGGTT 
ATCCTTAGAT 
AAAAAAAAAA 



GGAAAGGTGG 
ATACTGCTTG 
AGATGAGGAC 
GTCAATCAAG 
CTTGCAAAAA 
ATTATTCGTA 
GCTGCAGGAA 
CAACCTACCT 
TAAATATTAT 
TGACTAGGGA 
AATACTCAGA 
GAAGACAATT 
TGGGGTCAAC 
GCCTTCCAGA 
CATAGTATTC 
CCAGCTCTAA 
TCTTTAAAAT 
TAACTGATTA 
TACAGAATTT 
CTGGCATGTT 
AA 



TGTTTCTGTA 
GAGCTGAGCA 
TTTCCATGGG. 
TCTGTTTACT 
CTACAACCAA 
GATATTGGAC 
TCTTTAAAAG 
GTTGAGCATC 
GAGATGTGGC 
GCCATGTTTC 
TAAGGTAAAA 
GACAACAGTT 
AGTTTTCCTA 
AAACAAAACT 
ATGACTATGG 
TACATTCCAA" 
GTAAAGCCAT 
CACTTGGTAA 
TATGGTGCAT 
AGCCCTTCCT 



AGAAACCTAC 
AATCACCACA 
ACCACAACTA 
GCAAGGTTCA 
TGGAATGTGA 
AAAGAACCTT- 
ATGCTTTTAA 
ACAGAATGTG 
TTGGGCAGCA 
ACAAGGTCTT 
TGCCATGATG 
ACATAATTCA* 
TGCTTTGAAA 
GCATTTCACT 
ATAAACTAAA 
CTCGTATAGC 
GCTGGAAAAT 
TTGTACTAAA 
TACGTGGGCA 
CCAATTATAA 



Seq ID NO: 427 Protein sequence 
Protein Accession #: NP_003465 



1 11. 

I . I 
MAARPLPVSP ARALLLALAG 

SKNHPEVLNI RLQRESKELI 

YHGHVRGYSD SAVSLSTCSG 

SHHKTPNLAA KNVFPPPSQT 

LIEIANHVDK- FYRPLNIRIV 

DNAQLVSGVY' FQGTTIGMAP' 

DTLDRGCSCQ • MAVEKGGCIM 

SFGGQKCGNR FVEEGEECDC 

' ACHDSSNSCDC LPEPCTGASP 

AKPAPGICFE: RVNSAGDPYG 

IBTNIPLQQG:- GRILCRGTHV 

AMQCHGRGVC. NNRKNCHCEA" 

LAAGPWYLK RKTLIRLLFT. 

DSYPPKDNPR RLLQCQNVDI- 

ALRQAQGTCK PNPPQKPLPA 

RSTHTAYIK 



21 
I 

ALLAPCEARG 
INIiERNEGLI 
LRGLIVFENE 
WARRHKRETL 
LVGVEVWNDM 
IMSMCTADQS- 
NASTGYPFPM 
GEPEECMNRC 
KCPANVYLHD 
NCGKVSKSSF 
YLGDDMPDPG 
HWAPPFCDKF 
NKKTTIEKLR 
SRPLNGLNVP 
DPLARTTRLT 



31 

.1 

VSLWNEGRAD 
ASSFTETHYI* 
SYVLEPMKSA 
KATKYVELVI- 
DKCSVSQDPF 
GGIVMDHSDK' 
VFSSCSRKDIi 
CNATTCTLKP 
GHSCQOVDGY. 
AKCEMRDAKC 
LVLAGTKCAD 
GFGGSTDSGP 
CVRPSRPPRG 
QPQSTQRVLP 
HALARTPGQW 



41 
I 

EVVSASVRSG 
QDGTDVSLAR 
TNRYKLFPAK 
VADKREFQRQ 
TSLHEETiDWR 
PLGAAVTLAKt 
ETSLEKGMGV 
DAVCAHGLCC 
CYNGI CQTHE 
GKIQCQGGAS 
GKICLNRQCQ 
IRQADNQGLT 
FQPCQAHLGH 
PLH RAP RAPS 
ETGLRLAPLR 



TGCCCAGGCA. 

AACTGTAATA 

TTTTCAGATG 

ACTTATTAAC' 

TGTTCATGGG 

CTCTATGGGG 

CAGAGTCTGA 

ATAAGGAAAT 

TCCCCTTGAA 

TAAAGTGACT ' 

CCTCTGTCTT- 

CTCTGAGTGT 

CAGAAAAATA 

TTCCCGGTGT 

CACGTGACAC 

ATGCATCTGT 

AATACTGCTG . 

GCCAAACATA 

TTGTCTTTTT 

GAGGATATGA 



SI 
I 

DLWIPVKSFD 
NYTVILGHCY 
KLKSVRGSCG 
GKDIiEKVKQR ' 
KMKLLPRKSH 
ELGHNFGMNH 
CLFNLPEVRE 
EDCQLKPAGT 
QQCVTIiWGPG 
RPVIGTNAVS 
NISVFGVHEC 
IGILVTILCL 
LGKGLMRKPP 
VPARPLPAKP 
PAPQYPHQVP 



3900 

3950 

4020 

4080 " 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4S60 

4620 

4680. 

4740 

4800 

4860 

4920 

4980 

5040 



Seq ID NO: 428 DNA sequence 

Nucleic Acid Accession #: NM_003714 

Coding sequence: 135.. 1043 



I 

GAGGAGGAGG 
GAGGAAGAGG. 
TAATACCAAG 
TGGCCACCTT 
ACAGGAGCTC 
GTTTGGTCAA 
GTGAGATTCG 
ATGCCCAGGG 
ACAGGTTCGG 
AGCGGGAATG 
TAGTGGAGAT 
ACTTGCTGGT 
AGTGTGAGCA 
AGAAGCCTCC , 
GGGCCCACCA 
GAGGTGCCAA' 
TCGGGGGCCT 
AGTATTCTGA 
GTCCATTTTC 
ACGCAGGATT 
TGAGATGGAG 
ACGTACTCAA 
AGTCAGTGGG 
GCAGGGCCCC. 
GCAGCAGCCT 
GAATGTAAAA 
AGGGGGTGGT. 
CTCTTGGCGA? 
TGTCTGGGCT 
ACTGCTTCAA- 
.TCTAAATAAA 
TAAAAAAAAA 
TTAAAAGCTA 
CACTTGGGGG 
TTTCCCTTAG 
CTGGCGATTC 
GAAAGAAGAG 



11 

I. 

GAAAAGGCGA 
AGGAGGAGGA 
AACCATGTGT 
TGACCCGGCG 
CCAGCAGAAA 
CGCTGGCGAT 
GGGCTTACAT 
CAAGTCATTC 
CTGCATAAGC 
CTACCTCAAG- 
GATCCATTTC 
GACCTGTGGG 
GAACTGGGGA 
CACGGCGCCC 
CGGGGAAGCA 
GGGTGAGCGA 
TGGGGCTCAG 
TATCCGGAGG 
TTATCTATGG 
CTGTGGGGAC 
ACCCCTGGGG 
GGGAGCGCGC 
TGTCGGCCGC 
CAGAGCTGGG 
CTGGTGCTGT 
ATAAATATCG 
TGGTGCCAAA 
GGGTGGAGGG 
GGGGGGGACA 
ATCTCGATTT 
TGGCTTTCAA 
AAAACCAGCC 
TCAAACAGCG 
AAACCTTAT A 
GATTTCGTTA 
CAGGAGACCC 
AATGAAGACT 



21 
I 

GCAAAAAGGA 
AGAGGGGAGC 
GCCGAGCGGC 
CGGGGGACCG 
GGCCGCCTGT 
GTGGGGTGTG 
GGGATTTGCA 
ATCAAAGACG. 
CGGAAGTGCC 
CACGACCTGT 
AAGGACTTGC 
GAGGAGGTGA 
AGCCTGTGCT 
CCCGAGCGCC 
GGACATCACC 
GGTAGCAAGA 
GGACCTTCCG 
TGAAATGAAA 
ACATTCCAAA 
TGTGGACTTC 
CCGTGGGGTC 
CCGCGTTATC 
TCTGTTGTGG 
CCACACAGTG 
CTCCGCGGAA 
CTTAGAATGC 
CTGAAATTCA 
AGGAGTGTCA 
CTGTCCAAGG 
CACTTTTTTT 
ACAAAGCAAC 
CATCCTTTGA 
ACATAGCCAT 
CCCAGAGGAA 
TCTCACCTTG 
AGCTGGAAAC 
ACTTAGTAAT 



31 
I 

AGAGTGGGAG 
ACAAAGGATC 
TGGGCCAGTT; 
ACGCCACCAA 
CCCTGCAGAA 
GCGTGTTTGA 
TGACTTTTCT 
CCTTGAAATG 
CGGCCATCAG 
GCGCGGCTGC 
TGCTGCACGA 
AGGAGGCCAT 
CCATCTTGAG 
AGCCCCAGGT 
TCCCAGAGCC 
GCCACCCAAA 
GAAGCAGCGA 
GGCCTGGCCA 
ACATTTACCA 
ATCGAGGTGT 
TCAGGGGTGC 
CTCGTACCTT 
GGGAGGTGAA 
GGTGCTGGGC 
GTCAGGGCGG 
AGGAGAAGGG 
GTTTCTTGTG 
TTTCTATGTG 
GAGTGGCCCC 
ATTTATCCAG 
TGGGTCATTA 
GGCTGATTTT 
ACATCTGACT 
AATACACACC 
ACCCTCAGCC 
CTGGCTTCTC 
TCCCATCAGG 



41 

I 

GAGGAGGGGA 
CAGGTCTCCC 
CATGACCCTG 
CCCACCCGAG 
TACAGCGGAG 
ATGTTTCGAG. 
GCACAACGCT 
TAAGGCCCAC 
GGAAATGGTG 
CCAGGAGAAC 
ACCCTACGTG 
CACCCACAGC 
CTTCTGCACC 
GGACAGAACC 
CAGCAGTAGG 
CGCCCATGCC 
GTGGGAAGAC 
CGAAATCTTT. 
TTAGAGAGGG 
GTGTTCGCGG 
CTGGTGAATT 
TGTCTTCTTT 
CCAGGGAGGG 
CTCGCCCCGA 
CTGGATTCCA 
TGGAGAGGAG. 
TGGGGCCTTG 
TAATTTCTGA 
TATGAGTTTA 
TTATATCTAC 
AAACCAGCTC 
TCTTTTTTTT 
GCCTGACATG 
TGGGGAGTAC 
AAGATTGGTA 
CATGTGAGGG 
AAATGCTGAC 



51 
i 

AGCGGCGAAG 

GACGGGAGGT 

GCTTTGGTGT 

GGTCCCCAAG 

ATCCAGCACT 

AACAACTCTT 

GGAAAATTTG 

GCTCTGCGGC 

TCCCAGTTGC 

ACCCGGGTGA" 

GACCTCGTGA 

GTGCAGGTTC ' 

TCGGCCATCC 

AAGCTCTCCA 

GAGACTGGCC 

CGAGGCAGAG 

GAACAGTCTG 

CCTCCACGCC 

GGGATGTCAC 

AACGGACAGG 

CTGCACTTAC 

CCATCTGTGG 

GCAGGGCAAG 

AGCTTCTGGT 

GGACAGGAGT 

GCAGGGGCCG 

CGGTTCAGAG 

GCCATTGTAC 

TATTTTAACC 

ATATCTGTCA 

AAAGGGGGTT 

AAGTTCTATT 

GACTCCTGCC 

ATTTGACAAA 

AAGCTGCGTC 

GATGGGAAAG 

CTTTTACATA 
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600 
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1080 
1140 
1200 
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1440. 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 - 
2040 
2100 
2160 
2220 
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WO 02/086443 . 

AAATCAAGGA GACTGCTGAA AATCTCTAAG GGACAGGATT TTCCAGATCC TAATTGGAAA. 
TTTAGCAATA • AGGAGAGGAG TOCAAGGGGA CAAATAAAGG CAGAGAGAGA GAGAGAGAGA 
GGGAGAGGAA GAAAAGAGAG AGAGAAAAGA GCCTCGTGCC 

Seq ID NO i .429 Protein sequence 

Protein Accession #: NP_003705 - 



PCT/US02/12476 



2280 
2340 



11 



41 



51 



21 31 

I «|l I I 

MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 
GDVGCGVFEC FENNSCEIRG LHGICMTFLH NAGKFDAQGK SFIKDAJaKCK AHALRHRFGC 
ISRKCPAIRB. MVSQLQRECY- LKHDLCAAAQ ENTRVIVEMI HFKDLLLHEP YVDLVNLLLT 
CGEEVKEAIT- HSVQVQCEQN WGSLCSILSF CTSAIQKPPT APPERQPQVD RTKLSRAHHG 
EAGHHLPEPS SRETGRGAKG ERGSKSHPNA HARGRVGGU3 AQGPSGSSEW EDEQSEYSDI 
RR 

Seq ID NO: 430 DNA sequence 

Nucleic Acid Accession NM_005940 

Coding sequence: 23.. 1489" 



AAGCCCAGCA 
CGCCCTCCTG: 
TCTGCCGCCG 
AGCCCTGCeC 
CAGCCTCAGG 
CCGACAGAAG 
GATCCTTCGG 
CCTAAAGGTA 
TGACATCATG 
TGGGGGCATC 
CGACTATGAT 
AGCCCATGAA 
GTCCGCCTTC 
TCAACACCTA 
CCAGGCTGGG : 
CTGTGAGGCC 
GGGCTTTGTG, 
TCGCCACTGG 
CATTTGGTTC 
CCCCGCACCC- 
GGGTCCCGAG' 
CAGCACCCGG. 
CTCTGAGATC 
CCTCTACTGG 
GGGTCCTGAC 
ATGCCCTCAG 
ATCTTTGTGG: 
GGTGGGGTAC 
AGCGACTGTC 
GGGACCCGCT. 
GTAGCACCAT. 
TCCTTCCAGG 
TGAGCAACTG 
ATCTGTCTGC 
GTTCACAGTC 
CAACATACCT 
ATCCTCCAAA 
TTTTTAAACT 



11 
I 

GCCCCGGGGC 
CCCCCGATGC 
GACGTCCACC 
AGTAGCCCGG 
CCTCCCCGCT 
AGGTTCGTGC 
TTCCCATGGC 
TGG AGCGATG : 
ATCGACTTCG'. 
CTGGCCCATG.- 
GAGACCTGGA" 
TTTGGCCACG 
TACACCTTTC 
TATGGCCAGC 
ATAGACACCA 
TCCTTTGACG 

tggcgcctcc 
cagggactgc; 
ttccaaggtg: 
ctcaccgagc 
aagaacaaga- 
cgtgtagaca 
gacgctgcct, 
aagtttgacc: 

TTCTTTGGCT'. 
GGGTGCTGAC 
CTGTGGGCAC 
AACCACCATG: 
TCAGACTGGG 
ATGCAGGTCC 

ggcaggactg: 
ggctggcact 
ggctgtaggg 

CTTCTGGCTG 
AAATGGGGAG 
CAATCCTGTC 
GCCATTGTAA 
GAGGATTGTC 



21 



I 

GGATGGCTCC. 
TGCTGCTGCT 
ACCTCCATGC 
CACCTGCCCC 
GTGGCGTGCG 
TTTCTGGCGG 
AGTTGGTGCA 
TGACGCCACT 
CCAGGTACTG 
CCTTCTTCCC 
CTATCGGGGA: 

tgctggggct; 
gctacccact 
cctggcccac 
atgagattgc 

CGGTCTCCAC 
GTGGGGGCCA 
CCAGCCCTGT 
CTCAGTACTG 
TGGGCCTGGT 
TCTACTTCTT 
GTCCCGTGCC 
TCCAGGATGC 
CTGTGAAGGT 
GTGCCGAGCC 
CCCTGCCAGG 
CAGGCATGGG- 
ACAACTGCCG 
CAGGGAGGCT 
TGGCAAACCT 
GGGGAACTGG 
GAAGCAAGGG 
CAGGGCCACT. 
ACAATCCTGG 
GGGTATTCTT 
CCAGGCCGGA 
ATGTGTGTAC 
ATTAAACACA 



Seq ID NO: 431 Protein sequence 
Protein Accession 8: NP_005931 



I 

MAPAAWLRSA 
PAPATQEAPR 
LVQEQVRQTM 
FFPKTHREGD 
YPLSLSPDDC 
VSTIRGELFP 
QYWVYDGEKP 
PVPRRATDWR 
AEPANTFL 



11. • 
I 

AARALLPPML 
PASSIiRPPRC 
AEALKVWSDV 
VHFDYDETWT-- 
RGVQHLYGQP 
PKAGFVWRLR 
VLGPAPLTEb 
GVPSEIDAAF 



21 
I 

LLLLQPPPItI< 
GVPDPSDGLS 
TPLTPTEVHE 
IGDDQGTDLL 
WPTVTSRTPA 
GGQLQPGYPA 
GLVRFPVHAA 
QDADGYAYFL 



31 
I 

GGCCGCCTGG 
GCTCCAGCCG 
CGAGAGGAGG 
TGCCACGCAG 
CGACCCATCT 
GCGCTGGGAG 
GGAGCAGGTG 
CACCTTTACT 
GCATGGGGAC 
CAAGACTCAC 
TGACCAGGGC 
GCAGCACACA. 
GAGTCTCAGC 
TGTCACCTCC 
ACCGCTGGAG: 
CATCCGAGGC 
GCTGCAGCCC 
GGACGCTGCC 
GGTGTACGAC 
GAGGTTCCCG s 
CCGAGGCAGG 
CCGCAGGGCC 
TGATGGCTAT- 
GAAGGCTCTG 
TGCCAACACT 
CCACGAATAT 
ACTGAGCCCA 
GGAGGGCCAC 
TTGGCATGAC 
GGCTGCCCTG, 
AGTGTCCTTG 
TGCTGGGGCC 
TCCTGAGGTC 
AAATCTGTTC 
■ CATGCAGGAG 
TCCTCCTGAA 
AGTGTGTATA 
GTTGTTTTCT 



31 
I 

ARALPPDVHH 
ARNRQKRFVL 
GRADIMIDFA 
QVAAHEFGHV 
LGPQAGIDTN 
LASRHKOGLP 
LVWGPEKNKI 
RGRLYWKFDP 



41 

I 

CTCCGCAGCG 
CCGCCGCTGC 
GGGCCACAGC 
GAAGCCCCCC 
GATGGGCTGA' 
AAGACGGACC 
CGGCAGACGA ■ 
GAGGTGCACG 
GACCTGCCGT 
CGAGAAGGGG 
ACAGACCTGC 
ACAGCAGCCA" 
CCAGATGACT 
AGGACCCCAG 
CCAGACGCCC: 
GAGCTCTTTT- 
GGCTACCCAG 
TTCGAGGATG. 

ggtgaaaagc 
gtccatgctg; 
gactactggc 
actgactgga : 

GCCTACTTCC- 
GAAGGGTTCC 
TTCCTCTGAC 
CAGGCTAGAG- 
TGTCTCCTGC 
GCAGGTCGTG : 
TTAAGAGGAA 1 
TCTCATCCCT 
CTGTATCCCT 
CCATGGCCTT 
AGGTCTTGGT 
TCCAGAATCC. 
ACCCCAGGCC 
GCCCTTTTCG 
AACCTTCTTC 



41 
I 

LHAERRGPQP 
SGGRWEKTDL 
RYWHGDDLPF 
LGLQHTTAAK 
EIAPLEPDAP 
SPVDAAFEDA 
YFFRGRDYWR 
VKVKALEGFP 



51 
I 

CGGCCGCGCG 

TGGCCCGGGC 

CCTGGCATGC 

GGCCTGCCAG . 

GTGCCCGCAA 

TCACCTACAG 

TGGCAGAGGC 

AGGGCCGTGC 

TTGATGGGCC 

ATGTCCACTT 

TGCAGGTGGC 

AGGCCCTGAT 

GCAGGGGCGT 

CCCTGGGCCC 

CGCCAGATGC 

TCTTCAAAGC 

CATTGGCCTC 

CCCAGGGCCA 

CAGTCCTGGG- 

CCTTGGTCTG 

GTTTCCACCC 

GAGGGGTGCC 

TGCGCGGCCG. 

CCCGTCTCGT,. 

CATGGCTTGG. 

ACCCATGGCC,- 

AGGGGGATGG 

GTCACCTGCC 

GGGCAGTCTT 

GTCCCTCAGG 

GTTGTGAGGT 

CAGCCCTGGC 

AGGTGCCTGC 

AGGCCAAAAA- 

CTGGAGGCTG 

CAGCACTGCT 

TTCTTTTTTT 



51 
1 

WHAAIiPSSPA 
TYRILRFPWQ 
DGPGGILAHA 
ALMSAFYTFR 
PDACEASFDA 
QGHIWFFQGA 
FHPSTRRVDS 
RLVGPDFFGC • 



Seq ID NO: 432 DNA sequence 
Nucleic Acid Accession #: NMJ>24022 
Coding sequence: 202.. 1563 



11 



31 



41 



51 



21 ' V i" I 

ACCGGGCACC GGAOGGCTGG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 
GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 
CCATCTACAT -TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 
AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 
TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 



60 
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60 
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840 
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1080 
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60 
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180 
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300 
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WO 02/086443 

GATGCTGTTG, CTGCACAGAT 
ATTGGGATCA TTGCATTGAT 
TCAGGGAAGT ACAGATGTCG 
GGAGTCTCGG ATTGCAAAGA 
AATGCCGTGC TCCAGGTGTT 
AAGGGTCACT ACGCAAATGT 
GATAACCTCA GAGTGAGCTC 
CACCTCTTGC CAGATGACAA : 
TGTGCCTCTG GCCACGTGGT^ 
AGCTCACGCA TCGTGGGTGG 
CTTCAGTTCC AGGGCTACCA 
ACTGCTGCAC ACTGTGTTTA 5 
CTAGTTTCCC TGTTGGACAA 
AGCAAGTACA AGCCAAAGAG 
CTCACGTTCA ATGAAATGAT 
GATGGAAAAG TGTGCTGGAC 
CCTGTCCTGA ACCACGCGGC. 
GTGTACGGTG GCATCATCTC 
GACAGCTGCC AGGGGGACAG 
TTAGTGGGAG. CGACCAGCTT 
ACCCGTGTCA CCTCCTTCCT 
TGAAGAGGAA GGGGACAAGT 
TCCCCTGGAC TCCCGTGTAG 
CGGCACCAGT AGCAGGCCCG 
GCTGCTTTTX GTTTTTTGTT 
GCAGTGGCGA AATCCCTGCT 
CCTCAGCTTC CCCAGTAGCT. 
TATTTTTAGT AGAGACAGGG 
CAAATGATGT GCCTGCTTCA' 
CCTAGCCTCA- CGCTCCTTTC 
GCGGCCTTTC CCACTGGTCC 
AGATAAGCAG TTATGTGACC 
CCAGCCCAGA AGTGCAGAAC 
AACCCACCCT TTCTACTTCC 
ACTCGTTTAA GGCCTATTTT 
TCCTTTGATT CCAAATAATA 



CCTGTCACTG 
ATTAGCACTG 
CTCATCCTTT 
CGGGGAGGAC 
CACAGCTGCT 
TGCCTGTGCC 
GCTGGAGGGG 
GGTGACTGCA 
TACCTTGCAG 
AAACATGTCC 
CCTGTGCGGG 
TGACTTGTAC 
TCCAGCCCCA 
GCTGGGCAAT 
CCAGCCTGTG 
GTCAGGATGG 
CGTCCCTTTG 
CCCCTCCATG 
CGGGGGGCCC 
TGGCATOGGC. 
GGACTGGATC 
AGCCACCTGA 
GAACCTGCAC 
AAAGAGGCAC 
TTTTTGAGGT. 
CACTGCAGCC. 
GGGACCACAG 
TTTCACCATG 
GCCTCCCACA 
TGATCTTCAC 
ATCTGGTTTT. 
TCACGTGCAA 
TGCAGTCACT 
AAGACTTATT 
CATGATTTCT 
TGTTTCCTTC 



Seq ID NO: 433 Protein sequence 
Protein Accession # : NP_076927 



MGENDPPAVE 
LAIAIGLGIH 
TAASWKTMCS 
VTALHHSVYV 
LCGGSVITPD 
LGNDIALMKL 
VPLISNKICN 
GIGCAEVNKP 



11 
1 

APFSFRSLFG 
PDCgGKYRCR 
DDWKGHYANV 
REGCASGHW 
WIITAAHCVY 
AGPLTFNEMI 
HRDVYGGIIS 
GVYTRVTSFL 



21 
1 

LDDLKI SPVA 
SSPKCIELIA 
ACAQLGFPSY 
TLQCTACGHR- 
DLYLPKSWTI 
QPVCLPNSBB 
PSMLCAGYLT 
DWIHEQMERD 



CTGCCATTGA 
GCCATTGGTC 
AAGTGTATCG 
GAGTACCGCT 
TCGTGGAAGA 
CAACTGGGTT 
CAGTTCCGGG 
TTACACCACT. 
TGCACAGCCT 
TTGCTCTCGC 
GGCTCTGTCA 
CTCCCCAAGT 
TCCCACTTGG 
GACATCGCCC 
TGCCTGCCCA- 
GGGGCCACAG 
ATTTCCAACA 
CTCTGCGCGG 
CTGGTGTGTC 
TGCGCAGAGG 
CACGAGCAGA 
GTTCCTGAGG. 
ACGAGCAGAC 
CCTTCCATCT' 
GGAGTCTOGC 
TCCGCTTCCC 
GTGCCCGCCA 
TTGGCCAGGC 
GTGCTGGGAT 
TAAGAACAAA 
CTCTCCAGGG: 
AGGCACCAAC: 
GCACGTTTTC 
TTCACATGTG" 
TTGTAGCATT 
CCTCAAAAAA 



31 
1 

PDADAVAAQI 
RCDGVSDCKD 
VSSDNLRVSS 
RGYSSRIVGG 
QVGLVSLLDN^ 
NFPDGKVCWT 
GGVDSCQGDS 
LKT 



Seq ID NO: 434 DNA sequence 

Nucleic Acid Accession #i NMJ>00493.2 

Coding sequence j 97.. 2139 



1 
I 

CACCTTCTGC 
CCAGGAACTC 
CTGCTAGTAT 
ACAGGCATAA 
AAGAGTAAAG 
GGACCTCGAG 
GGACTCCAAG 
GGTGTGCCAG 
GTTGGACCAG 
CCGGCTGGAA 
CCCAGGGGCT 
GGGGAAATGG 
GGTCCCACAG 
GGACAGCCAG 
CCAGGTCCCC 
GCTGGAGCCC 
ATAGCTGGGC 
GAAAGAGGAC 
GGTCTTCCTG 
AAAGGCATCC 
CCTGCAGGAT 
GGGTACCCAG 
AAAGGTGATC 
GCAAAGGGAA 
GGTACTAGAG 
CCAGGAAGTC 
ACCGGGCCAC 
CCCGCTGGGC 
GGCCAAAGGC 



11 

-I- 

ACTGCTCATC 
CCAGCACGCA 
CCTTGAACTT 
AAGGCCCACT 
GTATAGCAGT 
GGCACCCAGG 
GAGAGCCAGG 
GACTCCCAGG 
CTGGCCTACC 
TTTCTGTGCC. 
TTCCTGGAGA- 
GATATGGTGC 
GACCATCTGG 
GCATCAAAGG 
AAGGCCCTCC 
CAGGCCAGCC 
CCCCAGGGCC' 
CTGCTGGCCT 
GGAAGCCAGG 
CGGGTAGCCA 
ACCCTGGGGC 
GAAAACCAGG 
CTGGAGTTGG 
TGCCCGGACA 
GCCCTATTGG 
CCGGTCCTCC 
CAGGGCCTCC 
CTCCAGGCCC 
CCAGTCTTTC 



21 
I 

TGGGCAGAGG 
GAATCCATCT 
GGTTCATGGA 
ACCCAACACC 
AAG AG GAGAG 
TCCTTCTGGA 
GTTGCCAGGA 
AAAACCAGGA 
AGGACCCCGG 
AGGAAAACCT. 
AAAGGGTGCA 
TCCTGGTCGT 
CCCTCCTGGA 
TGATAGAGGT 
TGGGGAACGA 
AGGGATTCCA 
TCCTGGCTTT 
TCCTGGGGGT 
TCTGACTGGA 
TGGTCTCCCA 
TAAGGGTGAA 
TCTCGATGGT 
AGGACCTCCT 
CAATGGAGAG 
GCCACCAGGC 
■TGGCCCAGCT 
AGGTCXIAAGA 
ACCAGGTCAA 
TGGGACCCCT 



31 
I 

AAGCTTCAGA 
GAGAATATGC 
GTGTTTTACG 
AAG ACACAGT 
CAAGGTACTC 
CCACCAGGAA 
CCACCGGGAC 
GAGAGAGGAC 
GGCCCACCAG 
GGACAACAGG 
CCAGGAGTCC 
CCAGGTGAGA 
GTGGGAAAAA 
TTTCCGGGAG 
GGGCCAGAAG 
GGAACAAAAG 
GGGAAACCAG 
CCAGGTGCCA 
CCCCCTGGGA 
GGCCCTAAAG 
AGGGGTTCCC 
CCTAAGGGTA 
GGTCTCCCAG 
GCTGGCCCAA 
ATTCCAGGAT 
GGCATAGCAA 
GGCCACTCTG 
GCAGTCATGC 
CTTGTTAGTG 



A GTTTTTTC C 
TGGGCATCCA 
AGCTGATAGC 
GTGTCCGGGT 
CCATGTGCTC 
TCCCAAGCTA 
AGGAGTTTGT 
CAGTATATGT 
GTGGTCATAG 
AGTGGCCCTG 
TCACGCCCCT 
CATGGACCAT' 
TGGAGAAGAT 
TTATGAAGCT 
ACTCTGAAGA 
AGGATGGAGG 
AGATCTGCAA 
GCTACCTGAC 
AAGAGAGGAG 
TGAACAAGCC 
TGGAGAGAGA- 
TGATGAAGAC 
ACCCTTGGAG 
GATTCCAGCA 
TCTGTTGCCC 
TGGTTCAAGC 
CCACACCCAA 
TGCTCTCAAA 
TACAGGCATG 
AGAAGCAGCA 
GTCTTGCAAA 
AGCGACTCAG 
ATCTCTAGGG 
GGGAGGTTAA 
TGGTGCTTGA 
AAAAAAAAAA 



41 
I 

LSLLPLKFFP 
GEDEVRCVRV 
LEGQFREEPV 
NMSLLSQWPW 
PAPSHLVEKI 
SGWGATEDGG 
GGPLVCQERR 



41 
I 

AAGCTGCCAA 
TGCCACAAAT 
CTGAACGATA 
TCTTCATTCC 
CTGGTCCACC 
AACCAGGCTA 
CATCAGCTGT 
CATATGGACC 
GACCACCTGG 
GACCCACAGG 
CTGGTATGAA 
GGGGTCTTCC 
GAGGTGAAAA 
AAATGGGACC 
GCATTGGAAA 
GTCTCCCTGG 
GCTTGCCAGG 
AAGGGGAACA 
ATATGGGACC 
GTGAGACAGG 
CTGGGTCAGA 
ACCCAGGGTT 
GCCCTGTGGG 
GAGGTGCCCC 
TCCCTGGGTC 
CTAAGGGCCT 
GAGAGCCTGG 
CTGAGGGTTT 
CCAACCAGGG 



AATCATCGTC 

CTTCGACTGC 

TCGATGTGAC 

GGGTGGTCAG 

CGATGACTGG 

TGTGAGTTCA 

GTCCATCGAT 

GAGGGAGGGA 

AAGGGGCTAC 

GCAGGCCAGC 

GTGGATCATC 

CCAGGTGGGT 

TGTCTACCAC . 

GGCCGGGCCA 

GAACTTCCCC 

TGACGCCTCC 

CCACAGGGAC 

GGGTGGCGTG 

GCTGTGGAAG 

TGGGGTGTAC . 

CCTAAAAACC 

AGCCCGATCC 

CTCTGAGTTC 

CAACCTTCAA . 

AGGCTGGAGT : 

GATTCTCTTG 

CTAATTTTTG 

CCCCTGACCT. 

GGCCACCACG- 

ACTTGCAAGG 

ATTCCTGACG 

AAAAGACGCA 

ACCAGAACCA 

TCTAGGAATG. 

CGTATTATTG 

AAAAAAAAAA 



51 
I 

IIVIGIIALI 
GGONAVLQVP 
SIDHLLPDDK 
QASLQFQGYH 
VYHSKYKPKR 
DASPVLNHAA 
LWKLVGATSF 



51 
1 

GGCACCATCT 

ACCCTTTTTG 

CCAAATGCCC 

CTACACCATA 

AGGCCCTGCT 

CGGAAGTCCT 

AGGGAAACCA 

AAAAGGAGAT 

AATCCCTGGA 

AGCCCCAGGA 

TGGACAGAAA 

AGGCCCTCAG 

TGGGGTTCCA 

AATTGGCCCA 

GCCAGGAGCT 

GGCTCCAGGA 

CCTGAAGGGA 

AGGGCCAGCA 

CCAAGGACCA 

GCCAGCTGGG 

TGGAAAACCA • 

ACCAGGTCCA 

CCCAGCAGGA 

TGGAATACCA 

TAAAGGGGAT 

CAATGGACCC 

TCTTCCAGGG . 

TATAAAGGCA 

GGTAACAGGA 
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WO 02/086443 PCT/US02/12476 

ATGCCTGTGT CTGCTTTTAC TGTTATTCTC TCCAAAGCTT ACCCAGCAAT AGGAACTCCC 1800 

. ATACCATTTG ATAAAATTTT GTATAACAGG CAACAGCATT ATGACCCAAG GACTGGAATC 1860 . , 

TTTACTTGTC AGATACCAGG AATATACTAT TTTTCATACC ACGTGCATGT GAAAGGGACT 1920 

CATGTTTGGG TAGGCCTGTA TAAGAATGGC ACCCCTGTAA TGTACACCTA TGATGAATAC. 1980 

5 ACCAAAGGCT ACCTGGATCA GGCTTCAGGG AGTGCCATCA TCGATCTCAC AGAAAATGAC : 2040 

CAGGTGTGGC TCCAGCTTCC CAATGCCGAG TCAAATGGCC TATACTCCTC TGAGTATGTC 2100 

CACTCCTCTT TCTCAGGATT CCTAGTGGCT CCAATGTGAG- TACACCCCAC AGAGCTAATC 2160 

TAAATCTTGT GCTAGAAAAA GCATTCTCTA ACTCTACCCC ACCCTACAAA" ATGCATATGG = 2220 

. AGGTAGGCTG AAAAGAATGT AATTTTTATT TTCTGAAATA CAGATTTGAG . CTATCAGACC , 2280 

10 AACAAACCTT CCCCCTGAAA AGTGAGCAGC AACGTAAAAA CGTATGTGAA GCCTCTCTTG 2340 

AATTTCTAGT TAGCAATCTT AAGGCTCTTT AAGGTTTTCT CCAATATTAA AAAATATCAC 2400. 

CAAAGAAGTC CTGCTATGTT AAAAACAAAC AACAAAAAAC AAAGCAACAA AAAAAAAAAT 2460 

TAAAAAAAAA AACAGAAATA GAGCTCTAAG TTATGTGAAA " TTTGATTTGA GAAACTCGGC 2520 

ATTTCCTTTT. TAAAAAAGCC TGTTTCTAAC TATGAATATG AGAACTTCTA GGAAACATCC 2580 

15 AGGAGGTATC ATATAACTTT GTAGAACTTA AATACTTGAA TATTCAAATT- TAAAAGACAC 2640 

TGTATCCCCT AAAATATTTC TGATGGTGCA CTACTCTGAG GCCTGTATGG CCCCTTTCAT 2700 

CAATATCTAT TCAAATATAC AGGTGCATAT AT ACTTGTTA AAGCTCTTAT ATAAAAAAGC- 2760 

CCCAAAATAT TGAAGTTCAT CTGAAATGCA AGGTGCTTTC ATCAATGAAC CTTTTCAAAA 2820 

CTTTTCTATG - ATTGCAGAGA AGCTTTTTAT ATACCCAGCA- TAACTTGGAA ACAGGTATCT 2880 

20 GACCTATTCT TATTTAGTTA ACACAAGTGT "GATTAATTTG ATTTCTTTAA" TTCCTTATTG 2940 

AATCTTATGT GATATGATTT TCTGGATTTA CAGAACATTA GCACA7GTAC CTTGTGCCTC 3000 

CCATTCAAGT GAAGTTATAA : TTTACACTGA GGGTTTCAAA^ ATTCGACTAG AAGTGGAGAT 3060 

ATATTATTTA TTTATGCACT GTACTGTATT TTTATATTGC TGTTTAAAAC TTTTAAGCTG 3120 

TGCCTCACTT ATTAAAGCAC AAAATGTTTT- ACCTACTCCT TATTTACGAC ACAATAAAAT .3180 

25 AACATCAATA GATTTTTAGG CTGAATTAAT TTGAAAGCAG CAATTTGCTG TTCTCAACCA 3240 

TTCTTTCAAG GCTTTTCATT . CGACACAATA AAATAACATC AATAG 



30 



45 



70 



Seq ID HOt 435 Protein sequence 
Protein Accession ft: NP_000484.2 



1 11 21 31 41 51 

I I I I" I I 

MLPQIPFLUL VSLNLVHGVF YAERYQMPTG IKGPLPNTKT QFFIPYTIKS KGIAVRGEQG 60 

TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGBR 120 

3 5 GPYGPKGDVG PAGLPGPRGP PGPPGI PGPA GISVPGKPGQ QGPTGAPGPR- GFPGEKGAPG 180 

- VPGMNGQKGE MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRGEKGVPGQ PGIKGDRGFP 240 

GEMGPIGPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGPGK 300 

PGLPGLKGER GPAGLPGGPG AKGEQGPAGLr PGKPGLTGPP GNMGPQGPKG IPGSHGLPGP 360 

KGETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGLDGPK GNPGLPGPKG DPGVGGPPGL 420 

40 PGPVGPAGAK GMPGHNGEAG PRGAPGIPGT RGPIGPPGIP' GFPGSKGDPG SPGPPGPAGI 480 

ATKGLNGPTG PPGPPGPRGH SGEPGLPGPP GPPGPPGQAV MPEGFIKAGQ RPSLSGTPLV 540 

SANQGVTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIPT CQIPGIYYFS 600 

YHVHVKGTHV. WVGLYKNGTP VMYTYDBYTK GYLDQASGSA IIDLTENDQV WL0LPNAESN 660 
GLYSSEYVHS SFSGFLVAPM 



Seq ID NO: 436 DNA sequence 

Nucleic Acid Accession #: XM_062811 

Coding sequence: 1..8 88 



50 1 11 21 31 41 51 

I I 1 I I I 

ATGTGGGGCG CTCGCCGCTC GTCCGTCTCC TCATCCTGGA ACGCCGCTTC GCTCCTGCAG 
CTGCTGCTGG CTGCGCTGCT GGCGGCGGGG GCGAGGGCCA GCGGCGAGTA CTGCCACGGC 

TGGCTGGACG CGCAGGGCGT- CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 180 

55 GGCGACGCCA- CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 240 

GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 300 

CGGGCGGACA AAGACGGCCC CGACGGCTCG GCAGTGCCCA TCTACGTGCC GTTCCTCATT. 360 

GTTGGCTCCG TGTTTGTCGC GTTTATCATC TTGGGGTCCC TGGTGGCAGC CTGTTGCTGC 420 
AGATGTCTCC GGCCTAAGCA GGATCCCCAG CAGAGCCGAG CCCCAGGGGG TAACCGCTTG . 480 

60 ATGGAGACCA TCCCCATGAT CCCCAGTGCC AGCACCTCCC GGGGGTCGTC CTCACGCCAG 540 

TCCAGCACAG CTGCCAGTTC CAGCTCCAGC GCCAACTCAG GGGCCCGGGC GCCCCCAACA 600 

AGGTCACAGA CCAACTGTTG CTTGCCGGAA GGGACCATGA ACAACGTGTA TGTCAACATG 660 

CCCACGAATT- TCTCTGTGCT GAACTGTCAG CAGGCCACCC AGATTGTGCC ACATCAAGGG 720 

CAGTATCTGC ATCCCCCATA CGTGGGGTAC ACGGTGCAGC ACGACTCTGT GCCCATGACA 780 

65 GCTGTGCCAC CTTTCATGGA CGGCCTGCAG CCTGGCTACA GGCAGATTCA GTCCCCCTTC 840 
CCTCACACCA ACAGTGAACA GAAGATGTAC CCAGCGGTGA CTGTATAA 



Seq ID NO: 437 Protein sequence 
Protein Accession 8j XP_0628ll 



60 
120 



1 11 21 31 41 .51 

I I.I I I I 

MWGARRSSVS SSWNAASLLQ LLLAALLAAG ARASGEYCHG WLDAQGVWRI GFQCPERFDG 60 

GDATICCGSC ALRYCCSSAE ARLDQGGCDN DRQQGAGEPG RADKDGPDGS AVPIYVPFLI 120 

75 VGSVFVAFII LGSLVAACCC RCLRPXQDPQ QSRAPGGNRL MET I PM I PSA STSRGSSSRQ 180 

SSTAASSSSS ANSGARAPPT RSQTNCCLPE GTMNNVYVNM PTNFSVLNCQ QATQIVPHQG 240 
QYLHPPYVGY TVQHDSVPMT AVPPFMDGLQ PGYRQIQSPP PHTNSEQKMY PAVTV 

■ Seq ID NO: 438 DNA sequence 
80 Nucleic Acid Accession ft: NM_004004.1 
Coding sequence: 1.-681 

1 11 21 31 41 51 

or I I I III 

85 ATGGATTGGG GCACGCTGCA GACGATCCTG GGGGGTGTGA ACAAACACTC CACCAGCATT 60 

GGAAAGATCT GGCTCACCGT CCTCTTCATT TTTCGCATTA TGATCCTCGT TGTGGCTGCA 120 

AAGGAGGTGT GGGGAGATGA GCAGGCCGAC TTTGTCTGCA ACACCCTGCA GCCAGGCTGC 180 



349 



,10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75- 
80 
85 



WO 02/086443 

AAGAAOGTGT GCTACGATCA 
CTGATCTTCG TGTCCAGCCC 
GAGAAGAAGA GGAAGTTCAT 
ATCAAAACCC AGAAGGTCCG 
TTCTTCCGGG TCATCTTCGA- 
TTCTCCATGC AGCGGCTGGT 
TTTGTGTCCC GGCCCACGGA 
ATTTGCATCC TGCTGAATGT 
AAGTCAAAAA AGCCAGTTTA 



PCT7US02/12476 



CTACTTCCCC 
AGCGCTCCTA 
CAAGGGGGAG 
CATCGAAGGC 
AGCCGCCTTC 
GAAGTGCAAC 
GAAGACTGTC 
CACTGAATTG 
A ' 



ATCTCCCACA 
GTGGCCATGC 
ATAAAGAGTG. 
TCCCTGTGGT 
ATGTACGTCT. 

gcctggcctt: 
ttcacagtgt 

TGTTATTTGC 



TCCGGCTATG 
ACGTGGCCTA 
AATTTAAGGA 
GGACCTACAC 
TCTATGTCAT 
GTCCCAACAC 
TCATGATTGC 
TAATTAGATA 



GGCCCTGCAG 
CCGGAGACAT 
CATCGAGGAG 
AAGCAGCATC 
GTACGACGGC 
TGTGGACTGC 
AGTGTCTGGA 
TTGTTCTGGG 



Seq ID NO: 43 9 Protein sequence 
Protein Accession #: NP_003995.1 



51 



1 . 11 ' 21 31 41 * 

I : I I .1 I l 

MDKGTLQTIL GGVNKHSTSI GKIWLTVLFI FRIMILWAA KEVKGDEQAD FVCNTLQPGC 
KNVCYDHYFP ISHIRLWALQc LIFVSSPALL; VAKHVAYRRH EKXRKF I KGE IKSEFKDIEE 
IKTQKVRIEG SLWWTYTSSI FFRV I FEAAF MYVFYVMYDG FSKQRLVKCN AWPCPNTVDC 
FVSRPTEKTV FTVFMIAVSG ICILUNVTEL CYiLIRYCSG KSKKPV 



Seq ID NO: .440 DNA sequence 

Nucleic Acid Accession 8: XM_061091.1 

Coding sequence: 1..2481 



ATGCCAAATA 
CACCGAGCGC 
CCCGGGTACC. 
CAGATGCATG 
GTTGGGAAGC 
ATTTCAGCTG 
GATGGGTCTA" 
GTCTGTGACG- 
TCCACTCCTC 
AGAATCAAGA 
CTTCTGCACA" 
GTCACTGATG 
GGTGTCACTG. 
GCCAGCGAGC 
GGCCTCTTCA 
CCCGAGCTTG. 
CAGCCCTGCC 
TGCCCGCTGG 
GTCGACCTCC 
GCCAAAGTCT- 
GTGGGTGTGG 
GATGTGCCTG: 
ACGGGCAGTG- 
CAGGACCGGC; 
GCGGGCCCAG- 
GCCGTGCGGG;. 
GATCCTCAGG 
CGGCCAGGGT 
GTAGGGCCCG 
GAGGTGAACC 
GCCTTCGGGC. 
CCCTACCTAG 
ATGACCGTCC 
GGGAGAGGCG 
GTCTTGGTCG 
CGGGATTCCC. 
ATTGAGTGGC 
ATGAATGAGG 
TGGGAGGGCC 
GGATGGATTC 
ACCCCTCCCA 
GTCTGTGCCC 



11 
I 

CTTCAGGAAC 
TGGTCGCCGC 
CGCCAGTGCC 
GTGAGAAGGT 
CTCCTTTTGA 
CCAGCAAAAT 
ACAGCGTCGG 
GTCTGGACAT 
ATCTGGAATT 
GGATGGTTTT 
GAGGGTTGCC 
GGAAGTCCCA- 
TGTTTGCTGT 
CTAGAGGGCA 
GCACCCTCAG 
TCTTCATGGA- 
AGAATGGAGG 
CCTTTGGAGG 

TCGTGAAGCG 
CCACATACAG 
ACCTGGTCTG 
CCTTGCGGCA 
CACGTAGAGT 
CGCGTCACGC 
CAGAGCTGGA 
ATCTGTTCAA' 
GCCGGACACA 
AGAATTTTGC 
CTGACGTGAC 
TGGACACCAA 
GTGGGGTGGG 
AGAGGGGTGC 
CAGAGGATGC 
TGGGCGTGGG 
TGATCCACGT 
TGTGTGGAGA 
GCAGCTGCGT 
CCCACTGCGA 
TTGAGACGCC 
GCAACTACAG 
CAGGTCCTTA 



21 
I 

AACCAGGATT 
TCTCCTTCCG 
GGCTGCCGAT 
AGATCTCTGG 
GGCAAACGAA 
GATGTGGTGC 
GAAAGGGAGC 
CAGCCCCGAG 
CCCCTTGGAT 
CAAAGGAGGG 
TGGAGGCAGA 
GGGGGATGTG 
GGGGGTCAGG 
GCACGTGCTG 
CAGCTCGGCC 
GCGGTTAATG 
CACATGTGTT 
GGAGGCTAAC 
GGACAGCTCT 
GTTTGTGCGG 
CAGGGAGCTG 
GAGCCTCGAT 
GGCGGCAGAG 
GGTGGTTTTG 
AAGGGCGCGA 
GGAGATCACA 
CCAAATCCCr 
AGCCCTGGAC 
TCAGATGCAG 
ACAGGTCGGC 
ACCCACCCGG 
CTCAGCCGGC 
CCGGCCTGGT 
AGCCGTTCCT 
GCCTGTCCTA 
GGCAGCTTAC 
AGCCAAGCAG 
CCTGCAGAAT 
GAACCGTGAG 
CCTGAGGCAC 
AGAAGGCCTG 
G 



31 
I 

GAAATTTGGC 
GTGAGTCCCA" 
GACCGATTCA 
AGCCTTGGTG 
GTCCATGTAA 
TCGGCTGCAG 
TTTGAAAGGT 
AGGGTCAGAG 
TCATTTTCAA" 
CGCACGGAGA 
AATGCTTCTG 
GCACTGCCAT 
TTTCCCAGGT 
TTGGCTGAGC 
ATCTGCTCCA- 
GGCATCTCTC 
CCAGAAGGAC 
TGTGCCCTGA 
GCGGGCACCA- 
GCCGTGCTGA" 
CTGGTGGCGG' 
GGCATTCCCT 
CGTGGCTTCG 
CTCACTGAGT 
GAGCTGCTCC 
GGCAGCCCAA-. 
GAGCTGCAGG 
CTCGTCTTCA' 
AGCTTTGTGA- 
CTGGTGGTGT' 
GCTGCGATGC 
ACCGCCCTGC 
GTCCCCAAAG 
GCCCAGAAGC: 
AGTGAGGGTC 
GCCGACCTGC 
CCAGTCAACC 
GGGAGCTACC 
TGGAGCTCTT 
ATGGCTCCCG 
GGCACTGAAA 



Seq ID NOs 441 Protein sequence 
Protein Accession XP_061091.1 



1 

1 

MPNTSGTTRI 
QHHGEKVDLW 
DGSNSVGKGS 
RIKRMVFKGG 
GVTVFAVGVR 
PELVFMERLM 
VDLLFLLDSS 
DVPDLVWSLD 
AGPARHARAR 
RPGCRTQALD 
AFGLDTKPTR 
GRGAEDAAVP 



11 

I 

EIWLLQEPPG 
SLGVLCYEFL 
FERSKHFAIT 
RTETELALKY 
FPRWEELHAL 
GISLIGPCDS 
AGTTLDGFLR 
GIPFRGGPTL 
ELLLLGVGSE 
liVFMLDTSAS 
AAMLRAISQA 
AQKLRNNGIS 



21 
I 

HRALVAALLP 
VGKPPFEANE 
VCDGLDISPE 
LLHRGLPGGR 
ASEPRGQHVL 
QPCQNGGTCV 
AKVFVKRFVR 
TGSALRQAAB 
AVRAELEEIT 
VGPENFAQMQ 
PYLGGVGSAG 
VLWGVGPVL 



31 
I 

VSPSPELALA 
VHVSKETIGK 
RVRVGAFQFS 
NASVPQILII^ 
LAEQVEDATN 
PEGLDGYQCL 
AVLSEDSRAR 
RGFGSATRTG 
GSPKHVMVYS 
SFVRSCALQF 
TALLHIYDKV 



41 
I 

TTCTCCAAGA 
GCCCCGAGTT 
CGCTCCCGAT 
TTCTTTGCTA 
GCAAAGAAAC 
TGGACATCAT 
CCAAGCACTT 
TGGGAGCATT 
CCCAACAGGA 
CGGAACTTGC 
TGCCCCAGAT 
CCAAGCAGCT 
GGGAGGAGCT 
AGGTGGAGGA 
GCGCCACGCC 
TGATAGGCCC 
TGGACGGCTA 
AGCTGAGCCT 
CTCTGGACGG 
GCGAGGACTC 
TGCCTGTGGG 
TCCGTGGTGG 
GGAGCGCCAC 
CACACTCCGA 
TGCTGGGTGT 
AGCATGTGAT : 
GGAAGCTGTG 
TGTTGGACAC 
GAAGCTGTGC; 
ATGGCAGCCA 
TGCGGGCCAT. 
TGCACATCTA 
CTGTGGTGGT 
TGAGGAACAA 
TGCGGAGGCT 
GGTACCACCA 
TCTGCAAACC 
GCTGCAAGTG 
GCTCTGTATG 
TGCAGGAGGG 
TGGTGCCTAC 



41 

I 

PGYPPVPAAD 
ISAASKMMWC 
STPHIiEPPLD 
VTDGKSQGDV 
GLFSTLSSSA 
CPLAFGGEAN 
VGVATYSREL 
QDRPRRVWL' 
DPQDLFNQIP 
EVNPDVTQVG 
MTVQRGARPG 
RDSLIHVAAY 



51 
I 

GCCGCCCGGG 
GGCTCTGGCG 
GATTGGAGGT 
TGAATTTTTA 
CATCGGGAAG 
GTTTCTGTTA 
- TGCCATCACA - 
CCAGTTCAGT 
AGTGAAGGCAi 
TCTGAAATAC V 
CCTCATCATC; 
GAAGGAAAGG: 
GCATGCACTG- 
TGCCACCAAC 
AGCTGGGAGC; 
CTGTGACTCG ■ 
CCAGTGCCTC 
GGAATGCAGG. 
CTTCCTGCGG 
TCGGGCCCGA 
GGAGTACCAG 
CCCCACCCTG 
CAGGACAGGC 
GGATGAGGTT 
AGGCAGTGAG 
GGTCTACTCG 
CAGCCGGCAG 
CTCTGCCTCA 
CCTCCAGTTT 
GGTGCAGACT 
TAGCCAGGCC 
TGACAAAGTG 
GCTCACAGGC 
TGGCATCTCT 
TGCAGGTCCC 
GGACGTGCTC 
CAGCCCGTGC 
TCGGGATGGC 
TGTGAGCCAG 
CAGCAGCCGT 
CTTCTGGAAT 



51 
I 

DRFTLPMIGG 
SAAVDIHFLL 
SFSTQQEVKA 
ALPSKQLKER 
ICSSATPAGS 
CALKLSLECR 
LVAVPVGEYQ 
LTESHSEDEV 
ELQGKLCSRQ 
LWYGSQVQT 
VPKAWVLTG 
ADLRYHQDVL 



240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 



66 
120 
.180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



350 



5 

10 

15 
20 

25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 . 

IEWLCGEAKQ PVNLCKPSPC MNEGSCVLQN GSYRCKCRDG WEGPHCE 
GWILETPLRH MAPVQEGSSR TPPSNYREGL GTEMVPTFWN VCAPGP 

Seq ID NO; 442 DNA sequence 

Nucleic Acid Accession §T Eos sequence 

Coding sequence : * 1. . 2424 



1 
I 

ATGCCCCCTT 
TCTCTCCCTC 
AGCAAAATGA 
AGCGTCGGGA 
CTGGACATCA 
CTGGAATTCC 
ATGGTTTTCA" 
GGGTTGCCTG 
AAGTCCCAGG 
TTTGCTGTGG 
AGAGGGCAGC 
ACCCTCAGCA 
CCCTGTGAGC 
AGAGGATCGC 
AGAGTGTTCC 
TCGCAGCCCT 
CTCTGCCCGC 
AGGGTCGACC 
CGGGCCAAAG 
CGAGTGGGTG 
CAGGATGTGC 
CTGACGGGCA 
GGCCAGGACC 
GTTGCGGGCC 
GAGGCCGTGC 
TCGGATCCTC. 
CAGCGGCCAG 
TCAGTAGGGC 
TTTGAGGTGA 
ACTGCCTTCG 
GCCCCCTACC 
GTGATGACCG 
GGCGGGAGAG 
TCTGTCTTGG 
CCCCGGGATT> 
CTCATTGAGT 
TGCATGAATG 
GGCTGGGAGG 
CAGGGATGGA 
CGTACCCCTC 
AATGTCTGTG 



11 
I 

TCCTGTTGCT 
TCCAGGAAGT 
TGTGGTGCTC 
AAGGGAGCTT 
GCCCCGAGAG 
CCTTGGATTC 
AAGGAGGGCG: 
GAGGCAGAAA 
GGGATGTGGC 
GGGTCAGGTT 
ACGTGCTGTT. 

gctcggccat 

ACAGGACGCT 
GGCGGACCCT; 
TAACCCACCC 
GCCAGAATGG 
TGGCCTTTGG 
TCCTCTTCCT 
TCTTCGTGAA. 
TGGCCACATA 
CTGACCTGGT 
GTGCCTTGCG' 
GGCCACGTAG: 
CAGCGCGTCA? 
GGGCAGAGCT 
AGGATCTGTT 
GGTGCCGGAC 
CCGAGAATTT. 
ACCCTGACGT 
GGCTGGACAC 
TAGGTGGGGT- 
TCCAGAGGGG 
GCGCAGAGGA 
TCGTGGGCGT. 
CCCTGATCCA 
GGCTGTGTGG 
AGGGCAGCTG 
GCCCCCACTG 
TTCTTGAGAC 
CCAGCAACTA 
CCCCAGGTCC 



21 
I 

GGAGGCCGTC 
CCATGTAAGC 
GGCTGCAGTG 
TGAAAGGTCC 
GGTCAGAGTG 
ATTTTCAACC 
CACGGAGAGG 
TGCTTCTGTG 
ACTGCCATCC 
TCCCAGGTGG 
GGCTGAGCAG 
CTGCTCCAGC. 
GGAGATGGTC 
TGCGGTGCTG 
TGCCACCTGC 
AGGCACATGT 
AGGGGAGGCT" 
GCTGGACAGC 
GCGGTTTGTG 
CAGCAGGGAG. 
CTGGAGCCTC 
GCAGGCGGCA 
AGTGGTGGTT 
CGCAAGGGCG 
GGAGGAGATC 
CAACCAAATC 
ACAAGCCCTG 
TGCTCAGATG 
GACACAGGTC 
CAAACCCACC 
GGG CTCAGCC 
TGCCCGGCCT 
TGCAGCCGTT 
GGGGCCTGTC 
CGTGGCAGCT 
AGAAGCCAAG 
CGTCCTGCAG 
CGAGAACCGT 
GCCCCTGAGG 
CAGAGAAGGC 
TTAG 



31 
I 

.TGTGTTTTCC 
AAAGAAACCA 
GACATCATGT 
AAGCACTTTG 
GGAGCATTCC 
CAACAGGAAG 
GAACTTGCTC 
CCCCAGATCC 
AAGCAGCTGA 
GAGGAGCTGC 
GTGGAGGATG 
GCCACGCCAG 
CGGGAGTTCG 
GCTGCACAC7T 
TACAGGACCA 
GTTCCAGAAG 
AACTGTGCCC 
TCTGCGGGCA 

CTGCTGGTGG 
GATGGCATTC: 
GAGCGTGGCT 
TTGCTCACTG 
CGAGAGCTGC 
ACAGGCAGCC 
CCTGAGCTGC. 
GACCTCGTCT 
CAGAGCTTTG 
GGCCTGGTGG- 
CGGGCTGCGA' 
GGCACCGCCC 
GGTGTCCCCA 
CCTGCCCAGA 
CTAAGTGAGG 
TACGCCGACC 
CAGCCAGTCA 
AATGGGAGCT 
GAGTGGAGCT 
CACATGGCTC 
CTGGGCACTG 



41 
I 

TGTTTTCCAG 
TCGGGAAGAT 
rrCTGTTAGA 
CCATCACAGT 
AGTTCAGTTC 
TGAAGGCAAG 
TGAAATACCT 
" TCATCATCGT 
AGGAAAGGGG 
ATGCACTGGC 
CCACCAACGG 
ACTGCAGGGT 
CTGGCAATGC 
GTCCCTTCTA 
CCTGCCCAGG 
GACTGGACGG 
TGAAGCTGAG 
CCACTCTGGA 
TGAGCGAGGA 
CGGTGCCTGT 
CCTTCCGTGG 
TCGGGAGCGC 
AGTCACACTC 
TCCTGCTGGG 
CAAAGCATGT 
AGGGGAAGCT 
TCATGTTGGA 
TGAGAAGCTG 
TGTATGGCAG 
TGCTGCGGGC 
TGCTGCACAT 
AAGCTGTGGT- 
AGCTGAGGAA 
GTCTGCGGAG 
TGCGGTACCA 
ACCTCTGCAA 
ACCGCTGCAA 
CTTGCTCTGT 
CCGTGCAGGA 
AAATGGTGCC 



WSSCSVCVSQ 


780 


51 




| 

AGTGCCCCCA 


60 


TTCAGCTGCC 


120 


TGGGTCTAAC 


180 


CTGTGACGGT 


240 


CACTCCTCAT 


300 


AATCAAGAGG- 


360 


TCTGCACAGA- 


420 


CACTGATGGG 


480 


TGTCACTGTG 


540 


CAGCGAGCCT 


600 


CCTCTTCAGC 


660 


CGAGGCTCAC 


720 


CCCATGCTGG 


780 


CAGCTGGAAG 


840 


CCCCTGTGAC 


900 


CTACCAGTGC 


960 


CCTGGAATGC" 


1020 


CGGCTTCCTG u 


1080 


CTCTCGGGCC 


1140. 


GGGGGAGTAC 


1200 


TGGCCCCACC 


1260 


CACCAGGACA • 


1320 


CGAGGATGAG 


1380 


TGTAGGCAGT 


1440 


GATGGT CT AC 


1500 


GTGCAGCCGG 


1560 


CACCTCTGCC ' 


1620 


TGCCCTCCAG * 


1680 


CCAGGTGCAG - 


1740 


CATTAGCCAG ' 


1800 


CTATGACAAA 


I860 


GGTGCTCACA 


1920 


CAATGGCATC 


1980 


GCTTGCAGGT 


2040 


CCAGGACGTG 


2100 


ACCCAGCCCG- 


2160 


GTGTCGGGAT 


2220 


ATGTGTGAGC 


2280 


GGGCAGCAGC 


2340 


TACCTTCTGG 


2400 



PCT/US02/12476 



Seq ID NO: 443 Protein sequence 
Protein Accession #: Eos sequence 



1 
I 

MPPFLLLEAV 
SVGKGSFERS 
MVFKGGRTET. 
FAVGVRFPRW 
PCEHRTLEMV 
SQPCQNGGTC 
RAKVFVKRFV 
LTGSALRQAA 
EAVRAELEEI 
SVGPENFAQM 
APYLGGVGSA' 
SVLWGVGPV 
CMNEGSCVLQ 
RTPPSNYREG 



11 

I 

CVPLFSRVPP 
KHFAITVCDG 
ELALKYLLHR 
EELHALASEP 
REFAGNAPCM: 
VPEGLDGYQC 
RAVLSEDSRA 

ergfgsatrt 
tgspkhvmvy 
qsfvrscalq; 
gtallhiydk 
lseglrrlag 
ngsyrckcrd 
lgtemvptfw 



21 

I 

SLPLQEVHVS 
LDISPERVRV 
GLPGGRNASV 
RGQHVLLAEQ 
RGSRRTLAVL 
•LCPLAFGGEA 
RVGVATYSRB 
GQDRPRRWV 
SDPQDLFNQI 
FEVNPDVTQV 
VMTVQRGARP 
PRDSLIHVAA 
GV4EGPHCENR 
NVCAPGP 



31 
I 

KETIGKISAA 
GAFQFSSTPH 
PQILIIVTDG 
VEDATNGLFS 
AAHCPFYSWK 
NCALKLSLEC 
LLVAVPVGE1P- 
LLTESHSEDE 
PELQGKLCSR 
GLWYGSQVQ 
GVPKAVWLT 
YADLRYHQDV 
EWSSCSVCVS 



Seq ID NO: 444 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 89.. 2356 



GCCCCCTGGC 
GTCGCCGCTC 
TGTTTTCCTG 
AGAAACCATC 
CATCATGTTT 
GCACTTTGCC 
AGCATTCCAG 
ACAGGAAGTG 
ACTTGCTCTG 
CCAGATCCTC 



11 
I 

CCGAGCCGCG 
TCCTTCCGTT 
TTTTCCAGAG 
GGGAAGATTT 
CTGTTAGATG 
ATCACAGTCT 
TTCAGTTCCA 
AAGGCAAGAA 
AAATACCTTC 
ATCATCGTCA 



21 
I 

CCCGGGTCTG. 
ATATCAACAT 
TGCCCCCATC 
CAGCTGCCAG 
GGTCTAACAG 
GTGACGGTCT 
CTCCTCATCT 
TCAAGAGGAT 
TGCACAGAGG 
CTGATGGGAA 



31 
I 

TGAGTAGAGC 
GCCCCCTTTC 
TCTCCCTCTC 
CAAAATGATG 
CGTCGGGAAA 
GGACATCAGC 
GGAATTCCCC 
GGTTTTCAAA 
GTTGCCTGGA 
GTCCCAGGGG 



41 
I 

SKMMWCSAAV 
LEFPLDSFST 
KSQGDVALPS 
TLSSSAICSS 
RVFLTHPATC 
RVDLLFLLDS 
QDVPDLVWSL 
VAGPARHARA 
QRPGCRTQAL 
TAFGLDTKPT 
GGRGAEDAAV 
LIEWLCGEAK 
QGWILETPLR 



41 
I 

CGCCCGGGCA 
CTGTTGCTGG 
CAGGAAGTCC 
TGGTGCTCGG 
GGGAGCTTTG 
CCCGAGAGGG 
TTGGATTCAT 
GGAGGGCGCA 
GGCAGAAATG 
GATGTGGCAC 



51 
I 

DIMFLLDGSN 

QQEVKARIKR 

KQLKERGVTV ' 

ATPDCRVEAH 

YRTTCPGPCD 

SAGTTLDGFL 

DGIPFRGGPTr 

RELLLLGVGS 

DLVFMLDTSA 

RAAMLRAISQ 

PAQKLRNNGI 

QPVNLCKPSP 



51 
I 

CCGAGCGCTG 
AAGCCGTCTG 
ATGTAAGCAA 
CTGCAGTGGA 
AAAGGTCCAA 
TCAGAGTGGG 
TTTCAACCCA 
CGGAGACGGA 
CTTCTGTGCC 
TGCCATCCAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



351 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

GCAGCTGAAG . GAAAGGGGTG . 

. GGAGCTGCAT , GCACTGGCCA 
GGAGGATGCC ACCAACGGCC 
CACGCCAGAC TGCAGGGTCG 
GGAGTTCGCT GGCAATGCCC 
TGCACACTGT CCCTTCTACA 
CAGGACCACC TGCCCAGGCC 
TCCAGAAGGA- CTGGAOGGCT 
CTGTGCCCTG AAGCTGAGCC- 

. TGCGGGCACC, ACTCTGQACG. 
GG CCGTGCTG. * AGOGAGGACT. 

. GCTGGTGGCG GTGCCTGTGG 
TGGCATTCCC TTCCGTGGTG: 
GOGTGGCTTC GGGAGCGCCA 
GCTCACTGAG. TCACACTCCG 
AGAGCTGCTC CTGCTGGGTG 
AGGCAGCCCA' AAGCATGTGA'. 
TGAGCTGCAG- GGGAAGCTGT' 
CCTCGTCTTC ATGTTGGACA 
GAGCTTTGTG AGAAGCTGTG- 
CCTGGTGGTG TATGGCAGCC 
GGCTGCGATG ' CTGCGGGCCA 
CACCGCCCTG CTGCACATCT. 
TGTCCCCAAA GCTGTGGTGG 
TGCCCAGAAG CTGAGGAACA 
AAGTGAGGGT CTGCGGAGGC 
CGCCGACCTG' CGGTACCACC 
GCCAGTCAAC CTCTGCAAAC 

• TGGGAGCTAC CGCTGCAAGT 
CTTGAGACGC. CCCTGAGGCA 
AGCAACTACA ." GAGAAGGCCT- 
CCAGGTCCTT AGAATG TCTG, 
GAGGATGTCC CAACTGCAGC 
AAACGATGTT. GTTGAAAAGT 
GCCTTGTTGA- GGCTATGTCA' 
GACTTAAATT-; TAGCGGCCTG 
TGACACAGTA ATGCCCAGCA 



PCT/US02/12476 



TCACTGTGTT 
GCGAGCCTAG 
TCTTCAGCAC 
AGGCTCACCC 
CATGCTGGAG 
GCTGGAAGAG 
CCTGTGACTC 
ACCAGTGCCT 
TGGAATGCAG 
GCTTCCTGCG 
CTCGGGCCCG 
.GGGAGTACCA 
GCCCCACCCT; 
CCAGGACAGG 
AGGATGAGGT- 
TAGGCAGTGAT 
TGGTCTACTC 
GCAGCCGGCA 
CCTCTGCCTC 
CCCTCCAGTT 
AGGTGCAGAC. 
TTAGCCAGGC 
ATGACAAAGT 
TGCTCACAGG- 
ATGGCATCTC 
TTGCAGGTCC 
AGGACGTGCT 
CCAGCCCGTX3; 
GTCGGGATGGL 
CATGGCTCCC 
GGGCACTGAA 
CTTCCCGCCG 
CATGCTGCTT. 
TTTGATGTGT- 
TCTGCCACCT 
ACGTTCCTTT- 
GAGGCCTTTA 



TGCTGTGGGG 
AGGGCAGCAC 
CCTCAGCAGC 
CTGTGAGCAC 
AGGATCGCGG 
AGTGTTCCTA 
GCAGCCCTGC 
CTGCCCGCTG 
GGTCGACCTC 
GGCCAAAGTC 
AGTGGGTGTG 
GGATGTGCCT 
GACGGGCAGT. 
CCAGGACOGG 
TGCGGGCCCA ! 
GGCCGTGCGG 
GGATCCTCAG 
GCGGCCAGGG 
AGTAGGGCCC 
TGAGGTGAAC 
TGCCTTCGGG 
CCCCTACCTA- 
GATGACCGTC 
CGGGAGAGGC. 
TGTCTTGGTC. 
CCGGGATTCC 
CATTGAGTGG 
CATGAATGAG 
CTGGGAGGGC 
GTGCAGGAGG 
ATGGTGCCTA 
TGGCCAGGAC 
AGAGACAAGA 
AAGTAAATAC 
TTCCCTTGAG. 
GCACACAATC 
CTAGAGCATC 



GTCAGGTTTC 
GTGCTGTTGG 
TOGGCCATCT 
AGGACGCTGG 
OGGACCCTTG 
ACCCACCCTG 
CAGAATGGAG 
GCCTTTGGAG 
CTCTTCCTGC 
TTCGTGAAGC 
GCCACATACA 
GACCTGGTCT. 
GCCTTGCGGC 
CCACGTAGAG 
GCGCGTCACG 
GCAGAGCTGG 
GATCTGTTCA 
TGCCGGACAC 
GAGAATTTTG 
CCTGACGTGA" 
CTGGACACCA 
GGTGGGGTGG 
CAGAGGGGTG 
GCAGAGGATG 
GTGGGCGTGG 
CTGATCCACG 
CTGTGTGGAG 
GGCAGCTGCG. 
CCCCACTGCG 
GCAGCAGCCG 
CCTTCTGGAA- 
CACTATTCTC 
AAGCAGCTGA- 
CCACTTTCTG 
GATAAACAAG 
AATGCTCGCC 
CTTTGGACGG 



CCAGGTGGGA 
CTGAGCAGGT 
GCTCCAGCGC 
AGATGGTC03 



CCACCTGCTA 

GCACATGTGT 

GGGAGGCTAA 

TGGACAGCTC 

GGTTTGTGCG 

GCAGGGAGCT 

GGAGCCTCGA - 

AGGCGGCAGA 

TGGTGGTTTT 

CAAGGGCGCG 

AGGAGATCAC ; 

ACCAAATCCC 

AAGCCCTGGA 

CTCAGATGCA-" 

CACAGGTCGG. 

AACCCACCCG 

GCTCAGCCGG 

CCCGGCCTGG 

CAGCCGTTCC 

GGCCTGTCCT 

TGGCAGCTTA - 

AAGCCAAGCA- 

TCCTGCAGAA- 

AGAACCGATT " 

TACCCCTCCC.- 

TGTCTGTGCC 

actgagggag: 

TGTCACCCAC 
TACCTGCTGT 
GGGTCCTGAA 
AGAATGTTGT 



Seq ID NO!: 445 Protein sequence 
Protein Accession #: Eos sequence 



1 
I 

MPPPLLLEAV 
SVGKGSFERS 
MVFKGGRTET 
FAVGVRFPRW 
PCEHRTLEMV 
SQPCQNGGTC 
RAKVFVKRFV: 
LTGSALROAA 
EAVRAELEEIv 
SVGPENFAQM 
APYIiGGVGSA 
SVLWGVGPV 
CMNEGSCVLQ 



11 

I ■ 

CVFLFSRVPP 
KHPAITVCDG 
SLALKYLLHR 
EELHALASEP 
REFAGNAPCW 
VPEGLDGYQC 
RAVLSEDSRA 
ERGFGSATRT 
TGSPKHVMVY 
QSFVRSCALQ. 
GTALLHIYDK 
LSEGLRRLAG 
NGSYRCKCRD 



21 

1 

SLPLQEVHVS 
LDISPERVRV 
GLPGGRNASV 
RGQHVLLAEQ 
RGSRRTLAVL 
LCPLAFGGEA- 
RVGVATYSRE 
GQDRPRRVW 
SDPQDLFNQX 
FEVNPDVTQV^ 
VMTVQRGARP 
PRDSLIHVAA 



31 
I 

KETIGKISAA 
GAFQFSSTPH 
PQILIIVTDG 
VEDATNGLFS 
AAHCPFYSWK 
NCALKLSLEC 
LLVAVPVGEY 
LLTESHSEDE 
PELQGKLCSR 
GLWYGSQVQ 
GVPKAVWLT 
YADLRYHQDV 
FLRRP 



Seq ID NO: 446 DNA sequence 

Nucleic Acid Accession #: NMJ)31942.l 

Coding sequence: 145.. 1260 



CCCGAGCCCC 
TGCTCCTCCT 
CCGATCTGGG 
GTAAAGAAGA 
TCCTCTGATG 
TCAGTTCGGG 
GCGATGAAGT 
CAGCCCTCAG 
AATTTTTTGG. 
ATGTCTGAAT 
GACTCACAAT 
CCTGAACGGA 
GCTCTACCCA 
ACCGTGGATG 
GTGACCCTTC 
GTCTGCAGCA 
TGCCGTCAGA 
CGAGGCCAGT 
■ CTGCTGGATC 
CGGCAGCGAG 
TTTGGGAATG 
TATCTGGAAA 
TTTTTTCACT 
AATCAAGTTA 



11 

i 

GCCCCTCCGG. 
GCTGTGGGAC 
CACCCGCCAC 
ACTTAAAGAA- 
ACAGTTGTGA 
AAGGCTGTAG 
TTCCAGCGCG 
AGAATTCTGT. 
AGAAAAGGGC 
TAGAAAGCTT 
CAAGGAGACC 
GAGCTCGTCC 
TGGAGGAGGA 
GCTACATGAA 
CGCATATAAT 
ATTCTCGAGA 
AGACTATTGA 
TCTGTGGCCC 
CGAACTGGCA 
ATGGACGGTG 
TGCATGCCTA 
ATTTG CTGCC 
GAAACCTGAG 
ATCTTAGCAG 



21 
I 

GCCCGGGTCG 
CGCTGACCGC 
CAGCATGGAC 
ATTCAGATAT 
CAGCTTTGCT 
GACCCGCAGC 
GAGTACCAGG 
GACTGATTCC 
TTTAAATATA 
CCCTGGCTCG 
GCGAAGGCGT 
TCTTACCAGG 
GGAGGAAGAG 
TGAAGATGAC 
TCGCCCAGTG 
GAAGAT ATAT 
TACCAAAACA 
CTGCCTTCGA 
TTGCCCGCCT 
TGCGACTGGG 
CTTGAAAAGC 
TGCCrrCTAC 
TTAAAAATCT 
ACATGTGTTT 



31 
I 

GCGCGCCCAG 
GCGGCTGCTC 
GCTCGCCGCG 
GTGAAGTTGA 
TCTGATAATT 
CAGTGCAGGC 
GGAGCAACCA 
AACTCCGATT 
AAGCAAAACA 
TTCCGTGGAA 
ACATTCCCGG 
TCAAGGTCCC 
GATAAGTACA 
CTGCCCAGAA 
GAAGAAATTA 
AACCGTTCAC 
AACTGCAGAA 
AACCGTTATG 
TGTCGAGGAA 
GTCCTTGTGT 
CTGAAACAGG 
TTCTCAAATC 
TGATGATCAG 
CTGGAGCATC 



41 
I 

SKMMWCSAAV 
LEFPLDSFST 
KSQGDVALPS 
TLSSSAICSS 
RVFLTHPATC 
RVDLLFLLDS 
QDVPDLVWSL 
VAGPARHARA 
QRPGCRTQAL 
TAFGLDTKPT 
GGRGAEDAAV 
LIEWLCGEAK 



41 

I 

CCTGCCAGCC 
CGCTCTCCCC 
TGCCGCAGAA 
TTTCCATGGA 
TTGCAAACAC 
ACTCTGGACC 
ACAAAAAAGC 
CAGAAGATGA- 
AAGCAATGCT 
GACATCCCCT 
GTGTTGCTTC 
GGATCCTCGG 
TGTTGGTGAG 
GCCGTCGCTC 
CAGAGGAGGA 
TGGGCTCTAC 
ACCCAGACTG 
GTGAAGAGGT 
TCTGCAACTG 
ATTTAGCCAA 
AATTTGAAAT 
TTTCTTGTAA 
CCTGTTTCAT 
ACAGAAGGTA 



51 
I 

DIMFliLDGSN 
QQEVKARIKR 
KQLKERGVTV 
ATPDCRVEAH 
YRTTCPGPCD 
SAGTTLDGFL 
DGIPFRGGPT 
RELLLLGVGS. 
DLVFMLDTSA 
RAAMLRAISQ 
PAQKLRNNGI- 
QPVNLCKPSP 



51 
I 

GCGCTGCTGC 

GCTCCAAGCG 

AGATCTCAGA 

AACCTCGTCA 

GAGGCTGCAG 

TCTCAGGGTG 

AGAGTCCCGC 

AAGTGGAATG 

TGCAAAACTC 

CCCAGGCTCC 

CAGGAGAAAC 

GTCCCTTGAC 

AAAGAGGAAG 

CAGATCATCC 

GTTGGAGAAC . 

TTGTCATCAA .. 

CTGGGGCGTT 

CAGGGATGCT 

CAGTTTCTGC 

ATATCATGGC 

GCAAGCATAA . 

AAGTTTCCAA 

AAGAAACTCC 

TATTGCTAGT 



660. 

,720 
'780* 

040 
" 900 . 

960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
.1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
2460 
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WO 02/086443 , 

TACACTTTGC CCTCCTGCAG TTTCTTCTCT GCTCCCAACC CCCATCTCAT AGCATCCCCC- 1500 

' TCTATTTCCA * ATGCTCCTCT ^CCAACCGCTT AGTTTCTGAA TTTCTTTTAA ATTACAGTTT 1560. 

' TATGAAAGCA TATTTTATTT. ACTTGGTGTT GAAATAGCCC TCATAAAACC TAAGCACTTG . 1620 

GAAACACAAT -AATAGTATTA ACTAACTAGA TCTATTGAAT TTCAGAGAAG. AGCCTTCTAA 1680 

CTTGTTTACA CAAAAACGAG* TATGATTTAG CACTCATACT AGTTGAAATT TTTAATAGAA 1740 

- TCAAGGCACA AAAGTCTTAA: AACCATGTGG AAAAATTAGG TAATTATTGC AGATTGATGT 1800 

CTCTCAATCC CATGTATTGC GCTTATGTTA CAAGTTGTTG TCACAGTTGA GACTTAATTT 1860 

CTCCTAATTT CTTCTGCCCG AAGGGTAAGT GGTGCGTCCA GCTTACACGA TCATAATTCA 1920 

AAGGTTGGTG GGCAATGTAA- TACTTAATTA AAATAATGAT GGAAGAGCTA TCTGGAGATT 1980 

ATGAGTAAGC TGATTTGAAT TTTCAGTATA AAACTTTAGT ATAATTGTAG TTTGCAAAGT 2040 

TTATTTCAGT TCACATGTAA GGTATTGCAA ATAAATTCTT . GGACAATTTT GTATGGAAAC 2100 

TTGATATTAA AAACTAGTCT GTGGTTCTTT GCAGTTTCTT GTAAATTTAT AAACCAGGCA" .2160 

CAAGGTTCAA GTTTAGATTT TAAGCACTTT TATAACAATG ATAAGTGCCT TTTTGGAGAT 2220 

GTAACTTTTA GCAGTTTGTT AACCTGACAT CTCTGCCAGT CTAGTTTCTG GGCAGGTTTC; 2280 

CTGTGTCAGT ATTCCCCCTC CTCTTTGCAT TAATCAAGGT ATTTGGTAGA GGTGGAATCT '2340. 

AAGTGTTTGT ATGTCCAATT TACTTGCATA TGTAAACCAT. TGCTGTGCCA TTCAATGTTT 2400 

GATGCATAAT TGGACCTTGA ATCGATAAGT GTAAATACAG CTTTTGATCT GTAATGCTTT 2460 
TATACAAAAG'. TTTATTTTAA TAATAAAATG TTTGTTCTAA AAAAAAAAAA » . 

Seq ID NO: 447 Protein sequence 
Protein Accession'*: NP_114148.1 

1 il 21 31 41 51 

I I I • I I I 

MDARRVPQKD LRVKKNLKKP RYVKLISMET SSSSDDSCDS FASDNFANTR LQSVREGCRT 60 

RSQCRHSGPL rvamkfpars trgatnkkae srqpsensvt. dsnsdsedes gmnflekral 120 

NIKQNKAMbA- KLMSELESFP GSFRGRHPLP GSDSQSRRPR RRTFPGVASR RNPERRARPL 180 

TRSRSRILGS' LDALPMEEEE EEDKYMLVRK RKTVDGYMNE DDLPRSRRSR SSVTLPHIIR 240 

PVEEITEEEL ENVCSNSREK IYNRSLGSTC HQCRQKTIDT KTNCRNPDCW GVRGQFCGPC 300 

EiRNRYGEEVR* DALLDPNWHC PPCRGICNCS FCRQRDGRCA TGVLVYLAKY HGFGNVHAYL' 360 
KSLKQEFEMQ A' . 

Seq ID NO: 448 DNA sequence 

Nucleic Acid Accession #: NM_019894 

Coding sequence: 1. .1314 

1 il 21 31 .41 51 

I I I I I I 

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TCGATGTCAA ACCCCTGCGC 60 

AAACCCOGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 120 

CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

GGGAACTGGT. TCTCTGCCTG TTTCGACAAC TTCACAGAA3 CTCTCGCTGA GACAGCCTGT. 420 

AGGCAGATGG 'GCTACAGCAG: CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT. ACGACAAACA- GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC , 1260 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 

Seq ID NO: 449 Protein sequence 
Protein Accession #t- NP_063947.1 

1 11 21 31 41 51. 

I, " I / II I I 

MLQDPDSDQP ' LNSLDVKPLR KPRIPMETFR KVGIPII1AL LSLASIIIW VLIKVILDKY. 60 

YFLCGQPLHF " IPRKQDCDGE LDCPLGEDEE • HCVKSFPEGP AVAVRLSKDR STLQVLDSAT. 120 

GNWFSACFDN-FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDWEITEN SQELRMRNSS 180 

GPCLSGSLVS " LHCLACGKSL KTPRWGGEE ASVDSWPWQV SIQYDKQHVC GGSIU3PHWV 240 

LTAAHCFRKH TDVFNWKVRA - GSDKLGSFPS LAVAKI 1 1 IE FNPMYPKDND IALMXLQFPL 300 

TFSGTVRPIC'LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD, 360 

AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT . 420 



Seq ID NO: -450 DNA sequence 

Nucleic Acid Accession #: XM_051860.2 

Coding sequence: 52;. 3 04 2 

1 • ' 11 ■ 21 - 31 41 51 ' 

I \:~; I I I I I . 

GCTCACCCAG GAAAAATATG CAATCGTCCC ATTGATATAC AGGCCACTAC AATGGATGGA 60 

GTTAACCTCA GCACCGAGGT TGTCTACAAA AAAGGCCAGG ATTATAGGTT TGCTTGCTAC 120 

GACCGGGGCA GAGCCTGCCG GAGCTACCGT GTACGGTTCC TCTGTGGGAA GCCTGTGAGG 180 

CCCAAACTCA ' CAGTCACCAT TGACACCAAT GTGAACAGCA CCATTCTGAA CTTGGAGGAT 240 

AATGTACAGT CATGGAAACC TGGAGATACC CTGGTCATTG CCAGTACTGA TTACTCCATG 300 

TACCAGGCAG • AAGAGTTCCA GGTGCTTCCC TGCAGATCCT GCGCCCCCAA CCAGGTCAAA 360 
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WO 02/086443 , : : 

GTGGCAGGGA;AACCAATGTA CCTGCACATC GGGGAGGAGA TAGACGGCGT GGACATGCGG 420 
GCGGAGGTTG GGCTTCTGAG .CCGGAACATC. ATAGTGATGG GGGAGATGGA GGACAAATGC 480 
TACCCCTACA GAAACCACAT .CTGCAATTTC TTTGACTTCG ATACCTTTGG GGGCCACATC 540 
AAGTTTGCTC TGGGA.TTTAA GGCAGCACAC TTGGAGGGCA CGGAGCTGAA GCATATGGGA 600 
CAGCAGCTGG TGGGTCAGTA CCCGATTCAC TTCCACCTGG CCGGTGATGT AGACGAAAGG 660 
GGAGGTTATG ACCCACCCAC ATACATCAGG " GACCTCTCCA TCCATCATAC ATTCTCTCGC 720 
TGCGTCACAG TCCATGGCTC CAATGGCTTG TTGATCAAGG ACGTTGTGGG CTATAACTCT 780 
TTGGGCCACT GCTTCTTCAC GGAAGATGGG CCGGAGGAAC GCAACACTTT TGACCACTGT 840 
CTTGGCCTCC TTGTCAAGTC TGGAACCCTC CTCCCCTCGG ACCGTGACAG CAAGATGTGC 900 
AAGATGATCAv- CAGGAGACTC CTACCCAGGG TACATCCCCA AGCCCAGGCA AGACTGCAAT 960 
GCTGTGTCCA CCTTCTGGAT GGCCAATCCC AACAACAACC TCATCAACTG TGCCGCTGCA 1020 
GGATCTGAGG AAACTGGATT TTGGTTTATT TTTCACCACG TACCAACGGG CCCCTCCGTG' 1080 
GGAATGTACT. CCCCAGGTTA. TTCAGAGCAC ATTCCACTGG GAAAATTCTA TAACAACCGA 1140 
GCACATTCCA ACTACCGGGC TGGCATGATC ATAGACAACG GAGTCAAAAC CACCGAGGCC. 1200. 
TCTGCCAAGG: ACAAGCGGCC GTTCCTCTCA ATCATCTCTG CCAGATACAG CCCTCACCAG 1260 
GACGCCGACC CGCTGAAGCC CCGGGAGCCG GCCATCATCA GACACTTCAT TGCCTACAAG 1320 
AACCAGGACC ACGGGGCCTG GCTGCGCGGC GGGGATGTGT GGCTGGACAG CTGCCGGTTT 1380 
GCTGACAATG GCATTGGCCT GACCCTGGCC AGTGGTGGAA CCTTCCCGTA TGACGACGGC 1440 
TCCAAGCAAG AGATAAAGAA CAGCTTGTTT GTTGGCGAGA^ GTGGCAACGT GGGGACGGAA- 1500 
ATGATGGACA" ATAGGATCTG GGGCCCTGGC GGCTTGGACC ATAGCGGAAG GACCCTCCCT .1560. 
ATAGGCCAGA' ATTTTCCAAT; TAGAGGAATT CAGTTATATG ATGGCCCCAT CAACATCCAA 1620 
AACTGCACTT TCCGAAAGTT TGTGGCCCTG. GAGGGCCGGC ACACCAGCGC CCTGGCCTTC * 16 BO 
CGCCTGAATA ATGCCTGGCA- GAGCTGCCCC CATAACAACG TGACCGGCAT TGCCTTTGAG \ 1740 
GACGTTCOGA TTACTTCCAG AGT GTTCTT C' GGAGAGCCTG- GGCCCTGGTT CAACCAGCTG; 1800 
GACATGGATG GGGATAAGAC ATCTGTGTTC CATGACGTCG: ACGGCTCCGT GTCCGAGTAC 1860 
CCTGGCTCCT: ACCTCACGAA GAATGACAAC TGGCTGGTCC GGCACCCAGA. CTGCATCAAT> 1920 
GTTCCCGACT GGAGAGGGGC : . CATTTGCAGT- GGGTGCTATG CACAGATGTA CATTCAAGCC 1980 
TACAAGACCA GTAACCTGCG. AATGAAGATC ATCAAGAATG ACTTCCCCAG CCACCCTCTT 2040 
TACCTGGAGG GGGCGCTCAC. CAGGAGCACC CATTACCAGC AATACCAACC GGTTGTCACC 2100 
CTGCAGAAGG- GCTACACCAT. CCACTGGGAC CAGACGGCCC CCGCCGAACT- CGCCATCTGG'- 2160 
CTCATCAACT. TCAACAAGGG CGACTGGATC CGAGTGGGGC TCTGCTACCC GCGAGGCACC 2220 
ACATTCTCCA TCCTCTCGGA- TGTTCACAAT CGCCTGCTGA AGCAAACGTC CAAGACGGGCi 2280 
GTCTTCGTGA GGACCTTGCA GATGGACAAA GTGGAGCAGA- GCTACCCTGG CAGGAGCCAC 2340 
TACTACTGGG ACGAGGACTC AGGGCTGTTG: TTCCTGAAGC TGAAAGCTCA GAACGAGAGAr 2400 
GAGAAGTTTG CTTTCTGCTC CATGAAAGGC TGTGAGAGGA TAAAGATTAA' AGCTCTGATT; 2460 
CCAAAGAACG CAGGCGTCAG- TGACTGCACA GCCACAGCTT ACCCCAAGTT CACCGAGAGG . 2520 
GCTGTCGTAG ACGTGCCGAT GCCCAAGAAG CTCTTTGGTT' CTCAGCTGAA- AACAAAGGAC 2580 
CATTTCTTGG AGGTGAAGAT GGAGAGTTCC AAGCAGCACT TCTTCCACCT CTGGAACGAC " 2640 
TTCGCTTACA' TTGAAGTGGA TGGGAAGAAG- TACCCCAGTT- CGGAGGATGG CATCCAGGTG 2700 
GTGGTGATTG ACGGGAACCA AGGGCGCGTG GTGAGCCACA* CGAGCTTCAG GAACTCCATT 2760 
CTGCAAGGCA' TACCATGGCA^ GCTTTTCAAC TATGTGGCGA CCATCCCTGA CAATTCCATA 2820 
GTGCTTATGG . CATCAAAGGG'. AAGATACGTC TCCAGAGGCC CATGGACCAG AGTGCTGGAA-. 2880 
AAGCTTGGGG CAGACAGGGG": TCTCAAGTTG AAAGAGCAAA TGGCATTCGT TGGCTTCAAA 2940 
GGCAGCTTCC GGCCCATCTG. GGTGACACTG GACACTGAGG ATCACAAAGC CAAAATCTTC ' 3000 
CAAGTTGTGC CCATCCCTGT. GGTGAAGAAG AAGAAGTTGT GAGGACAGCT GCCGCCCGGT 3060 
GCCACCTCGT. GGTAGACTAT GACGGTGACT CTTGGCAGCA GACCAGTGGG GGATGGCTGG 3120 
GTCCCCCAGC CCCTGCCAGC AGCTGCCTGG GAAGGCCGTG TTTCAGCCCT GATGGGCCAA 3180 
GGGAAGGCTA TCAGAGACCC TGGTGCTGCC ACCTGCCCCT ACTCAAGTGT CTACCTGGAG; 3240 
CCCCTGGGGC GGTGCTGGCC AATGCTGGAA ACATTCACTT TCCTGCAGCC TCTTGGGTGC 3300. 
TTCTCTCCTA TCTGTGCCTC TTCAGTGGGG GTTTGGGGAC CATATCAGGA GACCTGGGTT 3360 
GTGCTGACAG CAAAGATCCA CTTTGGCAGG AGCCCTGACC CAGCTAGGAG GTAGTCTGGA 3420 
GGGCTGGTCA TTCACAGATC. CCCATGGTCT TCAGCAGACA AGTGAGGGTG GTAAATGTAG 3480 
GAGAAAGAGC CTTGGCCTTA- AGGAAATCTT TACTCCTGTA AGCAAGAGCC AACCTCACAG 3540 
GATTAGGAGC TGGGGTAGAA CTGGCTATCC TTGGGGAAGA GGCAAGCCCT GCCTCTGGCC 3600 
GTGTCCACCT TTCAGGAGAC TTTGAGTGGC AGGTTTGGAC TTGGACTAGA TGACTCTCAA 3660 
AGGCCCTTTT AGTTCTGAGA' TTCCAGAAAT CTGCTGCATT TCACATGGTA CCTGGAACCC. 3720 
AACAGTTCAT GGATATCCAC TGATATCCAT GATGCTGGGT GCCCCAGCGC ACACGGGATG 3780 
GAGAGGTGAG AACTAATGCC TAGCTTGAGG GGTCTGCAGT. CCAGTAGGGC AGGCAGTCAG- 3840 
GTCCATGTGC ACTGCAATGC CAGGTGGAGA AATCACAGAG AGGTAAAATG GAGGCCAGTG 3 900 
CCATTTCAGA GGGGAGGCTC AGGAAGGCTT CTTGCTTACA GGAATGAAGG CTGGGGGCAT - 3960 
TTTGCTGGGG GGAGATGAGG CAGCCTCTGG AATGGCTCAG: GGATTCAGCC CTCCCTGCCG 4020 
CTGCCTGCTG AAGCTGGTGA CTACGGGGTC GCCCTTTGCT CACGTCTCTC TGGCCCACTC 4080 
ATGATGGAGA AGTGTGGTCA GAGGGGAGCA ATGGGCTTTG CTGCTTATGA GCACAGAGGAv 4140 
ATTCAGTCCC CAGGCAGCCC TGCCTCTGAC TCCAAGAGGG TGAAGTCCAC AGAAGTGAGC 4200 
rCCTGCCTTA GGGCCTCATT TGCTCTTCAT CCAGGGAACT GAGCACAGGG GGCCTCCAGG 4260 
AGACCCTAGA TGTGCTCGTA CTCCCTCGGC CTGGGATTTC AGAGCTGGAA ATATAGAAAA 4320 
TATCTAGCCC AAAGCCTTCA" TTTTAACAGA TGGGGAAAGT GAGCCCCCAA GATGGGAAAG 4380 
AACCACACAG CTAAGGGAGG GCCTGGGGAG CCCCACCCTA GCCCTTGCTG CCACACCACA 4440 
TTGCCTCAAC AACCGGCCCC AGAGTGCCCA GGCACTCCTG AGGTAGCTTC TGGAAATGGG . 4500 
GACAAGTCCC CTCGAAGGAA AGGAAATGAC TAGAGTAGAA TGACAGCTAG CAGATCTCTT. 4560. 
CCCTCCTGCT CCCAGCGCAC ACAAACCCGC CCTCCCCTTG GTGTTGGCGG TCCCTGTGGC 4620 
CTTCACTTTG TTCACTACCT GTCAGCCCAG CCTGGGTGCA CAGTAGCTGC AACTCCCCAT 4680 
. TGGTGCTACC TGGCTCTCCT GTCTCTGCAG CTCTACAGGT GAGGCCCAGC AGAGGGAGTA- 4740. 
GGGCTCGCCA TGTTTCTGGT GAGCCAATTT GGCTGATCTT GGGTGTCTGA ACAGCTATTG 4800 
GGTCCACCCC AGTCCCTTTC AGCTGCTGCT TAATGCCCTG CTCTCTCCCT GGCCCACCTT 4860 
ATAGAGAGCC CAAAGAGCTC CTGTAAGAGG GAGAACTCTA TCTGTGGTTT ATAATCTTGC 4920 
ACGAGGCACC AGAGTCTCCC TGGGTCTTGT GATGAACTAC ATTTATCCCC TTTCCTGCCC 4980 
CAACCACAAA CTCTTTCCTT CAAAGAGGGC CTGCCTGGCT CCCTCCACCC AACTGCACCC 5040 ' 
ATGAGACTCG GTCCAAGAGT CCATTCCCCA GGTGGGAGCC AACTGTCAGG GAGGTCTTTC 5100 
CCACCAAACA TCTTTCAGCT GCTGGGAGGT GACCATAGGG CTCTGCTTTT AAAGATATGG 5160 
CTGCTTCAAA GGCCAGAGTC ACAGGAAGGA CTTCTTCCAG GGAGATTAGT GGTGATGGAG 5220 
AGGAGAGTTA . AAATGACCTC ATGTCCTTCT TGTCCACGGT TTTGTTGAGT TTTCACTCTT S280 
CTAATGCAAG GGTCTCACAC TGTGAACCAC TTAGGATGTG ATCACTTTCA GGTGGCCAGG 5340 
AATGTTGAAT GTCTTTGGCT- CAGTTCATTT AAAAAAGATA TCTATTTGAA AGTTCTCAGA 5400 
GTTGTACATA TGTTTCACAG TACAGGATCT GTACATAAAA GTTTCTTTCC TAAACCATTC 5460 
ACCAAGAGCC : AATATCTAGG CATTTTCTTG GTAGCACAAA' TTTTCTTATT GCTTAGAAAA 5520 
TTGTCCTCCT TGTTATTTCT GTTTGTAAGA CTTAAGTGAG TTAGGTCTTT AAGGAAAGCA 5580 
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ACGCTCCTCT GAAATGCTTG TCTTTTTTCT GTTGCCGAAA TAGCTGGTCC TTTTTCGGGA 5640 

GTTAGATGTA TAGAGTGTTT GTATGTAAAC. ATTTCTTGTA GGCATCACCA YGAACAAAGA 5700 

\ TAT ATTTTCT . ATTTATTTAT TATATGTGCA ' CTTCAAGAAG TCACTGTCAG AGAAATAAAG 5760 

AATTGTCTTA AATGTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA i • 



*Seq id NO: 45V Protein sequence 
Protein Accession 8: XP_051860.2 



.MDGVNLSTEV 



DMRAEVGLLS 
HMGQQLVGQY 
YNSH3HCFFT 
DCNAVSTFWM 
NNRAHSNYRA 
AYKNQDHGAW: 
GTEMKDHRIW. 
LAFRLNNAWQ. 
SEYPGSYI/TK 
HPLYLEGALT 
RGTTFSILSD 
NEREKFAFCS 
TKDHFLBVKM- 
NSILOGIPWQ 
GFKGSFRPIW 



11 " ' 

I : 

VYKXGQDYRF 
GDTLVIASTD 
RNIIVMGEME 

' PIHFHIAGDV 
EDGPEERNTF 

rANPNNNLINC 
GMI IDNGVKT- 

' LRGGDWLDS 
GPGGLDHSGR 

.SCPHNNVTGI' 
NDNWLVRHPD. 
RSTHYQQYQP 
VHNRLLKQTS. 
MKGCERIKIK 
ESSKQHPFHIi. 
LFNYVATIPD 
VTLDTEDHKA 



21 ' 
I 

ACYDRGRACR 
YSMYQAEEFQ 
DKCYPYRNBT- 
DERGGYDPPT. 
DHCLGLLVKS 
AAAGSEETGF. 
TEASAKDKRP- 
CRFADNGIGL 
TLPIGQNFPI: 
AFEDVPITSR 
CINVPDWRGA 
WTLQKGYTI 
KTGVFVRTLQ: 
ALIPKNAGVS* 
WNDFAYIEVDr 
NSIVLMASKG 
KIFQWPIPVf 



31 
I 

SYRVRFLCGK- 
VLPCRSCAPN- 
CNFFDFDTFG 
YIRDLSIHKT. 
GTLLPSORDS- 
WFIFHHVPTG 
FLSIISARYS- 
TLASGGTFPV 
RGIQLYDGPI 
VFFGEPGPWF 
ICSGCYAQMY 
HWDQTAPAEL 
MDKVEQSYPG 
DCTATAYPKF 
GKKYPSSEDG 
RYVSRGPWTR 
VKKKKL 



41 
I 

PVRPKLTVTI 
QVKVAGKPMY, 
GHIKFALGFK 
FSRCVTVHGS: 
KMCKMITGDS 
PSVGMYSPGY. 
PHQDADPLKP 
DDGSKQEIKN- 
NIQNCTFRKFj 
NQIiDMDGDXT, 
IOATfKTSNLR 
AIWLINFNKG 
RSHYYWDEDS 
TERAWDVPM 
rQWVIDGNQ 
VLEKLGADRG 



51 



DTNWSTILN 

LHIGEEIDGV 

AAHLEGTELK 

NGLLIKDWG 

YPGYIPKPRQ 

SEHIPLGKFY- 

REPAIIRHFi: 

SLFVGESGNV; - 

VALEGRHTSA 

SVFHDVDGSV.. 

MKIIKNDFPS-- 

DWIRVGLCYP: 

GLLFLKLKAQ: 

PKKLFGSQLK 

GRWSHTSFR- 

LKLKEQMAFV 
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300 
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Seq ID NO i 452 DNA sequence 

Nucleic Acid Accession fh . Eos sequence 

Coding sequence: 261..2B61 

1 11 21 31 41 51 

I I I I I 1- 

GAGCTAGCGC TCAAGCAGAG CCCAGCGCGG TGCTATCGGA CAGAGCCTGG CGAGCGCAAG 60 

CGGCGCGGGG AGCCAGCGGG, GCTGAGCGCG GCCAGGGTCT GAACCCAGAT TTCCCAGACT 120 

AGCTACCACT CCGCTTGCCC ACGCCCCGGG AGCTCGCGGC GCCTGGCGGT CAGCGACCAG 180 
ACGTCCGGGG, CCGCTGCGCT CCTGGCCCGC GAGGCGTGAC ACTGTCTCGG. CTACAGACCC, 240 

AGAGGGAGCA' CACTGCCAGG ATGGGAGCTG CTGGGAGGCA GGACTTCCTC TTCAAGGCCA 300 

TGCTGACCATa CAGCTGGCTC ACTCTGACCT GCTTCCCTGG GGCCACATCC ACAGTGGCTG. 360 

CTGGGTGCCC- TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GACCAAGACC 420 

ACCATGTGCA- TATCGGCCAG GGCAAGACAC TGCTGCTCAC CTCTTCTGCC ACGGTCTATT 480 

CCATCCACAT ' CTCAGAGGGA GGCAAGCTGG TCATTAAAGA CCACGACGAG CCGATTGTTT 540 

TGCGAACCCG GCACATCCTG ATTGACAACG" GAGGAGAGCT GCATGCTGGG AGTGCCCTCT 600 

GCCCTTTCCA GGGCAATTTC ACCATCATTT TGTATGGAAG GGCTGATGAA GGTATTCAGC 660 

CGGATCCTTA. CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGGCGCT CTTGAGTTGC 720 

ATGGACAGAA AAAGCTCTCC TGGACATTTC TGAACAAGAC CCTTCACCCA GGTGGCATGG 780 

CAGAAGGAGG CTATTTTTTT GAAAGGAGCT. GGGGCCACCG TGGAGTTATT GTTCATGTCA 840 

TCGACCCCAA ATCAGGCACA GTCATCCATT CTGACCGGTT: TGACACCTAT AGATCCAAGA 900 

AAGAGAGTGA ACGTCTGGTC CAGTATTTGA ACGCGGTGCC CGATGGCAGG. ATCCTTTCTG 960 

TTX3CAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT GGCCAGGAAG GCGATGACCA 1020 

AATTGGGAAG: CAAACACTTC CTGCACCTTG: GATTTAGACA CCCTTGGAGT TTTCTAACTG 1080 

TGAAAGGAAA TCCATCATCT- TCAGTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 1140 

CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA TGGCGAATAT TTCAATGTTT 1200 

CTTTGTCCAG TGAGTGGGTT CAAGACGTGG AGTGGACGGA G7GGTTOGAT CATGATAAAG 1260 

,TATCTCAGAC TAAAGGTGGG. GAGAAAATTT, CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 1320 

TATGCAATCG TCCCATTGAT- ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 

AGGTTGTCTA CAAAAAAGGC CAGGATTATA- GGTTTGCTTG CTACGACCGG GGCAGAGCCT 1440 

GCCGGAGCTA CCGTGTACGG. TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA- CTCACAGTCA 1500 

CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GGATAATGTA CAGTCATGGA 1560 

AACCTGGAGA TACCCTGGTC ATTGCCAGTA- CTGATTACTC CATGTACCAG GCAGAAGAGT 162 0 

TCCAGGTGCT TCCCTGCAGA TCCTGCGCCC CCAACCAGGT CAAAGTGGCA GGGAAACCAA 1680 

TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGGACAT GOGGGCGGAG GTTGGGCTTC 1740 

TGAGCOGGAA CATCATAGTG ATGGGGGAGA TGGAGGACAA ATGCTACCCC TACAGAAACC 1800 

ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1660 

TTAAGGCAGC ACACTTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 1920 

AGTACCCGAT TCACTTCCAC CTGGCCGGTG: ATGTAGACGA AAGGGGAGGT TATGACCCAC 1980 

CCACATACAT CAGGGACCTC TCCATCCATC ATACATTCTC TOGCTGCGTC ACAGTCCATG 204 0 

GCTCCAATGG * CTTGTTGATC ' AAGGACGTTG TGGGCTATAA CTCTTTGGGC CACTGCTTCT 2100 

TCACGGAAGA TGGGCCGGAG GAACGCAACA CTTTTGACCA CTGTCTTGGC CTCCTTGTCA 2160 

AGTCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 2220 

ACTCCTACCC AGGGTACATC CCCAAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTCT 2280 

GGATGGCCAA TCCCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATCT GAGGAAACTG 2340 

GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCCCCTC CGTGGGAATG TACTCCCCAG 2400 

GTTATTCAGA GCACATTCCA CTGGGAAAAT TCTATAACAA CCGAGCACAT TCCAACTACC 2460 

GGGCTGGCAT GATCATAGAC AACGGAGTCA AAACCACCGA GGCCTCTGCC AAGGACAAGC 2520 

GGCCGTTCCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA CCAGGACGCC GACCCGCTGA 2580 

AGCCCCGGGA. GCCGGCCATC ATCAGACACT TCATTGCCTA CAAGAACCAG GACCACGGGG: 2640 

CCTGGCTGCG CGGCGGGGAT GTGTGGCTGG" ACAGCTGCCA TTTCAGAGGG GAGGCTCAGG 2700 

AAGGCTTCTT [ GCTTACAGGA ATGAAGGCTG GGGGCATTTT GCTGGGGGGA" GATGAGGCAG 2760 

CCTCTGGAAT GGCTCAGGGA TTCAGCCCTC CCTGCCGCTG CCTGCTGAAG CTGGTGACTA 2620 

CGGGGTCGCC CTTTGCTCAC GTCTCTCTGG . CCCACTCATG ATGGAGAAGT GTGGTCAGAG 2880 

GGGAGCAATG . GGCTTTGCTG CTTATGAGCA- CAGAGGAATT CAGTCCCCAG GCAGCCCTGC 2940 

CTCTGACTCC AAGAGGGTGA AGTCCACAGA AGTGAGCTCC TGCCTTAGGG CCTCATTTGC 3000 

TCTTCATCCA GGGAACTGAG CACAGGGGGC CTCCAGGAGA CCCTAGATGT GCTCGTACTC 3060 

CCTCGGCCTG., GGATTTCAGA GCTGGAAATA TAGAAAATAT CTAGCCCAAA GCCTTCATTT 3120 
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TAACAGATGG GGAAAGTGAG CCCCCAAGAT GGGAAAGAAC CACACAGCTA AGGGAGGGCC 
. TGGGGAGCCC CACCCTAGCC CTTGCTGCCA CACCACATTG CCTCAACAAC CGGCCCCAGA 
GTGCCCAGGC ACTCCTGAGG TAGCTTCTGG AAATGGGGAC AAGTCCCCTC GAAGGAAAGG 
AAATGACTAG AGTAGAATGA CAGCTAGCAG ATCTCTTCCC TCCTGCTCCC AGCGCACACA 
AACCCGCCCT CCCCTTGGTG TTGGCGGTCC CTGTGGCCTT CACTTTGTTC ACTACCTGTC 
AGCCCAGCCT . GGGTGCACAG TAGCTGCAAC TCCCCATTGG TGCTACCTGG CTCTCCTGTC 
TCTGCAGCTC TACAGGTGAG GCCCAGCAGA GGGAGTAGGG CTCGCCATGT TTCTGGTGAG 
CCAATTTGGC TGATCTTGGG. TGTCTGAACA GCTATTGGGT CCACCCCAGT CCCTTTCAGC 
TGCTGCTTAA' TGCCCTGCTC TCTCCCTGGC CCACCTTATA GAGAGCCCAA' AGAGCTCCTG 
TAAGAGGGAG AACTCTATCT GTGGTTTATA ATCTTGCACG AGGCACCAGA GTCTCCCTGG- 
GTCTTGTGAT GAACTACATT TATCCCCTTT CCTGCCCCAA CCACAAACTC TTTCCTTCAA 
AGAGGGCCTG CCTGGCTCCC TCCACCCAAC TGCACCCATG AGACTCGGTC. CAAGAGTCCA 
TTCCCCAGGT GGGAGCCAAC. TGTCAGGGAG GTCTTTCCCA: CCAAACATCT TTCAGCTGCT 
GGGAGGTGAC CATAGGGCTC. TGCTTTTAAA GATATGGCTG CTTCAAAGGC CAGAGTCACA- 
GGAAGGACTT CTTCCAGGGA GATTAGTGGT . GATGGAGAGG : AGAGTTAAAA TGACCTCATG- 
TCCTTCTTGT. CCACGGTTTT GTTGAGT TT T CACTCTTCTA ATGCAAGGGT CTCACACTGT 
GAACCACTTA GGATGTGATC ACTTTCAGGT. GGCCAGGAAT? GTTGAATGTC TTTGGCTCAG ,. 
' TTCATTTAAA- AAAGATATCr: ATTTGAAAGT- TCTCAGAGTT GTACATATGT TTCACAGTAC 
AGGATCTGTA' CATAAAAGTT TCTTTCCTAA ACCATTCACC AAGAGCCAAT ATCTAGGCAT 
TTTCTTGGTA GCACAAATTT TCTTATTGCT TAGAAAATTG TCCTCCTTGT TATTTCTGTT 
TGTAAGACTT AAGTGAGTTA GGTCTTTAAG: GAAAGCAACG CTCCTCTGAA' ATGCTTGTCT 
TTTTTCTGTT, GCCGAAATAG CTGGTCCTTT TTCGGGAGTT AGATGTATAG AGTGTTTGTA 
TGTAAACATT ' TCTTGTAGGC ATCACCATGA ACAAAGATAT ATTTTCTATT TATTTATTAT 
ATGTGCACTT CAAGAAGTCA CTGTCAGAGA AATAAAGAAT TGTCTTAAAT GTCATGATTG 
GAGATGTCCT TTGCATTGCT TGGAAGGGGT GTACCTAGAG CCAAGGAAAT TGGCTCTGGT 
TTGGAAAAAT TTTGCTGTTA TTATAGTAAA CATACAAAGG ATGTCAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AA - 



Seq ID KO: 4S3 Protein sequence 
Protein Accession #: Eos sequence 



1 
I 

HGAAGRQDFL 
GKTLLLTSSA- 
TIII.YGRADE 
- ERSWGHRGVI. 
SRNLDDMARK 
KLFQTEHGEY 
IQATTMDGVNr 
STILNLEDNV 
EIDGVDMRAE; 
GTELKHMGQO-' 
KDWGYNSLG- 
PKPRQDCNAV- : 
LGKFYNNRAH. 
IRHFIAYKNQ 
FSPPCROxIiK 



11 
I 

FKAMLTISWL 
TVYSIHISEG 
GIQPDPYYGL 
VHVIDPKSGT 
AMTKLGSKHF 
FNVSIjSSEWV 
LSTEWYKKG » 
QSWKPGDTLV 
VGLLSRNIIV. 
LVGQYPIHFH 
HCFFTEDGPE 
STFWMANPNN: 
SNYRAGMIID. 
DHGAWLRGGD 
LVTTGSPFAH 



21 
I 

TliTCFPGATS 
GKLVIKDHDE 
KYIGVGKGGA 
VIHSDRFDTY 
LHLGFRHPWS 
QDVEWTEWFD 
QDYRFACYDR 
IASTDYSMYQ 
MGEMEDKCYP 
LAGDVDERGG 
ERNTFDHCLG: 
NLINCAAAGS. 
NGVKTTEASA 
VWLDSCHFRG 
VSLAHS 



31 



TVAAGCPDQS 
PIVLRTRHlIi 
LELHGQKKLS. 
RSKKESERLV 
FLTVKGNPSS 
HDKVSQTKGG 
GRACRSYRVR 
AEEFQVLPCR 
YRNHICNFFD 
YDPPTYIRDL 
LLVKSGTIjLP 
EETGFWFIFH- 
KDKRPFLSII 
EAQEGFLLTG 



41 
I 

PELQPWNPGH 
IDNGGELHAG 
WTFLNKTLHP 
QYLNAVPDGR- 
SVEDHIEYHG 
EKISDLWKAH 
FliCGKPVRPK 
SCAPNQVKVA 
FDTFGGHIKF 
SIHHTFSRCV 
SDEDSKMCKM 
HVPTGPSVGM 
SARYSPHQDA 
MKAGG IliLGG 



51 
I 

DQDHHVHIGQ 

SALCPFQGNF 

GGMAEGGYFF 

ILSVAVNDEG. 

HRG S AAARVF • - 

PGK1CNRPID 

LTVTIDTNVN:; 

GKPMYLHIGE 

ALGFKAAHLE 

TVHGSNGLLI- 

ITEDSYPGYI 

YSPGYSEHIP 

DPliKPREPAI 

DEAASGMAQG 



Seq IO NO: 454 DNA sequence 

Nucleic Acid Accession #: NM_013282.2 

Coding sequence: 85.. 2466 



CGACTCCTTA 
GTCCCTCCCC 
ACCCACACGG 
CAGGAGCTGT 
GAGGACGGCC 
GTCCGCCAGA 1 
ACCGACTCCG 
GCGGCCGCCG 
GGGCTGTACA 
GAGGCGCAGG 
ACGTCCAGGC 
GAGAACGGCG 
AAGTGGCAGG. 
AAGGAGCGGG 
CGGGAACTCT 
TTCGTGGACG 
. CCCATGAGAC 
TGCCGGGTCT. 
TGCGATGAGT 
CCCAGCGAGG 
GCGGGAGAGC 
TCACAGOGGG 
GTCCCGTCCA 
CGAGTCCAGG 
AGCAACGACG 
GGGAATTTTT 
GCGGAACAGT 

rrrGCTCccA 

GTCAGGGTGG 
AACCGCTACG 
TTTCTCGTGT 
GAGGGGAAGG 



11 
I 

GAGCATGGCA 
TCAGCGCCGA 
TGGACTCGCT 
TCCACGTGGA 
ATACCCTCTT 
GCCTCGTGCT, 
GCTGCTGCCT 
AGACTGACAG. 
AGGTCAATGA 
TGGTCAGGGT 
CGGCGCTGGA 
TGGTCCAGAT 
ACCTGGAGGT 
GCTTCTGGTA 
ACGCCAACGT 
AAGTCTTCAA 
GGAAGAGCGG 
GCGCCTGCCA 
GCGACATGGC 
ACGAGTGGTA 
GGCTGAGAGA' 
ACTGGGGCAA 
ACCACTACGG 
TCAGCGAGTC 
GAGCGTACTC 
TCACATACAC 
CTTGTGATCA 
TCAATGACCA 
TGCGCAATGT 
ATGGCATCTA 
GGCGCTACCT 
ACCGGATCAA 



21 

I 

TGGCTCAGAG 
CACCATGTGG 
GTCCAGGCTG 
GCCAGGCCTG 
CGACTACGAG 
CCCCCACAGC 
GGGCCAGAGT 
CAGGCCAGCC 
GTACGTCGAT 
GACGCGGAAG 
GGAGGACGTC 
GAACTCCAGG 
GGGCCAGGTG 
CGACGCGGAG 
GGTGCTGGGG: 
GATTGAGCGG 
GCCGTCCTGC 
CCTGTGCGGG 
CTTCCACATC 
CTGCCCTGAG 
GAGCAAGAAG 
GGGCATGGCC- 
ACCCATCCCG 
GGGTGTCCAT 
CCTAGTCCTG 
GGGTAGTGGT 
GAAACTCACC 
AGAAGGGGCC 
CAAGGGTGGC 
CAAGGTTGTG 
TCTGCGGAGG 
GAAGCTGGGG 



31 
I 

GTGCTGGTAA 
ATCCAGGTTC 
ACCAAGGTGG 
CAGAGGCTGT 
GTCGGCCTGA 
ACCAAGGAGC 
GAGTCAGACA 
GATGAGGACA 
GCTCGGGACA 
GCCCCCTCCC 
ATTTACCACG. 
GACGTCCGAG 
GTCATGCTCA 
ATCTCCAGGA" 
GATGATTCTC 
CCGGGTGAAG 
AAGCACTGCA 
GGCCGGCAGG 
TACTGCCTGG 
TGCCGGAATG 
AAGGCGAAGA 
TGTGTGGGCC 
GGGATCCCCG. 
CGGCCCCACG 
GCGGGGGGCT 
GGTCGAGATC 
AACACCAACA 
GAGGCCAAGG 
AAGAATAGCA 
AAATACTGGC 
GACGATGATG 
CTGACCATGC 



41 
I 

AACTGATGGG 
GGACCATGGA 
AGGAGCTGAG 
TCTACAGGGG 
ATGACACCAT 
GGGACTCCGA 
AGTCCTCCAC 
TGTGGGATGA 
CGAACATGGG 
GGGACGAGCC 
TGAAATACGA' 
CGCGCGCCCG 
ACTACAACCC 
AGCGCGAGAC 
TGAACGACTG 
GGAGCCCCAT 
AGGACGACGX 
ACCCCGACAA 
ACCCGCCCCT 
ATGCCAGCGA 
TGGCCTCGGC 
GCACCAAGGA 
TGGGCACCAT 
TGGCTGGCAT 
ATGAGGATGA 
TTTCCGGCAA 
GGGCGCTGGC 
ACTGGCGGTC 
AGTACGCCCC 
CCGAGAAGGG 
AGCCTGGCCC 
AGTATCCAGA 



51 
I 

GGTTTTTGCT 
OGGGAGGCAG 
GCGGAAGATC 
CAAACAGATG 
CCAGCTCCTG 
GCTCTCCGAC 
CCACGGCGAG 
GACGGAATTG 
GGCGTGGTTT 
CTGCAGCTCC 
CGACTACCCG 
CACCATCATC 
CGACAACCCC 
CAGGACGGCG- 
TCGGATCATC 
GGTTGACAAC 
GAACAGACTC 
GCAGCTCATG 
CAGCAGTGTT 
GGTGGTACTG 
CACATCGTCC 
ATGTACCATC 
GTGGCGGTTC 
ACACGGCCGG 
CGTGGACCAT 
CAAGAGGACC 
TCTCAACTGC 
GGGGAAGCCG 
CGCTGAGGGC 
GAAGTCCGGG 
TTGGACGAAG 
AGGCTACCTG 
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3360 
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3600 
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3900 
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4020 

4080 

4140 

4200 
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60 
120 
180 
240 
300 
360 
420 
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540 
600 
660 
720 
780 
840 
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720 
780 
840 
900 
960 
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1500 • 
1560 
1620 
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1740 
1800 
1860 
1920 
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GAAGCCCTGG CCAACCGAGA GCGAGAGAAG GAGAACAGCA AGAGGGAGGA GGAGGAGCAG 1980 

CAGGAGGGGG GCTTOGCGTC CCCCAGGAOG GGCAAGGGCA AGTGGAAGCG GAAGTCGGCA 2040 
GGAGGTGGCC CGAGCAGGGC CGGGTCCCCG CGCOGGACAT CCAAGAAAAC CAAGGTGGAG ■ 2100 

CCCTACAGTC TCACGGCCCA GCAGAGCAGC CTCATCAGAG AGGACAAGAG CAACGCCAAG 2160 

CTGTGGAATG AGGTCCTGGC GTCACTCAAG GACCGGCCGG CGAGCGGCAG CCCGTTCCAG. 2220 

TTGTTCCTGA GTAAAGTGGA GGAGACGTTC CAGTGTATCT GCTGTCAGGA GCTGGTGTTC 2280 

CGGCCCATCA OGACCGTGTG CCAGCACAAC GTGTGCAAGG ACTGCCTGGA CAGATCCTTT 2340 
CGGGCACAGG TGTTCAGCTG CCCTGCCTGC CGCTACGACC TGGGCCGCAG CTATGCCATG . . 2400 

CAGGTGAACC AGCCTCTGCA GACCGTCCTC AACCAGCTCT TCCCCGGCTA CGGCAATGGC 2460 

CGGTGATCTC CAAGCACTTC TCGACAGGCG TTTTGCTGAA AAOGTGTCGG AGGGCTCGTT 2520 

CATCGGCACX GATTTTGTTC TTAGTGGGCT TAACTTAAAC AGGTAGTGTT TCCTCCGTTC 2580 

CCTAAAAAGG TTTGTCTTCC TTTTTTTTTA TTTTTATTTT TCAAATCTAT ACATTTTCAG 2640 

GAATTTATGT ATTCTGGCTA AAAGTTGGAC TTCTCAGTAT TGTGTTTAGT TCTTTGAAAA 2700 

CATAAAAG CC TG CAATTT CT. CGACAAAACA ACACAAGATT TTTTAAAGAT GGAATCAGAA 2760 

ACTACGTGGT- GTGGAGGCTG TTGATGTTTC TGGTGTCAAG TTCTCAGAAG TTGCTGCCAC 2820 

CAACTCTTTA' AGAAGGCGAC AGGATCAGTC CTTCTCTAGG GTTCTGGCCC CCAAGGTCAG 2680 

AGCAAGCATC TTCCTGACAG CATTTTGTCA' TCTAAAGTCC AGTGACATGG TTCCCCGTGG 2940, 

TGGCCCGTGG CAGCCCGTGG.' CATGGCGTGG CTCAGCTGTC TGTTGAAGTT GTTGCAAGGA 3000 

AAAGAGGAAA- CATCTCGGGC CTAGTTCAAA CCTTTGCCTC AAAGCCATCC CCCACCAGAC 3060 

TGCTTAGCGT. CTGAGATCCG CGTGAAAAGT CCTCTGCCCA CGAGAGCAGG GAGTTGGGGC 3120 

CACGCAGAAA ; TGGCCTCAAG- GGGACTCTGC TCCACGTGGG GCCAGGCGTG TGACTGACGC ' 3180 

TGTCCGACGA- AGGCGGCCAC GGACGGACGC CAGCACACGA AGTCACGTGC AAGTGCCTTT 3240 

GATTCGTTCC TTCTTTCTAA AGACGACAGT. CTTTGTTGTT AGCACTGAAT TATTGAAAAT 3300 

GTCAACCAGA TTCTAGAAAC. TGCGGTCATC CAGTTCTTCC. TGACACCGGA TGGGTGCTTG 3360 

GGAACCGTTT GAGCCTTATA GATCATTT AC - ATTCAATTTT TTTAACTCAG CAAGTGAGAA 3420 

' CTTACAAG AG GGTTTTTTTT, TAATTTTTTT TTCTCTTAAT GAACACATTT TCTAAATGAA 3480. 

TTTTTTTTGT AGTTACTGTA TATGTACCAA" GAAAGATATA ACGTTAGGGT TTGGTTGTTT 3540 

TTGTTTTTGT ATTTTTTTTC TTTTGAAAGG GTTTGTTAAT TTTTCTAATT TTACCAAAGT 3600 

TTGCAGCCTA TACCTCAATA AAACAGGGAT ATTTTAAATC ACATACCTGC AGACAAACTG 3660 

GAGCAATGTT. ATTTTTAAAG GGTTTTTTTC ACCTCCTTAT TCTTAGATTA- TTAATGTATT 3720 

AGGGAAGAAT. GAGACAATTT TGTGTAGGCT TTTTCTAAAG TCCAGTACTT TGTCCAGATT 3780 
TTAGATTCTC AGAATAAATG TTTTTCACAG ATTGAAAAAA AAAAAAAA 



Seq ID NO: 455 Protein sequence 
Protein Accession fct NP_037414.2 

1 11 21 31 41 51 

I I I I I I 

MWIQVRTMDG RQTHTVDSLS RLTKVEELRR KIQELFHVEP. GLQRLFYRGK QMEDGHTLFD 60 
YEVRLNDTIQ. LLVRQSLVLP HSTKERDSEL SDTDSGCCLG QSBSDKSSTH GEAAAETDSR" 120 
PADEDMWDET- ELGLYKVNEY- VDARDTNMGA WFEAQWRVT RKAPSRDEPC SSTSRPALEE 180 
DVIYHVKYDD, YPENGWQMM SRDVRARART: IIKWQDLEVG QWMLNYNPD : NPKERGFWYD 240 
AEI SRKRETR ' TARELYANW LGDDSI^DCR- IIFVDEVPKI ; ERPGEGSPMV DNPMRRKSGP' 300 
SCKHCKDDVN' RLCRVCACHL CGGRQDPDKQ W4CDECDMAF HIYCLDPPLS SVPSEDEWYC 360 
PECRNDASEV. VLAGERLRES- KKKAKMASAT SSSQRDWGKG MACVGRTKEC TIVPSNHYGP .420 
IPGIPVGTMW RFRVQVSESG VHRPHVAGIH- GRSNDGAYSL VLAGGYEDDV DKGNFPTYTG 4 BO 
SGGRDLSGNK RTAEQSCDQK LTNTNRALAL NCFAPINDQE^ GAEAKDWRSG KPVRWRNVK 540 . 
GGKNSKYAPA EGNRYDGIYK WKYWPEKGK SGFLVWRYIjL RRDDDEPGPW TKEGKDRIKK 600 
LGLTMQYPEG" YLEALANRER- EKENSKREEE EQQEGGFASP RTGKGKWKRK SAGGGPSRAG 660 
SPRRTSKKTK VEPYSLTAQQ. SSLIREDKSN AKLWNEVLAS LKDRPASGSP FQLFLSKVEE 720 
TFQCICCQEL VFRPITTVCQ HNVCKDCLDR SFRAQVFSCP ACRYDIjGRSY AMQVNQPLQT 

Seq ID NO: 456 DNA sequence 

Nucleic Acid Accession #: NM_001200.1 

Coding sequence: 325.. 1514 

1 11 21' 31 41 51 

I I I I ! I 

GGGGACTTCT TGAACTTGCA GGGAGAATAA CTTGCGCACC CCACTTTGCG CCGGTGCCTT 60 
TGCCCCAGCG GAGCCTGCTT CGCCATCTCC GAGCCCCACC GCCCCTCCAC TCCTCGGCCT 120 
TGCCCGACAC TGAGACGCTG TTCCCAGCGT GAAAAGAGAG ACTGCGCGGC CGGCACCCGG 180 
GAGAAGGAGG AGGCAAAGAA AAGGAACGGA CATTCGGTCC TTGCGCCAGG TCCTTTGACC 240 
AGAGTTTTTC CATGTGGACG CTCTTTCAAT GGACGTGTCC CCGCGTGCTT CTTAGACGGA 300 
CTGCGGTCTC CTAAAGGTCG ACCATGGTGG CCGGGACCCG- CTGTCTTCTA GCGTTGCTGC 360 
TTCCCCAGGT CCTCCTGGGC GGCGCGGCTG GCCTCGTTCC GGAGCTGGGC CGCAGGAAGT 420 
TCGCGGCGGC GTCGTCGGGC CGCCCCTCAT CCCAGCCCTC TGACGAGGTC CTGAGCGAGT 480 
TCGAGTTGCG GCTGCTCAGC ATGTTCGGCC TGAAACAGAG ACCCACCCCC AGCAGGGACG 540 
CCGTGGTGCC CCCCTACATG CTAGACCTGT ATCGCAGGCA CTCAGGTCAG CCGGGCTCAC 600 
CCGCCCCAGA CCACCGGTTG GAGAGGGCAG CCAGCCGAGC CAACACTGTG CGCAGCTTCC 660 
ACCATGAAGA ATCTTTGGAA GAACTACCAG AAACGAGTGG GAAAACAACC CGGAGATTCT- 720 
TCTTTAATTT AAGTTCTATC CCCACGGAGG AGTTTATCAC CTCAGCAGAG CTTCAGGTTT 780 
TCCGAGAACA GATGCAAGAT GCTTTAGGAA ACAATAGCAG TTTCCATCAC CGA ATTAA TA 840 
TTTATGAAAT - CATAAAACCT GCAACAGCCA ACTCGAAATT" CCCCGTGACC AGACTTTTGG 900 
ACACCAGGTT GGTGAATCAG AATGCAAGCA GGTGGGAAAG TTTTGATGTC ACCCCCGCTG 960 

TGATGCGGTG GACTGCACAG GGACAOGCCA ACCATGGATT CGTGGTGGAA GTGGCCCACT 1020 

TGGAGGAGAA ACAAGGTGTC TCCAAGAGAC ATGTTAGGAT AAGCAGGTCT TTGCACCAAG 1080 

ATGAACACAG CTGGTCACAG ATAAGGCCAT TGCTAGTAAC TTTTGGCCAT GATGGAAAAG 1140 

GGCATCCTCT CCACAAAAGA GAAAAACGTC AAGCCAAACA CAAACAGCGG AAACGCCTTA 1200. 

AGTCCAGCTG TAAGAGACAC CCTTTGTACG TGGACTTCAG TGACGTGGGG TGGAATGACT 1260 

GGATTGTGGC TCCCCCGGGG TATCACGCCT. TTTACTGCCA CGGAGAATGC CCTTTTCCTC 1320 

TGGCTGATCA TCTGAACTCC ACTAATCATG CCATTGTTCA GACGTTGGTC AACTCTGTTA 1380 

ACTCTAAGAT TCCTAAGGCA TGCTGTGTCC CGACAGAACT CAGTGCTATC TCGATGCTGT 1440 

ACCTTGACGA GAATGAAAAG GTTGTATTAA AGAACTATCA GGACATGGTT GTGGAGGGTT 1500 
GTGGGTGTCG CTAGTACAGC AAAATTAAAT ACATAAATAT ATATATA 

Seq ID NO: -457 Protein sequence 
Protein Accession #: "NP_001191.1 
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1 .11 21 31 41 51 

I- .1 V ! II I 

MVAGTRCLLA UiPQVLLGG AAGLVPELGR RKFAAASSGR PSSQPSDEVL SEFELRLLSM 60 
5 PGLKQRPTPS RDAWPPYMI* DLYRRHSGQP GSPAPDHRLB RAASRAHTVR SFHHEESLEE 120 
LPETSGKTTR RFFFNLSSIP TSEPITSAEI* QVFREQKQDA LGNNSSFHHR INIYEIIKPA 180 
: TANSKFPVTR . LLDT 

A Seq ID NO :. .458 DNA sequence 
10 Nucleic Acid. Accession ft: NMJJ01999.2 
Coding sequence: 1..8736 

1 • 11 " 21 31 41 51 

1C I . -I I I 1 - I 

. I J ATGGGGAGAA GACGGAGGCT GTGTCTCCAG CTCTACTTCC TGTGGCTGGG CTGTGTGGTG 60 
CTCTGGGCGC AGGGCACGGC CGGCCAGCCT CAGCCTCCTC CGCCCAAGCC GCCCCGGCCC 120 
■* CAGCCGCCGC: CGCAACAGGT TCGGTCGGCT ACAGCAGGCT. CTGAAGGCGG GTTTCTAGCG 180 
CCCGAGTATC GCGAGGAGGG TGCCGCAGTG GCCAGCCGCG TCCGCCGGCG AGGACAGCAG 240 
GACGTGCTCC.' GAGGGCCCAA- . CGTGTGCGGC TCCAGATTCC ACTCCTACTG CTGCCCTGGA 300 

20 '. TGGAAGACGC ' TCCCTGGAGG' AAACCAGTGC ATTGTCCCGA TTTGTAGAAA TAGTTGTGGA 360 
" GATGGATTTT; GTTCCCGTCC TAACATGTGT ACTrGTTCCA GTGGGCAAAT ATGATCAACC 420 
TGTGGATCAA AATCAATTCA' GCAGTGCAGT GTGAGATGCA: TGAATGGTGG GACCTGTGCA 480 
GATGACCACT GCCAGTGCCA GAAAGGATAT ATTGGAACTT ATTGTGGACA ACCTGTCTGT. 540 
GAAAATGGAT GTCAGAATGG TGGACGTTGC ATCGCCCAAC CGTGTGCTTG . TGTTTATGGG 600 

.25 .TTCACTGGTC . CACAGTGTGA AAGAGATTAC AGGACAGGCC CGTGTTTCAC TCAGGTCAAC . S60 
AACCAGATGT GCCAAGGGCA - GCTGACAGGC ATTGTCTGCA : CGAAGACTCT. GTGCTGTGCC 720 
ACCACTGGAC GGGCGTGGGG CCATCCCTGT GAGATGTGTC CAGCCCAGCC TCAGCCCTGC 780 
CGACGGGGTT. TCATCCCCAA CATCCGCACT GGAGCTTGCC AAGATGTTGA TGAATGCCAG 840- 
. GCTATCCCAG GGATATGCCA AGGAGGAAAC TGTATCAATA CAGTGGGCTC TTTTGAATGC 900 

30 AGATGCCCTG . CTGGTCACAA- ACAGAGTGAA ACTACTCAGA- AATGTGAAGA CATTGATGAG 960 
TGCAGCATCA'TTCCTGGGAT ATGTGAAACT GGTGAATGTT CCAACACCGT GGGAAGCTAT 1020 
TTTTGTGTTT. GTCCACGTGG ATATGTAACC TCAACAGATG GCTCTCGATG CATCGATCAG 1080 
AGAACAGGCA. TGTGTTTCTC GGGCCTGGTG AATGGCCGCT GTGCACAAGA GCTCCCGGGG 1140 
AGAATGACGA AAATGCAGTG CTGCTGTGAG CCTGGCCGCT GCTGGGGCAT CGGAACCATT 1200 

35 CCTGAAGCCT GTCCTGTCAG AGGTTCTGAG GAATATCGCA GACTTTGCAT GGATGGACTT 1260 
' CCAATGGGAG GAATTCCAGG GAGTGCTGGT, TCCAGACCTG, GAGGCACTGG GGGAAATGGC 1320 
TTTGCCCCAA- GTGGCAATGG- CAATGGCTAT GGCCCAGGAG GGACAGGCTT CATCCCCATC 1380 
CCTGGAGGCA ATGGCTTTTC TCCTGGCGTT GGGGGAGCCG r GTGTGGGGGC CGGGGGACAG 1440 
- GGACCTATCA TCACTGCACT- AACAATTCTG AACCAGACAA TAGA TATCTG" TAAGCATCAT 1500 

40 GCTAACCTTT GTTTAAATGG ACGCTGTATA CCAACTGTCT CAAGCTSCCG ATGTGAATGC 1560 
AACATGGGTT ATAAGCAGGA TGCAAATGGA GATTGTATAG ATGTTGATGA ATGCACATCA- 1620 
AATCCCTGCA CTAATGGAGA TTGTGTTAAC ACACCTGGTT CCTATTATTG TAAATGTCAT 1680 
GCTGGATTCC AGAGGACTCC TACCAAG CAA GCATGCATTG' ATATTGATGA' GTGCATCCAG. 1740 
AATGGGGTTC TTTGTAAAAA CGGTCGATGC GTGAACTCAG ATGGAAGTTT CCAGTGCATT 1800 

45 TGCAATGCCG GCTTTGAATT : AACTACAGAT GGAAAAAACT GTGTTGATCA TGATGAATGT 1860 
ACAACTACCA ACATGTGTTT GAATGGAATG TGCATCAATG AAGATGGCAG CTTCAAGTGC 1920 
~" ATCTGCAAAC CAGGATTTGT . CTTGGCTCCA AATGGGCGTT ACTGTACTGA TGTTGATGAA 1980 
TGCCAGACCC CAGGAATCTG CATGAATGGG CACTGCATCA ACAGTGAAGG GTCCTTCCGC 2040 
TGTGACTGTC CCCCAGGCCT GGCTGTGGGC ATGGATGGAC GTGTGTGTGT TGATACTCAC 2100 

50 ATGCGCAGTA CCTGCTATGG AGGAATCAAG AAAGGAGTGT GTGTGCGTCC TTTCCCCGGT 2160 
GCAGTGACCA AGTCCGAATG CTGCTGTGCC AATCCAGACT ATGGTTTTGG AGAACCCTGC 2220 
CAGCCATGCC CTGCAAAAAA TTCAGCTGAA TTCCACGGCC TTTGTAGTAG TGGAGTAGGT 2280 
ATCACTGTGG ATGGAAGAGA TATCAATGAA TGTGCTTTGG ATCCTGATAT ATGTGCCAAT 2340 
GGGATTTGTG AAAACTTACG TGGTAGTTAC CGTTGTAATT GCAACAGTGG CTATGAACCA 2400 

55 GATGCCTCTG GAAGAAACTG TATTGACATT GATGAATGTT TAGTAAACAG ACTGCTTTGT 2460 
GATAACGGAT TGTGCCGAAA CACGCCAGGA AGTTACAGCT GTACGTGCCC ACCAGGGTAT 2520 
GTGTTCAGGA CTGAGACAGA GACCTGTGAA GATATAAATG AATGTGAAAG CAACCCATGT 2580 
GrCAATGGGG CCTGCAGAAA CAACCTTGGA" TCTTTCAATT GTGAATGTTC GCCCGGCAGC 264 0 
AAACTCAGCT CCACAGGATT GATCTGTATT GACAGCCTGA AGGGGACCTG TTGGCTCAAC 2700 

60 ATCCAGGACA GCCGCTGTGA GGTGAAT ATT AATGGAGCCA CTCTGAAATC TGAATGCTGT 2760 
GCCACCCTCG. GAGCCGCCTG GGGGAGCCCC TGTGAGCGGT GTGAACTAGA TACAGCTTGC 2820 
CCAAGAGGGC TTGCCAGGAT' TAAAGGTGTT ACGTGTGAAG ATGTTAATGA GTGTGAGGTG 2880 
TTCCCTGGCG TTTGTCCAAA TGGACGCTGT. GTCAACAGTA AGGGATCTTT TCATTGCGAG 2940 
TGCCCTGAAG GCCTTACGTT GGATGGGACT GGCCGTGTAT GTTTGGATAT TCGCATGGAG 3000 

65 CAGTGTTACT TGAAGTGGGA TGAAGATGAA' TGCATCCACC CCGTTCCTGG AAAGTTCCGC 3060 
ATGGATGCCT GCTGCTGTGC TGTCGGGGCG GCTTGGGGCA CCGAGTGTGA GGAGTGCCCC 3120 
AAACCTGGCA CCAAGGAATA CGAGACACTG- TGCCCCCGCG GGGCTGGCTT TGCTAACCGA 3180 
GGGGATGTTC TTACTGGGCG GCCATTTTAC AAAGACATCA ATGAATGCAA AGCATTTCCT 3240 
GGGATGTGCA CTTATGGGAA GTGCAGAAAT ACAATCGGAA GCTTCAAATG CCGTTGCAAT 3300 

70 AGTGGCTTTG CTCTAGACAT GGAGGAAAGA AACTGCACGG ACATCGACGA GTGCAGGATT 3360 
TCTCCTGACC TCTGTGGCAG TGGAATCTGC GTCAATACAC CGGGCAGCTT TGAGTGCGAG 3420 
TGCTTCGAAG .GCTATGAAAG TGGCTTCATG ATGATGAAGA ACTGCATGGA CATTGACGGA 3480 
■ TGTGAACGTA ACCCTCTCCT ' TTGTAGGGGT GGCACCTGTG TGAACACTGA GGGCAGCTTT 3540 
CAGTGTGACT GCCCACTGGG ACACGAGCTG TCACCATCCC GTGAGGACTG TGTGGATATT 3600 

75 ' AATGAATGCT CCCTGAGTGA CAATCTCTGC AGAAATGGAA AATGTGTGAA CATGATTGGA 3660 
ACCTATCAGT GCTCTTGCAA TCCTGGATAT CAGGCTACGC CAGACCGCCA GGGCTGTACA 3720 
•. GATATTGATG AATGTATGAT AATGAACGGA GGCTGTGACA CCCAGTGCAC AAATTCAGAG 3780 
GGAAGCTACG AATGCAGCTG CAGTGAGGGT TATGCCCTGA TGCCAGATGG GAGATCGTGT 3840 
GCAGACATTG ATGAATGTGA- AAACAATCCT GATATCTGTG: ATGGCGGCCA GTGTACCAAC 3900 

80 • ATTCCTGGAG AGTATCGCTG CCTCTGCTAT GATGGCTTCA TGGCTTCCAT GGACATGAAA 3960 
ACATGCATTG ATGTCAATGA ATGTGACCTA AATTCAAATA TCTGCATGTT TGGGGAATGT 4020 
GAGAACACAA AGGGATCCTT CATTTGCCAC TGTCAGCTGG GTTACTCAGT GAAGAAGGGG 4080 
ACCACAGGAT GTACAGATGT GGATGAGTGT GAAATTGGTG CTCATAACTG CGACATGCAT 4140 
GCCTCATGTC TGAATATCCC AGGAAGCTTC AAGTGTAGCT GCAGAGAAGG CTGGATTGGA 4200 

85 . AACGGCATCA AGTGTATTGA TCTGGACGAA TGTTCTAATG GAACCCACCA GTGTAGCATC 4260 
AATGCTCAGT ' GTGTAAATAC CCCGGGCTCA TACCGCTGTG CCTGCTCCGA AGGTTTCACT. 4320 
GGTGATGGCT TTACCTGCTC AGATGTTGAT GAGTGTGCAG AAAACATAAA CCTCTGTGAG 4380 
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AACGGACAGT GCCTTAATGT CCCGGGTGCA TATCGCTGCG AGTGTGAGAT GGGCTTCACT 4440 

CCAGCCTCAG ACAGCAGATC CTGCCAAGAT ATTGATGAAT GCTCCTTCCA AAACATTTGT 4500 

GTCTCTGGAA CATGTAATAA CCTGCCTGGA ATGTTTCATT GCATCTGCGA TGATGGTTAT 4560 

.GAATTGGACA GAACAGGAGG GAACTGTACA GATATTGATG AGTGTGCAGA TCCTATAAAC 4620 

TGTGTCAATG GCCTATGTGT CAACACGCCT GGTCGCTATG AGTGTAACTG CCCACCCGAT 4680 

TTTCAGTTGA ACCCAACTGG ' TGTGGGTTGT GTTGACAACC GTGTGGGCAA CTGCTACCTG * 4740 

AAGTTTGGAC CTCGAGGAGA TGGGAGTCTG TCTTGCAACA CCGAGATCGG GGTGGGCGTC 4800 

AGTCGCTCTT CATGCTGCTG ' CTCTCTGGGA" AAGGCCTGGG GAAACCCCTG TGAGACATGC 4860 

CCCCCTGTCA ATAGCACTGA ATATTACACC CTGTGTCCCG GAGGTGAAGG CTTCAGACCT 4920 

AACCCCATCA CAATCATTTT AGAAGACATT GAGGAATGCC AGGAGTTACC AGGTCTCTGC 4 9 BO 

CAGGGTGGAA ACTGCATCAA CACTTTTGGG AGCTTCCAGT GTGAGTGCCC ACAAGGCTAC 5040 

TACCTCAGCG AGGATACCCG CATCTGTGAG GATATTGATG AGTGTTTTGC ACATCCTGGT 5100 

GTGTGTGGGC CTGGGACCTG CTATAACACC CTGGGAAATT ACACCTGCAT TTGCCCACCT 5160 

GAGTACATGC AGGTCAATGG AGGCCACAAC TGCATGGACA' TGAGAAAAAG" CTTTTGCTAC 5220 

CGAAGCTATA ATGGAACCAC TTGTGAGAAT GAGTTGCCTT TCAATGTGAC AAAAAGGATG 5280 

TGCTGCTGCA CATATAATGT GGGCAAAGCT GGGAACAAAC CTTGTGAACC ATGCCCAACT - 5340 

CCAGGAACAG CTGACTTTAA AACCATATGT GGAAATATTC CTGGATTCAC. CTTTGACATT < 5400 

CACACAGGAA' AAGCTGTTGA- CATTGATGAA TGTAAAGAGA- TTCCAGGCAT- TTGTGCAAAT 54 60 

GGTGTGTGCA TTAACCAGAT TGGCAGTTTC CGCTGTGAAT GCCCTACAGG ATTCAGTTAC 5520 

AATGACCTGC TGTTGGTTTG TGAAGATATA ' GATGAGTGCA GCAATGGTGA TAATCTCTGC 5580 

CAGCGGAATG CAGACTGCAT CAATAGTCCT GGTAGTTACC GCTGTGAATG- TGCCGCGGGT 5640 

TTCAAACTTT CACCCAATGG" GGCCTGTGTA GATCGCAATG AATGTTTAGA' AATTCCTAAC 5700 

GTTTGCAGTC ATGGCTTGTG TGTTGATCTG CAAGGAAGTT ACCAGTGCAT CTGCCACAAT 5760 

GGCTTTAAGG CTTCTCAGGA CCAGACCATG TGCATGGATG TTGATGAGTG CGAGCGGCAC 5820 

CCATGTGGAA ATGGAACTTG TAAAAACACC. GTTGGATCCT ATAACTGTCT GTGCTACCCA 5680 

GGGTTTGAAC TCACTCATAA TAATGATTGC CTGGACATAG ATGAGTGCAG TTCCTTTTTT ■ 5940 

GGTCAGGTGT GCAGAAATGG ACGTTGTTTT AATGAAATTG GTTCTTTCAA GTGTCTATGT 6000 

AACGAAGGTT ATGAACTTAC. CCCAGATGGC AAAAACTGTA TAGACACTAA TGAGTGTGTC. 6060" 

GCCCTTCCCG GCTCTTGCTC TCCTGGTACC TGTCAGAATT-: TGGAGGGATC CTTCAGATGC 6120 

ATCTGTCCCC CAGGGTATGA AGTAAAAAGC. GAGAACTGCA TTGATATAAA TGAATGTGAT 6180 

GAAGATCCCA ACATTTGTCT TTTTGGTTCC TGTACTAATA" CTCCAGGGGG CTTCCAGTGC 6240 

CTCTGCCCCC CTGGCTTTGT. ACTATCTGAT AATGGACGGA GATGCTTTGA TACTCGCCAG. 6300 

AGCTTCTGCT TCACAAATTT TGAAAATGGA AAGTGTTCTG TACCCAAAGC TTTCAACACC . 6360 

ACAAAAGCAA AATGCTGCTG; TAGTAAGATG. CCAGGAGAGG GCTGGGGGGA CCCCTGTGAG 6420 

CTGTGGCCCA AAGACGATGA AGTTGCATTT CAGGATTTGT GTCCATATGG CCATGGAACT 6480 

GTCCCTAGTC TTCATGATAC ACGTGAAGAT. GTCAATGAGT GTCTTGAGAG. CCCAGGCATT ■ 6540 

TGTTCAAATG GTCAATGTAT CAACACCGAC GGATCTTTTC GCTGTGAATG TCCAATGGGC 6600 

TACAACCTTG ACTACACTGG - AGTACGCTGT GTGGATACTG ATGAGTGTTC AATCGGCAAT 6660 

CCGTGTGGAA ATGGTACATG CACCAATGTT ATTGGGAGTT TTGAATGCAA TTGCAATGAA 6720 

GGCTTTGAGC CAGGGCCCAT GATGAATTGT GAAGATATCA ACGAATGTGC ' CCAGAACCCA 6780 

CTGCTGTGTG CTTTACGCTG CATGAACACT. TTTGGGTCCT ATGAATGCAC GTGCCCGATT 6840 

GGCTATGCCC TCAGGGAAGA- TCAAAAGATG 1 TGCAAAGATC TGGATGAATG TGCTGAAGGG 6900 

TTACACGACT GTGAATCTAG GGGCATGATG: TGTAAGAATC TAATCGGCAC CTTCATGTGC 6960 

ATCTGCCCTC CTGGAATGGC CCGAAGGCCC GATGGAGAAG GCTGTGTAGA TGAAAATGAA 7020 

TGCAGGACCA AGCCAGGAAT CTGTGAAAAT GGACGTTGTG TTAACATTAT TGGAAGCTAT 7080 

AGATGTGAGT GTAATGAAGG. ATTCCAGTCA AGTTCTTCAG GCACTGAATG CCTTGACAAT 7140 

CGACAGGGTC . TCTGCTTTGC AGAGGTACTG. CAGACAATAT GTCAAATGGC ATCCAGTAGT 7200 

CGCAATCTCG: TCACTAAGTC AGAATGCTGC TGTGATGGTG GGCGAGGCTG GGGCCACCAG 7260 

TGCGAGCTTT GCCCACTTCC TGGAACTGCC CAGTACAAAA AGATATGTCC TCATGGCCCA 7320 

GGATATACAA CTGATGGAAG - AGATATTGAT GAATGTAAGG TAATGCCAAA CCTCTGCACC 7380 

AATGGTCAGT- GCATCAATAC" CATGGGCTCA' TTCCGATGCT- TCTGCAAGGT TGGCTACACC 7440 

ACAGACATCA GTGGAACCTC 7 TTGTATAGAC CTTGATGAAT GCTCCCAGTC CCCGAAACCA 7500 

TGCAACTACA TCTGCAAGAA CACTGAGGGG AGTTATCAGT GTTCATGTCC GAGGGGGTAT 7560 

GTCCTGCAAG AGGATGGAAA GACATGCAAA GACCTTGATG AATGTCAAAC AAAGCAGCAT 7620 

AACTGCCAGT TCCTCTGTGT CAACACCCTG GGGGGGTTTA CCTGTAAATG TCCACCTGGT 7680 

TTCACACAGC ATCACACTGC TTGTATCGAC. AACAACGAAT GTGGGTCTCA ACCTTTGCTT 7740 

TGTGGAGGAA AGGGAATCTG TCAAAACACT CCAGGCAGTT TCAGCTGTGA ATGCCAAAGA 7800 

GGGTTCTCTC TTGATGCCAC CGGACTGAAC TGTGAAGATG TTGATGAATG TGATGGGAAC 7860 

CACAGGTGCC AACACGGCTG CCAGAACATC CTGGGTGGCT ACAGATGTGG CTGCCCCCAA 7920 

GGCTACATCC AGCACTACCA GTGGAATCAG TGTGTCGATG AGAATGAATG CTCCAATCCC 7 9 BO 

. AATGCCTGTG GCTCTGCTTC CTGCTACAAC ACCCTGGGGA GTTACAAGTG CGCCTGCCCC 8040 

TCGGGGTTCT CCTTCGACCA GTTCTCCAGT GCCTGCCACG ACGTGAATGA GTGCTCGTCC 8100 

TCCAAGAACC CCTGCAATTA CGGCTGCTCT AACACGGAGG GGGGCTACCT CTGTGGCTGC 8160 

CCCCCTGGGT ATTACAGAGT GGGACAAGGC CACTGTGTCT CAGGAATGGG ATTTAACAAG 8220 

GGGCAGTACC TGTCACTGGA TACAGAGGTC GATGAGGAAA ATGCTCTGTC CCCAGAAGCA 82 BO 

TGCTACGAGT . GCAAAATCAA CGGCTATCCT AAGAAAGACA GCAGGCAGAA GAGAAGTATT 8340 

CATGAACCTG ATCCCACTGC TGTTGAACAG ATCAGCCTAG AGAGTGTCGA CATGGACAGC 8400 

CCCGTCAACA TGAAGTTCAA CCTCTCCCAC CTCGGCTCTA AGGAGCACAT CCTGGAACTA 8460 

AGGCCCGCCA TCCAGCCCCT CAACAACCAC ATCCGTTATG TCATCTCTCA AGGGAACGAT 8520 

GACAGCGTCT TCCGCATCCA CCAAAGGAAT GGGCTCAGCT ACTTGCACAC GGCCAAGAAG 8580 

AAGCTCATGC CCGGCACATA CACACTGGAA ATCACTAGCA TCCCTCTCTA CAAGAAGAAG 8640 

GAGCTTAAGA AACTGGAAGA GAGCAATGAG GATGACTACC TCCTAGGGGA GCTTGGGGAG 8700 

GCTCTCAGAA TGAGGCTGCA* GATTCAGCTC TATTAACCGT TCACAGACTT GGGCCCAGGC 8760 

TCAAATCCTA GCACAGCCAG TCTGCAGAAG CATTTGAAAA GTCAAGGACT AATTTTAAAG 8820 

AGGAAAAATA ATAATAACTC TTGTTTCTTT CCTCCCTGTC TTAGACTTTG AATGTTGACC 8880 

CTCACAGGGA GGGATAATTT AGACTCTGGT ATGGCCAAAG ATTTGAGCTC AAAGGCAACC 8940 

GTGGTTACTG TATTTTTTAT- . ATAACTTCAT TTTAAAATAT ATTAAAAGAA ACCTAAATGT 9000 

TCAAGATATC AGCATATGGC ACTAAATGCA CAA^AATAAT GTGAGCTTTT TTTTTTTTTT 9060 

CCTGTTAGCA GTCTGTAACA CTTTGGGTAT TTTGCTATAG TTGCTAATTA AAAAAATATA* 9120 

GATGTTTATT TATTTTTAAT GCAGTAATAT ATGGAGAAAT GAACAAACTA TGTAAACAAA 9180 

AAGGGAAACT CACTTGTTTT TCTTTAGATT TATAAATTTG AGCTATTTTT TTTAGAGGTG 9240 

CTTTTTAAAA ATCCAATAGA TACAAGAGAT GTTTCCTTTG GTTTTCTGCC AGTCATCCAG 9300 

CTGATACACA CCTGATCGAT TTTAAAGAAA GCCACACAGA GCTGAATCGG GCAGTGCTAA 9360 

TCAATAATTT AAAAGACATG AATGTCATTA GATCCTTTAT AACGTAGATC GAAGCCAAAG 9420 

CAGCTCATTT GTGACAACAT TTCATATCAC CAGACACACC AGGCAACAGA AGTTGAAGCA ' 9480 

CAACCACTGT AGCAAAATAC CTTGACTGCT TGTGAGACCA TTAGCATTGC AGGCCAAACC 9540 

GTACTGTATT TCCTTCTCAT AACCTCAAGG AACCATATGT GCTACCCACA ACACCTCATT 9600 
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. CTTACCCAGG GTGCGCTGCG TCCTCATGGT ACTGTAGGCA GCTGAAGAAC CGCCGTTCCC 9660 

. TTGAAAGGGA ACACCTGGCA TTCTGTGGTG TTTCGTGCTG TCTTAAATAA TGGTGCATTT 9720 

ATTATGTTCA AGTTATTTCA GGATTGCCAT ATGTGCAAAC AAATCATGCA ATGCAGCCAA 9780 

GGAATATATG TTGTTGTTGT TGTTTTAAAC CCATTTTTTT TTTAGAATTT TCATTAATAC 9840 

• TGT AGITATA CACCATATGC CTCATTTTAT CATAGCCTAT TGTGTATGAA AGATGTTTGT 9900 
ACAATGAATT GATGTTTAGT TTGCTTTAGT CATTTAAAAA GATATTGTAC CAGGATGTGC 9960 
TATTAAGAGC : ACGTATCCAT TATTCTTCTC AACCCAAGAA CCTGTTTCCT GGACCAGTGA 10020 
CCAAACCTCA- TATGTGAAAT GGCCAAAGCA CATGCAGGCT. CCTGGTTGTT CCTCTCAAAC 10080 
CTGTGCTGAC CAAAGATTAG TAACCAGTTA TACCCAGTAT TTTGAGGTTT TATTGTTTTT 10140 
TTAATAACTA AAAAAAAACT CGTGCC 

Seq ID NO: 459 Protein sequence 
Protein Accession #: NP_001990-1 

1 11 21 31 41 51 

I I i I J I 

MGRRRRLCLQ LYFLWLGCW LWAQGTAGQP QPPPPKPPRP .QPPPQQVRSA TAGSEGGFLA . 60 

" PEYREEGAAV. ASRVRRRGQQ DVLRGPNVCG SRFHSYCCPG WKTLPGGNQC IVPICRNSCG 120- 

DGFCSRPNMC TCSSGQISST CGSKSIQQCS- VRCMNGGTCA' DDHCQCQKGY IGTYCGQPVC 1B0 

ENGCQNGGRC, IAQPCACVYG FTGPQCERDYi RTGPCFTQVN NQMCQGQLTG- IVCTKTLCCA 240 
TTGRAWGHPC EMCPAQPQPC RRGFIPNIRT GACQDVDECQ AIPGICQGGN- CTNTVGSFEC- 300 

RCPAGHKQSE^ TTOKCEDIDE CSIIPGICET GECSNTVGSY. FCVCPRGYVT: STDGSRCIDQ 360 

RTGMCFSGLV NGRCAQELPG, RMTKMQCCCE PGRCWGIGTI- PEACPVRGSE EYRRLCMDGLi 420 

PMGGIPGSAG. SRPGGTGGNG FAPSGNGNGY GPGGTGFIPI PGGNGFSPGV GGAGVGAGGQ. 480 

GPIITGLTIL NQTIDICKHH ANLCLNGRCT PTVSSYRCEC NMGYKQDANG DCIDVDECTS. 540 

NPCTNGDCVN TPGSYYCKCH AGFQRTPTKQ ACIDIDECIQ. NGVLCKNGRC VNSDGSPQCI 600 

CNAGFELTTD GKNCVDHDEC. TTTNMCLNGM- CINEDGSFKC ICKPGFVLAP NGRYCTDVDE 660 

CQTPGICMNG HCINSEGSFR CDCPPGLAVG. MDGRVCVDTH MRSTCYGGIK KGVCVRPFPG 720 

AVTKSECCCA. NPDYGFGEPC QPCPAKNSAE FHGLCSSGVG. ITVDGRDINE CALDPDICAN 780 

. GICENLRGSY RCNCNSGYEP DASGRNCZDI DECLVNRLLC DNGLCRNTPG SYSCTCPPGY 840 

VFRTETETCE' DINECESNPC VNGACRNNLG SFNCECSPG3 KLSSTGLICI DSLKGTCWLN' 900 

IQDSRCEVNI NGATLKSECC ATLGAAWGSP CERCELDTAC PRGLARIKGV TCEDVNECEV 960 

FPGVCPNGRC VNSKGSFHCE CPEGLTLDGT GRVCLDIRME- QCYLKWDEDE CIHPVPGKFR 1020 

MDACCCAVGA AWGTECEECP - KPGTKEYETL CPRGAGPANR GDVLTGRPFY. KDINECKAFP 1080 

GMCTYGKCRN TIGSFKCRCN SGFALDMEER- NCTDIDECRI- SPDLCGSGIC VNTPGSFECE 1140 

CFEGYESGFM : MMKNCMDIDG CERNPLLCRG GTCVNTEGSF QCDCPLGHEL SPSREDCVDI 1200 

NECSLSDNLC RNGXCVNMIG TYQCSCNPGY QATPDRQGCT DIEECMIMNG GCDTQCTNSE 1260 

GSYECSCSEG" YALMPDGRSC ADIDECENNP DICDGGQCTN IPGEYRCLCY DGFMASMDMK ■ 1320 

TCIDVNECDLi NSNICMFGEC ENTKGSFICH CQLGYSVKKG TTGCTDVDEC EIGAHNCDMH 1380 

ASCLNIPGSF' KCSCREGWIG. NGIKCIDLDE CSNGTHQCSI: NAQCVNTPGS. YRCACSEGFT 1440 

GDGFTCSDVD*. ECAENINLCE NGQCLNVPGA YRCECEMGFT PASDSRSCQD IDECSFQNIC 1S00 

VSGTCNNLPG MFHCICDDGY ELDRTGGNCT DIDECAQPIN CVNGLCVNTP GRYECNCPPD 1560 

FQLNPTGVGC. VDNRVGNCYL KFGPRGDGSIr SCNTEIGVGV SRSSCCCSIK3 KAWGNPCETC " 1620 

PPVNSTEYYT- LCPGGEGFRP NPITIILEDI- DECQELPGLC QGGNCINTFG SFQCECPQGY 1680 

YLSEDTRICE" DIDECPAHPG. VCGPGTCYNT LGNYTCICPP EYMQVNGGHN CMDMRKSFCY- 1740 

RSYNGTTCEN ELPFNVTKRM- CCCTYNVGKA GNKPCEPCPT- PGTADFKTIC- GNIPGFTFDI 1800 

HTGKAVDIDE CKEIPGICAN GVCINQIGSF RCECPTGFSY- NDLLLVCEDI' DECSNGDNLC 1860 

QRNADCINSP GSYRCECAAG" FKLSPNGACV DRNECLEIPN VCSHGLCVDL' QGSYQCICHN 1920 

• GFKASQDQTM- CMDVDECERH PCGNGTCKNT. VGSYNCLCYP GFELTHNNDC LDIDECSSFF 1980 
GQVCRNGRCF NEIGSFKCLC NEGYELTPDG- KNCIDTNECV ALPGSCSPGT CQNLEGSFRC 2040 
ICPPGYEVKS ENCIDINECD EDPNICLFGS CTNTPGGFQC LCPPGFVLSD NGRRCFDTRQ. 2100 
SFCFTNFENG KCSVPKAFNT TKAKCCCSKM PGEGWGDPCE LCPKDDEVAF QDLCPYGHGT 2160 
VPSLHDTRED VNECLESPGI CSNGQCINTD GSFRCECPMG YNLDYTGVRC VDTDECSIGN. 2220 
PCGNGTCTNV. IGSFECNCNE GFEPGPMMKC EDINECAQNP LLCALRCMNT FGSYECTCPI 2280 
GYALREDQKM CKDLDECAEG LHDCESRGMM CKNLIGTFMC ICPPGMARRP-" DGEGCVDENE 2340 
CRTKPGICEN GRCVNIIGSY RCECNEGFQS SSSGTECLDN RQGLCFAEVL QTICQMASSS 2400 
RNLVTKSECC CDGGRGWGHQ CELCPLPGTA QYKKICPHGP GYTTDGRDID ECKVMPNLCT 2460 
NGQCINTMGS FRCFCKVGYT TDISGTSCID LDECSQSPKP CNYICKNTEG SYQCSCPRGY 2520 
VLQEDGKTCK DLDECQTKQH NCQFLCVNTL GGFTCKCPPG FTQHHTACID NNECGSQPLL 2580 
CGGKGICQNT PGSFSCECQR GFSLDATGUJ-' CEX3VDECDGN' HRCQHGCQNI LGGYRCGCPQ 2640 
GYIQHYQWNQ' CVDENECSNP* NACGSASCYN TLGSYKCACP. SGFSFDQFSS ACHDVNECSS 2700 
SKNPCNYGCS NTEGGYLCGC PPGYYRVGQG HCVSGMGFNK GQYLSLDTEV DEENALSPEA 2760 

' CYECKINGYP KKDSRQKRSI- HEPDPTAVEQ ISLESVDMDS PVNMKFNLSH LGSKEHILEL 2820 

RPAIQPLNNH IRYVISQGND DSVFRIHQRN GLSYLHTAKK KLMPGTYTLE ITSIPLYKKK 2880 
ELKXLEESNE DDYLLGELGE ALRMRLQIOL Y 

Seq ID NO*. 460 DNA sequence 

Nucleic- Ac id -Access ion 8: NM_013372.1 

Coding sequence: 63.. 617 

1 ll 21 31 41 . 51 

I ,:| I I II 

GCGGCCGCAC TCAGCGCCAC GCGTCGAAAG CGCAGGCCCC GAGGACCCGC CGCACTGACA 60 

GTATGAGCCG CACAGCCTAC ACGGTGGGAG CCCTGCTTCT CCTCTTGGGG ACCCTGCTGC 120 

CGGCTGCTGA AGGGAAAAAG AAAGGGTCCC'' AAGGTGCCAT CCCCCCGCCA GACAAGGCCC 180 

AGCACAATGA CTCAGAGCAG ACTCAGTCGC" CCCAGCAGCC TGGCTCCAGG AACCGGGGGC 240 

GGGGCCAAGG GCGGGGCACT GCCATGCCCG. GGGAGGAGGT GCTGGAGTCC AGCCAAGAGG 300 

CCCTGCATGT GACGGAGCGC AAATACCTGA AGCGAGACTG GTGCAAAACC CAGCCGCTTA 360 

AGCAGACCAT CCACGAGGAA GGCTGCAACA GTCGCACCAT CATCAACCGC TTCTGTTACG 420 

GCCAGTGCAA CTCTTTCTAC ATCCCCAGGC ACATCCGGAA GGAGGAAGGT TCCTTTCAGT 480 

CCTGCTCCTT CTGCAAGCCC AAGAAATTCA CTACCATGAT GGTCACACTC AACTGCCCTG 540 

AACTACAGCC ACCTACCAAG AAGAAGAGAG TCACACGTGT GAAGCAGTGT- CGTTGCATAT 600 

CCATCGATTT GGATTAAGCC AAATCCAGGT GCACCCAGCA TGTCCTAGGA ATGCAGCCCC 660 

AGGAAGTCCC AGACCTAAAA CAACCAGATT CTTACTTGGC TTAAACCTAG AGGCCAGAAG 720 

AACCCCCAGC TGCCTCCTGG CAGGAGCC7G CTTGTGCGTA GTTCGTGTGC ATGAGTGTGG 780 

ATGGGTGCCT GTGGGTGTTT TTAGACACCA GAGAAAACAC AGTCTCTGCT AGAGAGCACT 840 

CCCTATTTTG TAAACATATC TGCTTTAATG GGGATGTACC AGAAACCCAC CTCACCCCGG 900 
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CTCACATCTA AAGGGGOGGG 
GGGGACCAGA ATCTCCTTTC GGAATGAATG 
GACCTGTTTT AGTGCTGCAT - TCGACATGGA 
CTTTCCTCCT CCTCCTCACA ATCCATCTCT 
ATCTCTTGTT .TGCCAAGGTT CCTAAATTAA 

. TTTTGTGAAG ACCCTCCAGA CTCTGGGAGA 
TGGAGTGAGA- AAGGGAGGGT GGAGGGTGAG 
GACATTGCAG AAGCTTGAAA GGCCAATACC 
TTTTCCTAGT ATTTAACAGA ACCCAAGTGA 
ATTAACTTTG GCCGTTGCAA TCTGCTCAAA- 
ACCACTCCTA TGTTCGGACC CAAGCAAGTT 
CCCTCAGGTG' GAAAAGAGAG GTAGTTTAGA 
AAACCKCAGA- GGCTGAAATT" CCTAATACCT 
TCCATTCCAC TATTTCCCAT AATGCTTCTG 
GCCTCPGCTG AGTGTACCTG ACAGTAAGTC 
TTTTAGCAAG ARATATTKTG: GGGGTCTTTT 
RAGAGAAGAC" GACGAGAGTA AGGAAATAAA 
GGTGTTAATA CCTGGTAGAA ATGTAAGGGA 
AGGATCTGAG GGGACCCTGT TAGGAGAGCA 
CTACTGGTTG* GATGGACATA ACTATTGTAA 
TCTGATTAAA CTTGGCCTAC TGGCAATGGC 
AGGGTGGGTG: AACTTTATTG TACTTTGGAT 
TTTTATATAC AAACTCCCTG AATACTCTTT 
AGTCCTATGT AATATGGAAA ACAAACACTG 
TCTGGCATTC. AGAGAACCCT. TGCAACTCGA 
TCCAGTGCTC' TCCCATCTAA CAACTAAACA 
ACACCCAAAA TGTTGGGTCT ■ GATTTTCAAA 
CTAGGCGAAT TTGTCCAAAC ACATAGTGTG 
CCAAATCTTT GTATTGTCCA CATTCTCCAA 
ACACAAATGC TAAGGCAGAA TTTTGAGGGT 
ATGTAAAACC ACACCAGGGA GGAAAAATGA- 
TTGTTGTTTT AACTCTGCCA CAAGAATGCA 
AGCAGTAATC TTCTTTTAGG AGCTTGTACC 
CAAGTAAAGA GAATTTCCTC AACACTAACT 

' TAAAAGCATA" TCACTAGCCA". AAGAGGGAAA 
GACTAGTACA AATGTGGTGT GTCTTCCAAC 
TTTTATTCGA GTCACTGATG. ATGTAATGAT 
TTATGGCAAG ATATTTGTGG TCTTGATCAT 
TGAATTTTAT GATGTACACT : TTGTGCTTGG 
GTCTGTAAGT TGTTTTTTGT TACTGTAGGT 
TGGAGGAGAG - GATAATTTCC ACTGTGTGGA 
ATTTAATGTA ATTATTACTT CAAATCCTTT 
TTCTCCTTTA" TATGACTTTC TCTGAGTTGG 
TTAGAGTCTT TTATCTGGTC AGGGGAAACA 
GAGTCAGTGC CTGAATCTTT ATTT TTT AAA- 
AAGCAATATT AAGAAAGACT' TTAAATGTTA 
ACGAATAGCA GATAATGATG: ACTAGTTCAC 
AATGAAAAGT GGATAAACAG AACATTTATA 
AGTTCTATTG ACATTCCTCA' AGATATTTAA 
AAATCATTTA- AAAACGGCAA AGAATTATAT 
ATGAAAGGGG, AGTTGATAGT CTCATAAAAC 
ACTAGAATTT AATTTTCACC CCAATAATGT 
TAAATTAAAC CTATTCTTTC AAAAAAAAA 

Seq ID NO: 461 Protein sequence 
Protein Accession #: NP_037504.1 
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GGTTCTGACT 
TTCATGGAAG 
AAAGTCCTTT 
TCTTAAGTTG 
TTCACTTAAC 
GGCTGGTGTG 
GCCAAATCAG 
AGAACACAGG 
ACAGAGGAGA 
CCTAACACCA 
AGCTAAACCA 
ACTCTCTGCA 
TTCCTTTATC 
AGAGCCACTA 
TAAAGATGAR 
TGTTTTAACT" 
GGGRATTGCC. 
TATGACCTCC 
TAGCATCATG 
CTATTCAGTA 
TACTTAGGAT 
TTGGTTAACC 
TTGCCTTGTA 
CAGACTTGAG' 
GAAGCTGTTT 
GGAGCCATTT- 
CTTTTAAACT 
TGTGTTTTGT 
CAATAAAGCA 
GGGAGAGAAG 
CATTCAGAAC 
ATTTCGTTAA 
ACAGTCTTGC 
TCACTGGGAT 
TATCTGTTCT. 
TTTCATTGAA 
ATATTTTTTC 
ACCTATTAAA 
CATTAAAAGA 
CTTCAAAGTT 
ATGTGAATAG 
GGTCACTGTG 
GCAAAGAAGA' 
AAATCTTGAC 
TTGAATGTTC 
TTTTGGAAGA 
ACATAAAGTC 
AGTGATCAGT 
TATCAACTGC 
AGACTATGAG 
TAATTTGGCT 
TCTATATAGC 



TTGTGTTTTT 
AGGCTCCTCT 
TAACCTGTGC 
ATAGTGACTA 
CATGATGCAA 
GGCAAGGACA 
GTCCAGCAAA 
CTGATGCTTC 
AATGAGATTG 
AACTGAAAAC 
AACCAACTCC 
TAGGGGTGGG 
GTGGTTATAG 
ACTTGATTGA 
AGAGTTTAGG' 
ATTGTCAGGA 
TCTGGCTAGA 
CTTTCTTTAT 
ATGTATTAGC 
TTTACTGGTA 
TGATCTAAGG 
TGTTTTCTTC 
TCTTCTCAGC: 
ATTCAGTTGC. 
TTATTTCGTT 
CAAGGCGGGA 
CACTACTGAT 
ATACACTGTA 
CAGAGTGGAT 
AAAAGGGAAA 
CAGCAAACAC 
TGGAGATGAC 
ACATAAGTGC 
AATCAGCAGC 
TCTTACTGTG 
AATGCCATAT 
ATTATTATAG 
ATAATGCCAA 
AAAAAACACA 
AAGAGTGTAA 
TTAAATGAAA 
ATTTCAAGCA 
AGCTGACACA 
CCAGCTGAAC 
CTTAAAGGTT 
CTTACGATGC 
CTTTTAAGGA 
TAATGCCTAA 
ATTATGTATT 
GTACCTTGCT 
TCAAGTTTCA 
CTTTGCTAAA 



GTGCCCTCCT 

GAGGGCAAGA 

TTGCATCCTC 

TGTCAGTCTA' 

ATGTTTTTCA 

AGCAGGATAG' 

AGTCAGTAGG 

TGAGAAAGTC 

CCAGAAAGTG 

ATAAATACTG 

TCTGCTTTGT 

AATTAATCAA 

TCAGCTCATT 

TAAAGATCCT 

GACTACTCTG 

GATTGGGCTA 

GAGTAAGTTA 

GTGCTCACTG 

TGTTCATCTG 

GGCACTGTCC 

GCCAAAGTGC 

AAGCCTGAGG 

CTCCTAGCCA 

CGATCAAGGC 

TTTGTTTTGA - 

GATATTTTAA - 

GATTCTCACG 

TGACCCCACC 

TTAATTAAGC 

GAAGCTGAAA- 

TGAATTTCTC 

TTAAGTTGGC 

AGATTTGGCT 

GTAACTACCC 

CCTATATTAA 

CTATACCATA 

TAGAATATTT 

ACACCAAATA • 

CATCCTGGAA 

GTGAAAAATC 

AGTTATGGTT. 

TGTTTTCTTT 

CCGTATGTTG 

ATGTCTTCCT 

AACATTTCTA 

ATGTATACAA 

GAAAATCTAA- 

GAGTGAAAGT 

ATGTCTGCTT 

GTGTAGGAGG- 

TGAATCTGTA 

GAGCAACTAA 



960 
1020 
1080" 
li'40 
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1320 
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. 1740 
1B00 
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1920. 
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2280 
2340 
2400 
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3720 
3780 
3840 
3900 
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4020 



1 11. 21 31 41 51 
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MSRTAYTVGA LLLLLGTLLP AAEGKKKGSQ GAIPPPDKAQ HNDSEQTQSP QQPGSRNRGR 
• GQGRGTAMPG - EBVLESSQEA LHVTERKYLK RDWCKTQPLK QTIHEEGCNS RTIINRFCYG 
QCNSFYIPRH IRKEEGSFQS CSFCKPKKFT TMMVTLNCPB LQPPTKKKRV TRVKQCRCIS 

idld 

Seq ID NO: 462 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..2733 



ATGAAAGTTG 
TTCCTGGGGA 
CATCTAGGCC 
GAGAAAAGAG 
GGGCTAATTA 

' CTGCAGTGf A 
AACTGCTACC 
AGCCAGAGTG 
AGGTTTACAA 
ATTGAAATTC 
ACCCAATTTC 

' TCTGAACTGC 
CTGTTTCCAT 
GTCTTTGGAT 
GGAAACATCA 
CTCTCTCTGC 
GCAGCTGTGT 



11 
I 

GAGTGCTGTG 
AAAATGATGG 
CAGTCGAAGA 
ATTTGAGAAA 
GAATTATCAG 
CCTGTGAAGA 
TTCACACGGC 
TCAATTTCTG 
ATGACCTTTT 
AACTTAAAAA 
GAAATGGAAG 
TGTCAGCCAT 
TAGAAGACGG 
TTGGGTCCAA 
CAGCCAAGTG 
TTGAAGAACT 
CATCCTTCGT 



21 
I 

GCTCATTTCT 
CATCAAAACA 
ATATCAGCTG 
TTTTCTGAAG 
AGCAAAGGCT 
CAGCTACACC 
TGGAGCACTC 
TGAGAGAACA 
GAATTCATCT 
AGCATATGAA 
CATCGTTGCT 
TGAACATGTT 
CTCTTTCAGA 
GGATGATGAA 
TGAGTCCTCT 
GAACAAGAAT 
GCAAAATCTT 



31 
I 

TTCTTCACCT 
AAAAAAGAAC 
CTGCTTCAGG 
CTCTTGAAGC 
ACCACAGACT 
TGGTTTCCTC 
CCAAGCTGTG 
AAGATTTGGG 
TCTGCTATAT 
AGAATTCAAG 
GGGTATGAAG 
GCCGAGAAGG 
GTGTTCGGAA 
TATACCCTGC 
GGGTGGCAGG 
TTCAGTATGA 
TCTGTCATCA 



41 

I 

TCACTGACGG 
TCATTGTGAA 
TGACCTATAG 
CTCCATTATT 
GCAACAGCCT 
CCTCATGCCT 
AATGTCATCT 
GCACTTTCAA 
ACTCCAAATA 
GTTTTGAGTC 
TTGTTGGCTC 
CTAAGACAGC 
AAGCCCAGTG 
CCTGCAGCAG 
TCATCAGGGA 
TTGTAGGCAA 
TTCGGCAAAA 



51 
i 

CCACGGTGGC 
TAAGAAAAAA 
AGATTCCAAG 
ATGGTCACAT 
GAATGGAGTC 
TGATCCCCAG 
CAACAACCTC 
AATTAATGAA 
TGCAAATGGA 
GGTTCAGGTC 

cagcagtgca" 
ccttcacaag 
taatgacatt 

TGGCTACAGG 
GACTTGTGTG 
TGCCACTGAG 
CCCATCAACC 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



361 



. 5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

ACAGTGGGGA ATCTGGCTTC GGTGGTGTCG ATTCTGAGCA ATATTTCATC TCTGTCACTG 
GCCAGCCATT. TCAGGGTGTC CAATTCAACA ATGGAGGATG TCATCAGTAT AGCTGACAAT 
ATCCTTAATT CAGCCTCAGT. AACCAACTGG ACAGTCTTAC TGCGGGAAGA AAAGTATGCC 
AGCTCACGGT TACTAGAGAC ATTAGAAAAC ATCAGCACTC TGGTGCCTCC GACAGCTCTT 
CCTCTGAATT-. TTTCTCGGAA ATTCATTGAC TGGAAAGGGA TTCCAGTGAA CAAAAGCCAA 
CTCAAAAGGG GTTACAGCTA TCAGATTAAA ATGTGTCCCC AAAATACATC TATTCCCATC 
AGAGGCCGTG TGTTAATTGG GTCAGACCAA TTCCAGAGAT CCCTTCCAGA AACTATTATC 
AGCATGGCCT CGTTGACTCT GGGGAACATT CTACCCGTTT CCAAAAATGG AAATGCTCAG 
GTCAATGGAC CTGTGATATC CACGGTTATT CAAAACTATT CCATAAATGA ■ AGTTTTCCTA 
TTTTTTTCCA- AGATAGAGTC AAACCTGAGC CAGCCTCATT GTGTGTTTTG GGATTTCAGT 
CATTTGCAGT GGAACGATGC ~AGGCTGCCAC CTAGTGAATG AAACTCAAGA CATCGTGACG 
TGCCAATGTA CTCACTTGAC CTCCTTCTCC ATATTGATGT CACCTTTTGT- CCCCTCTACA- 
ATCTTCCCCG. TTGTAAAATG GATCACCTAT GTGGGACTGG GTATCTCCAT- TGGAAGTCTC 
ATTTTATGCC TGATCATCGA GGCTTTGTTT TGGAAGCAGA~ TTAAAAAAAG CCAAACCTCT 
CACACACGTC GTATTTGCAT GGTGAACATA GCCCTGTCCC TCTTGATTGC TGATGTCTGG 
TTTATTGTTG GTGCCACAGT GGACACCACG GTGAACCCTT CTGGAGTCTG CACAGCTGCT 
GTGTTCTTTA CACACTTCTT .CTACCTCTCT- TTGTTCTTCT GGATGCTCAT GCTTGGCATC 
CTGCTGGCTT ACCGGATCAT CCTCGTGTTC . CATCACATGG CCCAGCATTT GATGATGGCT 
GTTGGATTTT GCCTGGGTTA TGGGTGCCCT- CTCATTATAT CTGTCATTAC CATTGCTGTC 
ACGCAACCTA GCAATACCTA •CAAAAGGAAA' GATGTGTGTT GGCTTAACTG GTCCAATGGA 
AGCAAACCAC TCCTGGCTTT. TGTTGTCCCT GCACTGGCTA TTGTGGCTGT.' GAACTTCGTT 
GTGGTGCTGC TAGTTCTCAC ' AAAGCTCTGG AGGCCGACTG TTGGGGAAAG ACTGAGTCGG 
GATGACAAGG CCACCATCAT CCGCGTGGGG AAGAGCCTCC TCATTCTGAC CCCTCTGCTA 
GGGCTCACCT-- GGGGCTTTGG AATAGGAACA ATAGTGGACA GCCAGAATCT. GGCTTGGCAT 
GTTATTTTTG; CTTTACTCAA TGCATTCCAG. GGATTTTTTA TCTTATGCTT TGGAATACTC 
TTGGACAGTA AGCTGCGACA- ACTTCTGTTC AACAAGTTGT CTGCCTTAAG TTCTTGGAAG 
CAAACAGAAA AGCAAAACTC ATCAGATTTA TCTGCCAAAC CCAAATTCTC AAAGCCTTTC 
AACCCACTGC AAAACAAAGG CCATTATGCA TTTTCTCATA CTGGAGATTC CTCCGACAAC 
ATCATGCTAA CTCAGTTTGT CTCAAATGAA TAA 



Seq ID NO:. 463 Protein sequence 
Protein Accession 8 : Eos sequence 



MKVGVL.WLIS. 
EKRDLRNFLK 
NCYLHTAGAL 
IEIQLKKAYE 
LFPLEDGSFR. 
LSLLEELNKN: 
ASHFRVSNST- 
PLNPSRKFID 
SMASLTLGNI 
HLQWNDAGCH; 
ILCLIIEALF- 
VFFTHFFYLS. 
TQPSNTYKRK 
. DDKATIIRVG 
LDSKIiRQLLF 
IMLTQFVSNE 



11 
I 

FFTFTDGHGG. 
LLKPPLLWSH 
PSCECHLNNL- 
RIQGFESVQV, 
VFGKAQC2iDI: 
FSMIVGNATE 
MEDViSIADN- 

wkgipvnksq: 
lpvskngnaq 
lvnetqdivt: 
wkoikksqts 
lffwmlmlgi- 
dvcwlnwsng 
kslliltplu 

NKLSALSSWK 



21 . 
J 

FLGKNDGIKT 
GLIRIIRAKA- 
SQSVNFCERT. 
TQFRNGS IVA ' 
VFGFGSKDDE 
AAVSSFVQNU 
ILNSASVTNW 
LKRGYSYQIK 
VNGPVISTVI 
CQCTHLTSFS 
HTRRICMVNI 
LIAYRIILVF 
SKPLLAFVVP 
GLTWGFGIGT 
QTEKQNSSDL 



31 



KKELIVNKKK 
TTDCNSLNGV 
KIWGTFKINE 
GYEWGSSSA 
YTLPCSSGYR' 
SVIIRQNPST. 
TVLLREEKYA 
MCPQNTSIPI. 
QNYSINEVFL 
ILMSPFVPST 
ALSLLIADVW-. 
HHHAQHLMMA 
ALAIVAVNFV 
IVDSQN1AWH' 
SAKPKFSKPF 



41 

I : 

HLGPVEEYQL 
LQCTCEDSYT. 
RPTNDLLNSS 
SELLSAIEHV 
GNITAKCESS 
TVGNLASWS 
SSRLLETLEN- 
RGRVLIGSDQ 
FFSKIESNLS 
IFPWKWITY 
FIVGATVDTT 
VGFCLGYGCP 
WLLVLTKLW 
VIFALLNAFQ 
NPLQNKGHYA 



51 
I 

IiLQVTYRDSK 

WFPPSCIiDPQ . 

SAIYSXYANG 

AEKAKTALHK 

GWQVIRETCV 

ILSNISSLSL 

ISTLVPPTAL 

FQRSLPETII, 

QPHCVFWDFS 

VGLGISIGSL 

VNPSGVCTAA- 

LIISVITIAV 

RPTVGERLSR 

GFFILCFGIL 



Seq ID NO: 464 DNA sequence 
Nucleic Acid Accession #: AB035089. 
Coding sequence: 9845.. 10219 



1 
I 

GGGCATGCAG 
CAGTTCTAGT 
CCAAGAGGAA 
TTGGTTTGAA 
AGGAGGAAGG 
GAGACTATTT 
AAGAAAGGAA 
GTAGACAGAA 
AATCTCCTCC 

AATAAAATGT 
AAGGAAGTGA 
GTAATGACAG 
GGATTCCTGG 
CAAGTGTTCA. 
TCAGGATATG 
TAGGAGAACT 
CAAGCTTTCT 
TTCAACCTTC 
CCATAGATTG 
- CCTCCATTCC 
. TGGTACCCGA 
CTGGATGCAG. 
TTTCCTCTTC 
CTATTCCCCG 
ATATATATTG 
GTGAGAAATT 
CCTATGTGTT 
AAAACAAACT 
TGTGGAGAAA 



11 
I 

CCATCGGGGA 
-AAAAGGGAGA 
TTAGGGAGAG 
AGCATACAGT 
GAGGCAAGAG 
CCCTCTCTGC 
AGCTAGTTAG 
TCCTTGGGAA' 
ACTAACCAGT 
TCTCTTGACT 
GGTCCAGGAA 
GATATTTCCT 
AGCCAATGAA 
TATGCAAAAA 
TCCAGTCTCA 
ATTTAGGAAC 
CTAAATTTAA" 
AGGGCAAACC 
GTCCCCTGTA 
CAGGATGAGC 
GTAAATCCAT 
ACTCAGCTGA 
TTTCTTTTTC 
ACTTCAATCA 
ATACAGGAGA 
TCCAGATTGG 
TCTGGCACCT. 
CACGGCTGGT 
TCAGAACTCT 



21 

I 

AAATCCATAG 
ACATCAATAT 
AGTTATAAGA 
AAATATGATG 
ATAATATCAT 
TTTTCAAACC 
TCTTGTTCTG 
TACAGTAATT 
TTCCCTATAG 
TTGTTACTTA 
AATGTAGGAG 
GAAAGTGTAA 
GTTGGTGTAT 
CTTCTTGGAA 
CACACCAGGA 
AGAAAAAAAT 
GCAAACTCCT 

ACCCCGGTTT 
TT GTTGCTTC 
CCTACTCCAA 
GAAGACCATT 
TTCTATCTTT 
GGGAACTTAT 
CCTAAGAAGA 
AAACACAGCT 
TGTTGTAGAT 
GTTAAAAAGG 
ATTCACGGTC 



31 
I 

TGCAGATAAA 
AGGATGTTTC 
GATCAGCAAG 
TCTGTCCCTG. 
TTTCTCTGTG 
TTACTGGAGT 
AGGTTGTTCA 
GACATATATT 
ATTGCCACAA 
ACAATGCTGA 
ATATTTCTTA 
TTTCCCATTG 
GTTTATGAAA 
TTTCTGAGTT 
TATGTCCTTT 
GCCTGAAATG 
GGTCATTTTC 
AGACGTTTAG 
GTCTCAGCTT 
TGTCCTATGA 
TAGAGGAAGG 
ATTCATTTTT 
GGATTTTTAG 
ACCTCTTAAA 
GCATGTCTTG 
TCCTTTCTCC 
AAATCTCCCT 
GCCCATGACA 
GGTTGGAATG 



41 

I 

GCAAGGAGGA 
TTAGCAATAG 
GGGACAGGGT 
GCAGTGTTGG 
CTCCAACTGT 
TGTTTTCCCT ; 
ATGTATACAT 
CTGTTATTTG 
GCACATAATA 
GAAAACTTTA 
ACCAATCTAT 
AGGA.TTTGTT 
TATCAAGAGA 
CTCTGTGGCA 
CTAGCCTGTC 
ATTTCTCATT 
AGTTAGTACC 
CCATAGTCTG 
GTTATCCTGT 
GACATTAGAT 
TCCATTTTTG 
GGAATTCTTT 
TCCATCAACG 
CTCATTCAGA 
GGGGTTGAGG 
CATCCAGCCC 
TGAC7TTGTG 
ATAGCAAGTG 
CACACTTGTG 



TCTTATAGCG 
ATCTCAGATA 
CCCCATTAGT 
GACTCAAAAC 
AAACAGGCAG 
CTACTTTCAG 
ATGTGCTGAG 
TTGGGGAGAA 
CAGAATTCTA 
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60 
120 
180 
240 
300 
360 
420 
4B0 
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600 
660 
720 
780 
840 
900 



60 
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. WO 02/086443 • " - 

TGGAGAAGAG TCTGGCATTT CCTCAAAATG TTAACCTGGA TTTACCATAT GACCCAGCGA 1800 

TTTCATTCAT AGGTTTATAC TCAAAAGAAA TGAAGAAATA TGCCATGCAA AAAAATGTAC 1860 

ATGAAAGGTC ACAACATCAT TATTCATAAT AGTAAAAGGA TGGAAACAAC ACAAATGTCC - 1920 

ATCAACTTAT GATTAAAGAA AATCTGGTCT ATTCATAGAA TGGAATATTA TTCGACCACA 1980 

AAAAGGAATG ATGTACTGAT CCATGCAATG ATGTGGACAA ACCATGAAAA TAACACTAGA 2040 

TTAAAGAAGC CAGTCACAAA AGGACTTACT GTATGATTCC ATTTACCTGA AATGTTTGGA 2100 

ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT- CCAGGGTCTC CAGGAAGGGA 2160 

AGAATGAAGT ACAAGATTTC TTTTGGAGGT AGTGAAATTG TTGTGGAATG AGATCATGAT 2220 

GATGATAGCA CAACTTTGTG AATATAATAA AATCATTGAA TTGTACAGTT GAATTTATGG 2280 

TATATAAATT ATATGTTAAT AAAAAGGGGG TCCACAAAAC AAACAGCCCC CCACTCTGGT 2340- 

TGTCAGGGAG ATATTGGATT- AAATGGCCTT GGACAACAAC CCCTCTCCCT GGCCACAGAC 2400 

ATTCTTCAGA TTACAAGATA TTCCAGGGGA AACACTGGAA TGAGTCTGAA GCCAGGTGCT 2460 

AAACAGAAGG ACCATTGAGA AATGTTGTGA'- TCCTGACAGG TCAAGCAATT TATTTTTCGG 2520 

CTTCATTTTT AAATGTAAAA TTAGAAAGCT GCCATTTAAA ATGGCCCGTC TGTTTCAATT 2580 

GCTCTTCTCA GTGTCAGCCT GTTAACTCAA TGTGTTAGTC TGTTTTCATG CTGCTGATAA 2640 

AAACATACCT GAGACTGGCA AGAAAAAGAG GTTTAATTGG GCTTAGAGTT CCACGTGATT 2700 

GGGGAGGCCT CAGAATCACA 1 GTAGGAGGCA' AAAGTTATTC TTACATGGTG GCTGCAAGAG 2760 

AAGATGAGGA AGAAGCAAAA GAAGAAACCC. CTGATAAACC CATCGGATCT CCTGAGGCTT 2820 

ATTAACTATC ATGAGAATAG CACAAGAAAG ACCGGCCCCC' ATGATTCAAT TACCTCTACC 2880 

TGGGTCCCTC CAATAACATG TGGAAATTCT GGTAGATACA ATTCAAGTTG AGATTTGGGT 2 940 

GGGAACACAG CCAAACCATA TCACTCAGC* AGGCAGATAA CTTTCTCACT GAGCCTATGC 3000 

AACAGAAAAC CATCTGGGAT GGTTGTAAGG . GGCACAGGAA GTGACTGGTA GGATCACTGC 3060 

CAAAGCTGAG. CACTCAGGAG: AAGGCAATAG AATCCTATTC TCCATAGTAT G CTATAAG AT 3120 

ACTGAAGTAC ACTTCTTCAC TATCTCTTTG GACTTAGAAT TAGCACTACA TTCCTTGTTA 3180 

TACAGAAAAA TTACTAAGGA AATTCATAGG. ATGACAAAAA CTTTCAGAAC TGAAAAACAG 3240 

GAAATGTAAG CTTTTTAGTT- CTTTGGTATT CGAAGTATGC CTAAAAGACA ATGCAAAATC 3300 

CAAGAAAAGA ATGGTGGGGT TTTTGTTTGT TTGGTTTTGT TTTTGTTTTA- CAGCTGGAGT 3360 

AGAATACAAA GGGATGGAGT TGAAACAAAT GAGAGGAAAT TGGAATTCTA AACTTATTCT 3420 

CATTGGCATT AGAAAGGCAC CTACATGTAT TTCACATGAG CCGGTGACTG CTGACTTGCA- 3480 

TTCTTATTTT TTCCCTATAG ATTAAAAAGG AGGTACAATG GTAGAACTGT AATCCTGTCC 3540 

TTTGTCArAA ATTTTCATAT TCATAAAGGT GAGTGTTAGC CCGCTTGTGA AATCTGAAGT 3600 

TGAGTAACTT CAAATACTAA" CCACAGAGGG AAAGGCAGCA AGAGGAGAGG CATAAATTTA 3660 

GGATCTCACC CTTCATTCCA CAGACACACA CAGCCTCTCT. GCCCACCTCT GCTTCCTCTA 3720 

GGAACACAGG TAAGAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTTGAGAAT 37B0 

AATAATGATA CTGTGTTCTA TATCATGCAT CTCCTGCATT CTGTCTGATT ATATTTTACT 3840. 

TATTCTGCCA' GAGCAAAATT AAAATACCTA TTTCATCTGA TTTGTCCTTT ATCTAAATTG 3900 

CTTAGTTCCA AGTAAACCAA GGCACTTTTA GGAACACAGA GGGAGAGTGC CTTGCAGCCA 3960 

GAGAGTCTTG AAGGAGATGT CAGGGACGCA TCTTAACAGC TGGTTGGATG TGATCCACAG 4020 

AGGTCTCCTG TTAGCATTCA TTGTAAAGCC ATCCTACCTA GCTCTAGTGT AACCAGCAAT 4080 

GAAAGAAAGA TAAAGAGGGT- CGATTACTTA TTTACAAT AG TCTTTAAAAA CGTAGTTTTG 4140 

TAAGCCTTCT : AATTAGGACA TTAATATATT TAATATATGC ACATTGTAGA AAGATTGAAG 4200 

CGTTAAAAAT AAGAGAAAAA CTTTAAATGT CAAAATCTCA CAACCCAGAT ATATCATTTC 4260 

TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAGTCATTCT. GACAGGAATC 4320 

TGATGCTTTT CCAGGAGTTC CAGATCACAT CGAGTTCACC ATGAATTCAC TCAGTGAAGC 4380 

CAACACCAAG TTCATGTTCG ATCTGTTCCA ACAGTTCAGA AAATCAAAAG AGAACAACAT 4440 

CTTCTATTCC CCTATCAGCA TCACATCAGC ATTAGGGATG GTCCTCTTAG GAGCCAAAGA 4500 

CAACACTGCA CAACAAATTA GCAAGGTAGC TATCAGCATC ATTACGTTGT CCTGTTGCAG 4560 

TTTTTCTCTG GTTCCGTCGG CTAGCACGCA GATGGTAATA GATGTGGTGG TCTGATGGGT 4620 

AGCACAGGGG GCTGTGCAGG AATTCCCATA ACTGTGAGAC CACTGACTTA AACAGATCTT 4680 

TTGAGTAAAG TTTTCTTGTC CCGCTTCATG TCTCTTCCAG GTTCTTCACT TTGATCAAGT 4740 

CACAGAGAAC ACCACAGAAA AAGCTGCAAC ATATCATGTG AGTCACAGAG CACTCTGATT 4800 

CAGCTTTAGA TCCCTGAACA GGTCATAGTT TAAACCTGGA ACTTCACAAA AACTAAGAAA 4860 

AGGCCAGTTT TAGGGAAAAT CTTGGACACA- AAGATTGAGA CATACAGAGT GGGTTGGCAT 4920 

TTCATGGCAC ATAATTATTA TTCCTCATTT CTGCGTTACT AAAAGACAGT CAGCACTGTA- 4980 

CCTCAGAGCA TAGGTCTGGA- TCAGGATAGG CTGGGTTCAG ACTCCAGCTT TGCTCTTCAC 5040 

AAATGATGAA TAAGAGCAGG ACACAACTGC TCGGAGTCCC AGTGACCTCA TCCCAGAAAA 5100 

CTAAGGGTAA GAAAAAATCT GACTCAATAC ATGCAAATAC ATGCAAATGT TTACAACAGT 5160 

GCCTTGCCCA TAAAAGTCAT- AATAAATGTT. ATTATTATTA TAAAGTAGCT ATAATTATAC 5220 

TAATCATAAT AATGTGAAAA TAATTTAATT TTCATTGAGT CATTAATGAG ATTCAGAGGA 5280 

ATAAGCACAA GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG 5340 

CCTTAATAGT GCAAAATGCT- TTGCTGGAAG GTAGAAAGTT CTAGATTTAA ACAGGCTTAG. 5400 

GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA GGGTTTTTTT CCCCATTCTC 5460 

TGAGCTTTCT TGTGTTCATC TGAATTGAAC TAAAGACTTA GAGTTACCCA TGTAAAGTCC 5520 

TTAGCCATGG ACCTGGCATA CACTCTTCTT ACGTGCAGAG AATGACCATC ATGAGGAAAG 5580 

AGCCACAGAT CAGTCAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAGGGCTT 5640 

eCTGGCArAA TCTATTTAAA ATATCCAACC TTCAACATAC TCGTATCCTT GATGACTGTT 5700 

AGAAGTGAAA TATGGTCCTT. GCCCATAAGG AGCTGAGAGT TTAACTGGGA AGCTAAACCT 5760 

AACCCTTTAA ACCAACAAGG AGAAAATCTA CTGGTAGACA GCGCTGCATC TTTAGTTCAG 5820 

AAGAGAAAAG ■ ATTGCAGTAC- GTTAGAGCAA GAAGAATTTT CTGGAAGAAG TCAAATATAA 5880 

GGTGGATTTT GAAGGGTATT TGAGGTGAAA TACACCAATT ATCAGGGAAT AACATCAAAG 5940 

GTCCTCAATG AGACTACCAG. CATTTAGGGA CTGATCTAAC AGACTTAGCA TGGGTTTAGT 6000 

ATTTACATTG ATACAGCAAT TGAATGATCT CCTTTTTTGA TGTTTGAAGG TTGATAGGTC 6060 

- AGGAAATGTT CATCACCAGT TTCAAAAGCT TCTGACTGAA TTCAACAAAT CCACTGATGC 6120 

ATATGAGCTG AAGATCGCCA ACAAGCTCTT CGGAGAAAAG ACGTATCAAT TTTTACAGGT 6180 

AATTTGACCT GGCCTACCCA CATTTCATTT GCATCCTGAT GTCTGTGTCT CTGAGTGGCC 6240 

AAATGGAAGA AAGCAAGGCA" GATGAGCCTG. GCCGACCCAG GTGGAGAGCA TTTACTCAGA 6300 

GTGCATTAGC TCCATTTCCA CAACTCTCCC CCACTGGAGT GTCCCAGACC CCAACGATAC 6360 

ATCACTGAAG TGTGGATTTA GGGATAATCT TGTGATAAAA GAGGAGGTTG TGTAATAGAG 6420 

TGAGTAAGAG TAATAAGTAA TAAGATACCA TCGATAAACT GGCACTGACT CAGTCACATA 6480 

CGATACATCT TGGTGGGAAA TGTATGACTA ATGGGATATT ATTGGAATGG GCAGGCTTGG 6540 

GTGAGTTCCT GAGAATAGTT GAGGAAGTAC CAGGAAATAT TGAATGCACA GGATGAAAGA 6600 . 

CAAAAACAAA "gATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT GAGAAGCAAT 6660 

GAATCTCCTT CAGGGAAGCC TGCTCTGCAG TTTGCAAACC ACAGCCTCTT CTGCTTCTGC 6720 

CTTTTGCCAA GATGATATTG ACCTTCAGTG ACCTCTTTCT TGTGCCAGCC CACATTCCCC 6780 

TTTTGCATTG CCTACATGAC ACCTGTATAA AAATATCCAT GGACAGGAGA TACTGCATCT 6840 

ATTCAGGGTC TGGATTCAGC TTACTGTTGT TACAAATAAG TAAGTTTGGT AATATATAGT 6900 

TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC TCAAAGGAAT ATTTAGATGC 6960 
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CATCAAGAAA TTTTACCAGA ' CCAGTGTGGA ATCTACTGAT TTTGCAAATG 
AAGTCGAAAG AAGATTAACT CCTGGGTGGA AAGTCAAACG AATGGTAGGA 
TTATAGAAAC ACCTTTGAGA AACCTATGCC AGTGAGCCTT GTGCTTGACA 
GAACAGGTGT GGGGATTGAG ATGGGTTTGC AGGGAGGGCT GAAGAGGGCA 
5 AGGATTTGTC CAAATGAATA TGAAGAGAGC CTAGGGGAGC CAAGGAGGAA 
GCCAATTAGA" TGGAAACACA TCTGGAGAAT TATTTGCTTA TGGCCCTGCA 
TTTGTGGATC CCCTGTCTCC GCTCAGACCT ATTTTGAGAT CATATCCTTT 
AGACTCAAAT TTTTATGATG AATATTTAAT AGAAAACATT AGAAAGCGTC 
TTTACTAATT GGGAAACAAG CAGCTCTCTG GTAAATCACC CTTTTGTCTC 

10 CTGCCTGGAT CACATCTGTA GCCAATGTGT TCTGCAGGGA TTATCACAGC 
ATCAAGGGCA AAGAGCTTGA CAAAGTCTCC ATTCTACAGA CATCTTTCTT 
TCTCATTACA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAGGAA 
GAAGTAGTGT CTGACAGCAC AGGACATGCG TTTCATATTA- CAGAGCTCAA^ 
CTAAAATGCA ATCAGGGCCT CCTTCCTCTG AATGGGGACC CCGTAGTTAA'" 

15 AAGTAGGAAG AGGAGGGAGG GAGAAAGGAA- AGACACATGT- TGGAAGAGTA 
GTTTATCAGT ATTCCAAATC AGATGATTGG. AGACATTCAT ACACAGAGAA 
TTCTCTATCA- CAAGAAGTGA* TGTCTCCATC: AAGGGTAACT. TTATACGACT 
AGAAAGCTGC ATCTGGTGAA CCACTGGTCA' GTGAGTCTAA CAATTCAAAG 
GTGAGTCTCA AGCAGGGATT TGGGTCAATA ATTAACGATC AGTCACGAAC 

20 CATCTTCCAG ACAAGCCATT TGTAGCTTGT- GTAAAAGACT CTTTTATTCT 
GAAAAAATTA AAAACCTATT TCCTGATGGG ACTATTGGCA ATGATACGAC 
GTGAACGCAA TCTATTTCAA AGGGCAGTGG G AGAATAAAT • TTAAAAAAGA 
GAGGAAAAAT TTTGGCCAAA CAAGGTATTG TCTATATTTT ATTTATATAG 
TAATACATGG. AATGTTAAAC ATTTCTGATG GAATGTAACA- TGATAAGTAA 

25 TTGTTCATGT' CTGTTATTTT GTTGTTTTAC TCTTATAACT TTATTTAGTT 
GAAAAACTAT TGTTTCTAAC TCATGGAATT CCTGGGTTAT TTCTTAGAAG 
TGTTGCTATC TCAATAATAT . TATCTTTTTT ; GTCTTGTGTT TCACGTGTTA 
ACATTGATTT ATTGCAGAAT ACATACAAAT CTGTACAGAT GATGAGGCAA 
TTAATTTTGC CTTGCTGGAG GATGTACAGG . CCAAGGTCCT GGAAATACCA 

30 AAGATCTAAG CATGATTGTG CTGCTGCCAA ATGAAATCGA TGGTCTGCAG 
CTTGCATCTA CAACTCTTCC TTCTACTGCC GGACATTTTT CCAAAGATAC 
CAAGGTAAAA GCTTATGACC GAGTTGCCTC AAAATGATGA AAAATTCTAA 
ATGACTCACC TTCATATTAC AAATATTTGA GCATAGGGCC TGACACAAAC 
GTTTTTGTTT GTTTGTTTGT TTTTATTATT ATTATTATAA TACTTTAAGC 

35 ATGTGCACAA' TGTGCAGGTT AGTTACATAT GTATACATGT GCCATGCTGG- 
CCATTAACTC. ATCATTTAGC GTTAGGTATA TCTCCTAATG • CTATCCCTCC 
CACCCCACAA CAGTCCTCAG AGTGTGATGT TACCTTCCTG TGTCCAAGTG 
TCAATTCCCA TCTATGATTT AATTCCATCT ATGGCTTAGT TAATGATTAA 
TTACATGCAT ' TGGATATCAA TTTGATGATA TTATTATGCA GCAATTTAAA 

40 AGAAATATAT" ACCAATGTGA GGAAAGTTTA : CAAATAGGCC GAGTAGAAAA 
ATTTAGGAAT. TTAGGGAATT ACAATTTAAT AATTGCAATG TGT ACT AAAT 
. GAAAAATATG ATGAGCCTAT TAAAAATTGA ■ CACATGTAGT AGGCTGTTGG 
AGTGATACAT. ACAGTTCATT GTGTACAAAA TAATGTAATC ATATTTTACA 
ACAGTTGTAT ACATACATAT GTACACATAT ACATATACGT AAAAACATGA 

45 ACATACATGT ATATACATAT ACACATATAA CCCAATGTAT; TTATATATTC 
TTTTACCTAT TAGAATAATA ATGTCTATTA AAGTGAACCT TCTGTATTTC 
CCAAAATAAC . GAATCTCCAC . ATAGTCAATT CATTGTTAAG GTGTATTAGA 
TAGTCATATC AGTTTCTTTT TTCCATTTGT ATAGCTTGAA < GAGAAACTCA 
ATTGATGGAA- TGGACAAGTT TGCAGAATAT GAGAGAGACA TGTGTCGATT 

50 TCGGTTCAAA ATGGAAGAGA GCTATGACCT CAAGGACACG TTGAGAACCA 
GAATATCTTC AATGGGGATG CAGACCTCTC AGGCATGACC TGGAGCCACG 
ATCTAAAGTC CTACACAAGG CCTTTGTGGA GGTCACTGAG GAGGGAGTGG 
TGCCACCGCT GTAGTAGTAG TCGAATTATC ATCTCCTTCA ACTAATGAAG 
TAATCACCCT TTCCTATTCT TCATAAGGCA AAATAAGACC AACAGCATCC 

55 CAGATTCTCA TCCCCATAGA TGCAATTAGT- CTGTCACTCC ATTTAGAAAA 
GAGGTGTTCT GGTAAACTGA TTGCTGGCAA CAACAGATTC TCTTGGCTCA 
CTATCTCATC TTGATGATGA TAGTCATCAT CAAGAATTTA ATGATTAAAA 
TTCTCTCTTT CTCTTAATAA GCCCACATAT AAATGTACTT TTCCTTCCAG 
CTTGAGGAAA AATGTCCAAG ATAAGATGAA TCATTTAATA CCGTGTCTTC 

60 ATATAATTCT. GTTTCTGACC TGTTTTAAAT ' GAACCAAACC AAATCATACT 
ATTTAGCAAC CTAGAAACAC ACATTTCTTT GAATTTAGGT GATACCTAAA 
GTTTCTAAAT TTTGTGATTC' TATAAAACAC ATCATCAATA AAATAATGAC 
TTTTGCTTTA CCTGTTTTCT CTCTGGAAAG GGCAAGTGTC CAGTTACACA 
AATTTAGAGA .TATATTAATC ATATATAAAG GAAAATTAAA AACAGAGTAG 

65 GCCTGGAGTA GAAGGCATAT CCCAGAACAG GAGGAGCCTT GTAAACCACA 
CTATTTTATG CTAAAGGGAT AAGAAACTCA TTACAGGCTT TGATGGTTGT 
GGGGCATAAA ATTATCATAT CCA CAT CT AG . AAAATACATC TCTGGCTACG 
TGGATGCGAG GAAAGAACAG TGTGGTTACC ATATATAAAT TAGGAAATCA 
GGGAGTGGAA ATGGAGAGAA AGAAAGAGCC TGGGGGAATT ATTTAGGAAA 

70 AGAAAGACAT CTAAGTTGCT GACCTATCTG ACTGGATGGA TGGAAGAATA 
GAGAGAAAAA AAGACTTTGG GTTTAAATTT GTACTTGATG AATTAAGGTA 
TCAAATGGAT TTGCCTGGCA . GGCACTTGAA GATATTAGTC TAAATCTCAG 
TGATCTGAAG CTCTAAATTT GTGATATTCA " ATATAAATAC TTTAGAGTCA 
TATGGTAGTT GTAGCTAAAA GCAAAAATAA . GATACTAGGG AGAAAGGATA 

75 AAAGAAGAAT CTAGAATTGA CCTTGAAGTA TATCAG CATG TGTAAAGATC 
CATTTTTATT TTCCAGAAAG TAGCTTTTCT TAGGGTTCCA TATTTACTCC 
TCCC 




ACCTCCCACC 
GATACCCCCG 
GTCACTCATC 
AAAAAAATAA 
GACAAAATCA 
CGTGAACTCC 
GGAGCCTTGA 
ATCAAAGTCA 
ATTTGCAAAG 
TTCCCTTGCA 
ACTGGTTCTT 
AAACACTAAA 
TGTAATATGT 
AAAATAAAAA 
AGGAATACCT 
AAGAAGGATG 
TTTGTTGGAC 
TACAATTCCT 
TACAAAGGCA 
AAGGTAAGAA 
CAAGTTTAAA 
ATGAGGAATG 
TGAAAGCTTA 
TTTAGGGTAC 
TGTGCTGCAC 
CCCCTCCCCC 
TTCTCATTGT 
TTTATTAGAG 
CTTGACTGGG 
GGGAATACAA 
AATGTATACA 
CACAAGAAAT' 
TGTGTATCAT: 
TTCTGTTTTT.' 
AGGACTCATA 
ACATTTATTG 
GATCGACAGT 
CTGCTGAGAA 
TACACTTACC 
TGGGAATGGT 
GTCTCTCAGT 
AAGCTGCAGC 
AGTTCTGTTG 
TCTTCTATGG 
TGTTCACCTA 
TATTTCTTTT 
TAGCATGCCT 
AAAAATTTCC 
TAAATTTGAA 
TTCTCTTCAA 
TCCTTCTTAT 
ATAAAATCAT 
TAGGAAAGAT 
TTCATGATGA 
TAGGAACTTC 
TTGTCAAAGA 
CTGATATCAA 
TTAGAGTATT 
TAATAGTTAC 
TCTTGTTTCT 
CTTTTAATAT 
AAACAGAATA 
TTGGGATAAA 
AAGTTAGAAG 
AGGAATTGAT 
CATAGATTCT 
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Seq ID NO: 465 Protein sequence 
Protein Accession #: BAB21525.1 

1 ii - 21 31 41 51 
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MMSLSEANTK FMFDLFQQFR KSKENNIFYS" PISITSALGM VLLGAKDNTA QQISKVLHFP 
QVTENTTEKA ATYHVDRSGN VHHQFQKLJjT EFNKSTDAYE LKIANKLFGE KTYQFLQEYL 
DAIKKFYQTS VESTDFANAP EESRKKDJSW VESQTNEKIK NLFPDGTIGN DTTLVLVNAl 
YFKGQWENKF KKENTKEEKP WPNKNTYKSV QMMRQYNSFK FALLEDVQAK VLEIPYKGKD 
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KMIVLLPHS IDGLQKLEEK * LTAEKLMEWT sLqNMRETCV DLHLPRFKMB ESYDLKDTLR- • 300 

TMGMVNI FNG ' DADLSGMTW S HGLSVSKVbH KAFVEVTEEG VEAAAATAW WELSSPSTN 360 
EEFCCNHPFb FFIRQNKTNS ILFYGRFSSP 

Seq ID NO: 466 DMA sequence. 

Nucleic Acid Accession 8: NM_001910.1 

Coding sequence: 50. .1240. 

i V .li" " .21 : 31 4i f 1 

GGAGAGAAGA AAGGAGGGGG CAAGGGAQAA' GCTGCTGGTC GGACTCACAA TGAAAACGCT 60 

CCTTCTTTTG CTGCTGGTGC TCCTGGAGCT GGGAGAGGCC CAAGGATCCC TTCACAGGGT 120 

GCCCCTCAGG AGGCATCCGT CCCTCAAGAA GAAGCTGCGG ■ GCACGGAGCC AGCTCTCTGA 180 

GTTCTGGAAA TCCCATAATT TGGACATGAT CCAGTTCACC GAGTCCTGCT CAATGGACCA • 240 

GAGTGCCAAG GAACCCCTCA TCAACTACTT GGATATGGAA TACTTCGGCA CTATCTCCAT 300 

TGGCTCCCCA CCACAGAACT TCACTGTCAT CTTCGACACT GGCTCCTCCA ACCTCTGGGT 360 

CCCCTCTGTG TACTGCACTA GCCCAGCCTG CAAGACGCAC AGCAGGTTCC AGCCTTCCCA 420 

GTCCAGCACA TACAGCCAGC CAGGTCAATC • TTTCTCCATT: CAGTATGGAA CCGGGAGCTT 480 

GTCCGGGATC- ATTGGAGCCG ACCAAGTCTC TGTGGAAGGA CTAACCGTGG TTGGCCAGCA 540 

GTTTGGAGAA AGTGTCACAG AGCCAGGCCA GACCTTTGTG GATGCAGAGT TTGATGGAAT 600 

TCTGGGCCTG GGATACCCCT CCTTGGCTGT GGGAGGAGTG ACTCCAGTAT- TTGACAACAT . 660 

GATGGCTCAG AACCTGGTGG ACTTGCCGAT GTTTTCTGTC TACATGAGCA GTAACCCAGA 720 

AGGTGGTGCG- GGGAGCGAGC TGATTTTTGG" AGGCTACGAC CACTCCCATT TCTCTGGGAG 780 

CCTGAATTGG, GTCCCAGTCA CCAAGCAAGC TTACTGGCAG ; ATTGCACTGG ATAACATCCA 840 

GGTGGGAGGC ACTGTTATGT TCTGCTCCGA GGGCTGCCAG' GCCATTGTGG ACACAGGGAC 900 

TTCCCTCATC ACTGGCCCTT COGACAAGAT TAAGCAGCTG " CAAAACGCCA TTGGGGCAGC 960 

CCCCGTGGAT GGAGAATATG CTGTGGAGTG TGCCAACCTT AACGTCATGC CGGATGTCAC 1020 

CTTCACCATT AACGGAGTCC CCTAfACCCT CAGCCCAACT, GCCTACXCCC TACTGGACTT 1080 

CGTGGATGGA ATGCAGTTCT GCAGCAGTGG CTTTCAAGGA CTTGACATCC ACCCTCCAGC 1140 

TGGGCCCCTC TGGATCCTGG GGGATGTCTT CATTCGACAG TTTTACTCAG TCTTTGACCG ■ 1200 

TGGGAATAAC CGTGTGGGAC' TGGCCCCAGC AGTCCCCTAA GGAGGGGCCT, TGTGTCTGTG 1260 

CCTGCCTGTC TGACAGACCT TGAATATGTT AGGCTGGGGC ATTCTTTACA- CCTACAAAAA-. 1320 

GTTATTTTCC AGAGAATGTA GCTGTTTCCA GGGTTGCAAC TTGAATTAAG ACCAAACAGA 1380 

ACATGAGAAT ACACACACAC ACACACATAT. ACACACACAC ACACTTCACA CATACACACC 1440 

ACTCCCACCA CCGTCATGAT GGAGGAATTA CGTTATACAT TCATATTTTG TATTGATTTT 1500- 

TGATTATGAA AATCAAAAAT TTTCACATTT-* GATTATGAAA ATCTCCAAAC ATATGCACAA • 1560 

GCAGAGATCA - TGGTATAATA AATCCCTTTG, CAACTCCACT CAGCCCTGAC; AACCCATCCA 1620 

CACACGGCCA GGCCTGTTTA- TCTACACTGC TGCCCACTCC TCTCTCCAGC TCCACATGCT 1680 

GTACCTGGAT CATTCTGAAG ; CAAATTCCGA" GCATTACATC ATTTTGTCCA : TAAATATTTC 1740 

TAACATCCTT' AAATATACAA' TCGGAATTCA - AGCATCTCCC ATTGTCCCAC AAATGTTTGG . 1800 

CTGTTTTTGT AGTTGGATTG TTTGTATTAG' GATTCAAGCA AGGCCCATAT- ATTGCATTTA 1860 

TTTGAAATGT CTGTAAGTCT CTTTCCATCT ACAGAGTTTA GCACATTTGA' ACGTTGCTGG .1920 

TTGAAATCCC GAGGTGTCAT TTGACATGGT TCTCTGAACT TATCTTTCCT ATAAAATGGT 1980 

AGTTAGATCT GGAGGTCTGA TTTTGTGGCA AAAATACTTC CTAGGTGGTG. CTGGGTACTT 2040 

CTTGTTGCAT CCTGTCAGGA GGCAGATAAT. GCTGGTGCCT CTCTATTGGT AATGTTAAGA 2100 
CTGCTGGGTG GGTTTGGAGT TCTTGGCTTT AATCATTCAT TACAAAGTTC AGCATTTT 

Seq ID NO:. 467 Protein \ sequence • 
Protein Accession #: NP_001901.1 

1 U 21 31 41 51 

MKTLLLLLLV LLHLGEAQGS LHRVPLRRHP SLKKKLRARS QLSEPWKSHN LDMIQFTESC 60 

SMDQSAKEPL INYLDMEYFG TISIGSPPQN FTVIFDTGSS- NLWVPSVYCT SPACKTHSRF 120 

QPSQSSTYSQ PGQSFSIQYG" TGSLSGIIGA. DQVSVEGLTV VGQQFGESVT EPGQTFVDAE 180 

FDGILGLGYP SLAVGGVTPV FDNMMAQNLV DliPMFSVYMS SNPEGGAGSE LIFGGYDHSH ; 240 

PSGSLNWVPV TKQAYWQIAI* DNIQVGGTVM- FCSEGCQAIV DTGTSLITGP SDKIKQLQNA 300 

IGAAPVDGEY AVECANLNVM PDVTFTINGV PYTLSPTAYT LIiDFVDGMQF CSSGPQGLDI 360 
HPPAGPLWIL GDVFIRQFYS VFDSGNNRVG . LAPAVP 

Seq ID NO: .468 DNA sequence 

Nucleic Acid Accession #: NM_01B058.1 

Coding sequence: 319.. 1575 

1 > 11 .- 21 31 J 1 j 1 

TACGCGCTGC GGGACCGGCA' GGGGAACGCC ATCGGGGTCA CAGCCTGCGA CATCGACGGG 60 

GACGGCCGGG AGGAGATCTA CTTCCTCAAC ACCAATAATG CCTTCTCGGG GGTGGCCACG 120 

TACACCGACA AGTTGTTCAA GTTCCGCAAT . AACCGGTGGG AAGACATCCT GAGCGATGAG 180 

GTCAACGTGG CCCGTGGTGT GGCCAGCCTC TTTGCCGGAC GCTCTGTGGC CTGTGTGGAC 240 

AGAAAGGGCT- CTGGACGCTA CTCTATCTAC ATTGCCAATT ACGCCTACGG TAATGTGGGC 300 

CCTGATGCCC TCATTGAAAT GGACCCTGAG "GCCAGTGACC TCTCCCGGGG CATTCTGGCG 360 

CTCAGAGATG TGGCTGCTGA GGCTGGGGTC AGCAAATATA CAGGGGGCCG AGGCGTCAGC 420 

GTGGGCCCCA TCCTCAGCAG .CAGTGCCTCG GATATCTTCT GCGACAATGA GAATGGGCCT 480 

AACTTCCTTT TCCACAACCG GGGCGATGGC ACCTTTGTGG ACGCTGCGGC CAGTGCTGGT 540 

GTGGACGACC CCCACCAGCA TGGGCGAGGT GTCGCCCTGG CTGACTTCAA CCGTGATGGC 600 

AAAGTGGACA TCGTCTATGG CAACTGGAAT GGCCCCCACC GCCTCTATCT GCAAATGAGC 660 

ACCCATGGGA AGGTCCGCTT CCGGGACATC GCCTCACCCA AGTTCTCCAT GCCCTCCCCT 720 

GTCCGCACGG TCATCACCGC CGACTTTGAC . AATGACCAGG AGCTGGAGAT CTTCTTCAAC 780 

AACATTGCCT ACCGCAGCTC CTCAGCCAAC CGCCTCTTCC " GCGTCATCCG TAGAGAGCAC 840 

GGAGACCCCC TCATCGAGGA GCTCAATCCC GGCGACGCCT TGGAGCCTGA GGGCCGGGGC 900 

ACAGGGGGTG TGGTGACCGA CTTCGACGGA GACGGGATGC TGGACCTCAT CTTGTCCCAT 960 

GGAGAGTCCA TGGCTCAGCC GCTGTCCGTC TTCCGGGGCA' ; ATCAGGGCTT CAACAACAAC 1020 

TGGCTGCGAG TGGTGCCACG .CACCCGGGTT GGGGCCTTTG CCAGGGGAGC TAAGGTCGTG 1080 

CTCTACACCA AGAAGAGTGG GGCCCACCTG AGGATCATCG ACGGGGGCTC AGGCTACCTG 1140 

TGTGAGATGG AGCCCGTGGC ACACTTTGGC CTGGGGAAGG ATGAAGCCAG CAGTGTGGAG 1200 

GTGACGTGGC CAGATGGCAA GATGGTGAGC CGGAACGTGG CCAGCGGGGA GATGAACTCA 1260 
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■ ■ GTGCTGGAGA TCCTCTACCC CCGGGATGAG GACACACTTC AGGACCCAGC CCCACTGGAG 1320 • 

ACACCAATGA ATGCATCCAG TTCCCATTCG TGTGCCCTCG AGACAAGCCC GTATGTGTCA 1360 

ACACCTATGG AAGCTACAGG TGCCGGACCA ACAAGAAGTG CAGTCGGGGC TACGAGCCCA 1440 

ACGAGGATGG CACAGCCTGC GTGGGGACTC TCGGCCAGTC ACCGGGCCCC CGCCCCACCA 1500 

5 CCCCCACCGC TGCTGCTGCC ACTGCCGCTG CTGCTGCCGC TGCTGGAGCT GCCACTGCTG 1560 

CACCGGTCCT CGTAGATGGA GATCTCAATC TGGGGTCGGT GGTTAAGGAG AGCTGCGAGC 1620- 

CCAGCTGCTG AGCAGGGGTG GGACATGAAC CAGCGGATGG AGTCCAGCAG GGGAGTGGGA 1680 

AAGTGGGCTT GTGCTGCTGC CTAGACAGTA GGGATGTAAA GGCCTGGGAG CTAGACCCTC 1740 

CCCAAGCCCA TCCATGCACA TTACTTAGCT AACAATTAGG GAGACTCGTA AGGCCAGGCC. 1800 

10 CTGTGCTGGG . CACATAGCTG TGATCACAGC AGACAGGGTC GCTGCCCTGA TGGCGCTTAC 1860 

ATTCCAGTGG GTCTAATGAC CATATCTTAG GACACAGATG. TGCCCAGGGA GGTGGTGTCA. 1920 

CTGCACAGGA AGTATGAGGA CTTTAGTGTC CTGAGTTCAA ATCCTGATTC AGGAACTCAC 1980 

AAAGCTATGT GACCTTACAC CAGTCACTTA ACTTGTTAGC CATCCATTAT CGCATCTGCA.V 2040 

AAATGGGGAT TAAGAATAGA ATCTTGGGGT TAGTGTGGAG. ATT AG ATT AA ATGTATGTAA 2100 

1 5 G ACACTTGGC f ACAAAACCTG " . GCACATAGTA" AAGGCTCAAT AAAAACAAGT GCCTCTCACT 2160 
GGGCTTTGTC "AACACGTG . 



20 



30 



Seq. ID NO: 469 Protein sequence 
Protein Accession #:NP_0 6052 8-1 



i ,'. . ii 21 3i |i r 

mdpeasdlsr: gilalrdvaa eagvskytgg RGVSVGPILS SSASDIFCDN ENGPNFLFHN 60 

RGDGTFVDAA ASAGVDDPHQ HGRGVALADF NRDGKVDTVY GNWNGPHRLY- LQMSTHGKVR 120 

25 FRDIASPKFS MPSPVRTVIT. ADFDNDQELE IFFNNIAYRS SSANRLFRVI RREHGDPME 180 

ELNPGDALEP EGRGTGGWT- DFDGDGMLDL ILSHGESKAQ PLSVFRGNQG FNNNWLRWP 240 

RTRVGAFARG AKWLYTKkS GAHLRIIDGG SGYTiCEMEFV- AHFGLGKDEA- SSVEVTWPDG 3 00 

KMVSRNVASG EMNSVLEILY PRDEDTLQDP APLETPKNAS SSKSCALETS PYVSTPMEAT 360 
GAGPTRSAVG ATSPTRMAQP AWGLSASHRA PAPPPPPLLL PLPLLLPLLE LPLLHRSS 



Seq ID'NO: "470 DNA '. sequence 
Nucleic Acid Accession' #s AJ279016 
Coding sequence* 1..1962 



35 1 r 11 21 . 31 41 51 . 

ATGTCCAGGA TGTTACCGTT CCTGCTGCTG CTCTGGTTTC TGCCCATCAC TGAGGGGTCC 60 

CAGCGGGCTG AACCCATGTT CACTGCAGTC' ACCAACTCAG TTCTGCCTCC TGACTATGAC 120 

AGTAATCCCA CCCAGCTCAA CTATGGTGTG. GCAGTTACTG ATGTGGACCA TGATGGGGAC - , . 180 

40 TTTGAGATCG" TCGTGGCGGG GTACAATGGA CCCAACCTGG TTCTGAAGTA TGACCGGGCC 240 

CAGAAGCGGC TGGTGAACAT CG CGGTCGAT GAGCGCAGCT CACCCTACTA CGCGCTGCGG; 300 

GACCGGCAGG GGAACGCCAT CGGGGTCACA GCCTGCGACA TCGACGGGGA CGGCCGGGAG 360 

GAGATCTACT TCCTCAACAC CAATAATGCC TTCTCGGGGG TGGCCACGTA CACCGACAAG 420 

TTGTTCAAGT TCCGCAATAA- CCGGTGGGAA GACATCCTGA GCGATGAGGT CAACGTGGCC 480 

45 CGTGGTGTGG CCAGCCTCTT TGCCGGACGC TCTGTGGCCT GTGTGGACAG AAAGGGCTCT 540 

GGACGCTACT CTATCTACAT TGCCAATTAC GCCTACGGTA ATGTGGGCCC TGATGCCCTC 600 

ATTGAAATGG ACCCTGAGGC CAGTGACCTC TCCCGGGGCA TTCTGGCGCT- CAGAGATGTG . 660 

GCTGCTGAGG CTGGGGTCAG CAAATATACA GGGGGCCGAG GCGTCAGCGT GGGCCCCATC 720 

CTCAGCAGCA GTGCCTCGGA TATCTTCTGC GACAATGAGA ATGGGCCTAA CTTCCTTTTC 780 

50 CACAACCGGG GCGATGGCAC CTTTGTGGAC GCTGCGGCCA GTGCTGGTGT GGACGACCCC 84 0 

CACCAGCATG GGCGAGGTGT CGCCCTGGCT GACTTCAACC GTGATGGCAA AGTGGACATC 900 

GTCTA1GGCA ACTGGAATGG CCCCCACCGC. CTCTATCTGC AAATGAGCAC CCATGGGAAG 960 

GTCCGCTTCC GGGACATCGC CTCACCCAAG: TTCTCCATGC CCTCCCCTGT CCGCACGGTC 1020 

ATCACCGCCG ACTTTGACAA TGACCAGGAG CTGGAGATCT TCTTCAACAA CATTGCCTAC 1080 

5 5 CGCAGCTCCT ■ CAGCCAACCG CCTCTTCCGC GTCATCCGTA GAGAGCACGG AGACCCCCTC 1140 

ATCGAGGAGC TCAATCCCGG. CGACGCCTTG GAGCCTGAGG GCCGGGGCAC AGGGGGTGTG 1200 

GTGACCGACT TCGACGGAGA CGGGATGCTG GACCTCATCT TGTCCCATGG AGAGTCCATG 1260 

' GCTCAGCCGC TGTCCGTCTT CCGGGGCAAT CAGGGCTTCA ACAACAACTG GCTGCGAGTG 1320 

GTGCCACGCA CCCGGTTTGG" GGCCTTTGCC AGGGGAGCTA- AGGTCGTGCT- CTACACCAAG 1380 

60 AAGAGTGGGG CCCACCTGAG GATCATCGAC GGGGGCTCAG GCTACCTGTG TGAGATGGAG 1440 

CCCGTGGCAC ACTTTGGCCT GGGGAAGGAT GAAGCCAGCA" GTGTGGAGGT GACGTGGCCA 1500 

GATGGCAAGA TGGTGAGCCG GAACGTGGCC AGCGGGGAGA TGAACTCAGT GCTGGAGATC 1S60 

CTCTACCCCC GGGATGAGGA CACACTTCAG GACCCAGCCC CACTGGAGTG TGGCCAAGGA 1620 

TTCTCCCAGC AGGAAAATGG CCATTGCATG GACACCAATG AATGCATCCA GTTCCCATTC 1680 

65 GTGTGCCCTC GAGACAAGCC CGTATGTGTC AACACCTATG GAAGCTACAG GTGCCGGACC 1740 

AACAAGAAGT .GCAGTCGGGG CTACGAGCCC AACGAGGATG GCACAGCCTG CGTGGGGACT 1800 

CTCGGCCAGT CACCGGGCCC CCGCCCCACC ACCCCCACCG CTGCTGCTGC CACTGCCGCT 1860 

GCTGCTGCCG CTGCTGGAGC TGCCACTGCT GCACCGGTCC TCGTAGATGG AGATCTCAAT 192 0 

CTGGGGTCGG TGGTTAAGGA GAGCTGCGAG CCCAGCTGCT GAGCAGGGGT GGGACATGAA 1980 

70 CCAGCGGATG GAGTCCAGCA GGGGAGTGGG AAAGTGGGCT. TGTGCTGCTG CCTAGACAGT 204 0 
AGGGATGTAA AGGCCTGGGA GCTAGACCCT. CCCCAAGCCC ATCCATGCAC ATTACTTAGC .2100 

TAACAATTAG GGAGACTCGT AAGGCCAGGC CCTGTGCTGG GCACATAGCT GTGATCACAG 2160 

CAGACAGGGT CGCTGCCCTG ATGGCGCTTA CATTCCAGTG GGTCTAATGA CCATATCTTA 222 0 

GGACACAGAT GTGCCCAGGG AGGTGGTGTC ACTGCACAGG AAGTATGAGG ACTTTAGTGT 2280 

75 CCTGAGTTCA AATCCTGATT CAGGAACTCA CAAAGCTATG TGACCTTACA CCAGTCACTT 2340 

" AACTTGTTAG CCATCCATTA TCGCATCTGC AAAATGGGGA TTAAGAATAG AATCTTGGGG 2400 

'.TTAGTGTGGA GATTAGATTA AATGTATGTA AGACACTTGG CACAAAACCT GGCACATAGT 2460 
AAAGGCTCAA tAAAAACAAG TGCCTCTCAC TGGGCTTTGT CAACACG 

80 Seq ID NO: 471 Protein sequence 
Protein Accession #: CAC08451 

. "i ; - '11. . ;2i . .31 |i si 

85 MSRMLPFLLL LWFLPITEGS QRAEPMFTAV TNSVLPPDYD SNPTQLNYGV AVTDVDHDGD • 60 

FEIWAGYNG I PNLVLKYDRA' QKRLVNIAVD ERSSPYYALR DRQGNAIGVT ACDIDGDGRE 120 
■ EIYFLSTNNA FSGVATYTDK LFKFRNNRWE DILSDEVNVA RGVASLFAGR SVACVDRKGS • 180. 
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GRYSIYIANY AYGKVGPDAli XEMDPEASDIi SRGILAIiRDV AAEAGVSXYT GGRGVSVGPI 240 < 

LSSSASDIFC DNENGFNFLP HNRGDGTFVD AAASAGVDDP HQHGRGVALA DFNRDGKVDI 300 

VYGNWNGPHR LYLQMSTHGX VRFRDIASPK FSMPSPVRTV ITADrDNDQE LSIFFNNIAY 360 

RSSSANRLFR VIRREHGDPL IBELNPGDAL EPEGRGTGGV VTDFDGDGML DLILSHGESM 420. 

AQPLSVFRGN QGFNNNWLRV. VPRTRFGAFA RGAKWLYTX KSGAHLRIID GGSGYLCEME 480 

PVAHFGLGKD EASSVEVTWP DGKMVSRNVA SGEMNSVLBI LYPRDEDTLQ DPAPLEGGQG 540 

FSQQENGHCM DTNECIQFPF VCPRDKPVCV NTYGSYRCRT NKKCSRGYEP NEDGTACVGT 600 
LGQSPGPRPT TPTAAAATAA AAAAAGAATA 'APVLVDGDLN LGSWKESCE PSC 

Seq ID NO: 472 DNA sequence 
Nucleic Acid Accession 8: FGENESHH 
Coding sequence: 1.. 479*4 

1 11 ' 21 31 41 51 

ATGGCGTGTC CGGGAGGACT CCCAGCCCGT TGCTCTGGTT GGATGGGACT GGGTGGGCCC 60 

AGCGGCTCCT CCCCAGCATC CCCTCCCCAT.- TCCTCCTCCA GGTACAATGG ACCCAACCTG 120 

GTTCTGAAGT- ATGACCGGGC CCAGAAGCGG CTGGTGAACA TCGCGGTCGA" TGAGCGCAGC 180 

TCACCCTACT ACGCGCTGCG GGACCGGCAG GGGAACGCCA TCGGGGTCAC AGCCTGCGAC 240 

ATCGACGGGG ACGGCCGGGA GGAGATCTAC TTCCTCAACA CCAATAATGC CTTCTCGGGC 300 

CACAGCAGGT CAGCGCAGGT CCCTTCTGGG CTCCACAGAA ACAGGCCTGT GCTGAAGCCT 360 

CCACCTACAA CCCCTGCAGG CCTCCTGGGT CTGCCTCCAC TCAGCGGAAG GGACTTTTCC 420 

TCCTCCCTGG GTCAGGCTTC" TCCGGACAGC AGGCAGGGAG. AGAGGGTGCC GGTTCCCTGC 480 

TGTCGGGGTG GACTGAGACC' TACCCATGAA CCAGAACCAT TTCTTCTGAG' ACCCAAATCA 540 

GGGGTGGCCA CGTACACCGA CAAGTTGTTC AAGTTCCGCA ATAACCGGTG GGAAGACATC 600 
CTG AG CG A TG AGGTCAACGT GGCCCGTGGT GTGGCCAGCC TCTTTGCCGG ACGCTCTGTG . 660 

GCCTGTGTGG ACAG AAAGGG . CTCTGGACGC TACTCTATCT ACATTGCCAA TTACGCCTAC . 720 

GGTAATGTGG GCCCTGATGC CCTCATTGAA' ATGGACCCTG AGGCCAGTGA CCTCTCCCGG 780 

GGCATTCTGG CGCTCAGAGA TGTGGCTGCT GAGGCTGGGG TCAGCAAATA TACAGAAGGC 840 

TTCTCCCACA CTGCCTCTCC' AAGCATTGGT GAGATATCTG" GCAGAACCGA GGAGCGGGAA 900 

GGAGGAGACC CAGAGGAGGC AGATGAGGAG.. CACAGTGGGG- ATGGAAGCAC CAGCCAACTG 960 

TGCCGGCTGG' GCTGGAAGGA CGGGCAGTTC AAGGAAGAAG CAGCAGCTTT- GGTGGAGGAA 1020 

CAGAGGGAGG CTGGGGCAGC TGGCGTGCCC AGAGGACGTG TTCGAACAGC TCTGCAGACT 1080 

TCCAAAAGCC ATTTGGCTGff; CAAGAACCTA TTTGGCCCAC CATGTTACTA TTCTGTCTGC 1140 

GCGCCTTCTC CAGCCCACCC. TTTCCCTGCC CGCCAAGCCC CCCAACACTA CCCTGTAGCC 1200 

CCCCTTGTCA- CTCAGCTAAT GACACATGGA CGTCTGGCTG GAAAACTAGC" CCGGAGTGTC 12 60 

CCCCACCCCC GAGCCCCAGG AATGGACCCC AAATGTAAGG GCCGCCATGC TGAGCCCGGC : 1320 

CTGATCGCTG AGGCTTTGGG CGCGTGGCCA GCGCTCAGCA CCACTGTGGT. GCCAGGGGGC. 1380 

CTGAGAAGCT GGGAGGAAAG: CAGGCAGAAG GGGCAGGCCA TGTCCAGATG TGCACTCAGG ' 1440 

GAGCTGGGAG" GTCCCTGGAG'- CCAAGCCACA CAGCACCTGC CTGCTAGAGA GCTGTATGAC .1500 

CTGGGAGAAC CT CCCATT TT ACAAAGAAGA . GACGGAGATC CAGGGAGGAG - AAGGGACTCG 1560 

CCCAAGGTCA CACAGGAGTG CCATCTAGTG GCCACCATGC CAGCTCTCGG GGGACTCGAG' 1620 

GGCCCCGGGA GGGTGGCCAA' GCGAGAGATT GGGAGAGAGA CTGGGGCAGT AGGAAGACCA IS 80 

CTCTCCCATC CCCTGGTCCC. CAACTTCCCC AGCTGCTTGA GGCCTCTTGA AGCCGGGACA 1740 

GTGCCGGGAG- CTGCCCTGCC: TGGGAATCCT GGGAACTGGG TTCTGGACAT GGCCAAGGCC 1800 

CTGGCGTGGA- ACCAGATGGA- AAAAGAGGAG GGGAAGATTC ATGGAGACCA TGAGCCCAGA 1860 

TTTAGGCTCA GGAAAGCACG GGAAGCAGAA TTCCCCCCAG GCTCCTCTGA GGAGCCTCTG 1920 

CTGCAGTTCC CCTCAGGCCT CAGAGGCAGC CCTGTCCTCC AGGTGGGCCT GGGGCTTGCT 1980 

TCTGCCACTC ACTGTGGGTC GATGTCTTTT CTAGGGGGCC GAGGCGTCAG CGTGGGCCCC 2040 

ATCCTCAGCA* GCAGTGCCTC GGATATCTTC TGCGACAATG AGAATGGGCC TAACTTCCTT 2100 

TTCCACAACC GGGGCGATGG CACCTTTGTG GACGCTGCGG- CCAGTGCTGA ACGTCGTTTA 2160 

GCCTTCATCG TTCACCTCAA ATATCACCTC TGCAGAGATT TTCCTCACTC CCTGTGCCAC 2220 

CTAGCAGAAA CTGGTCCTTC CTCCTCCTGC ' TGCCCGTGGC ATGCACGTCT TCTTCAGGCT 2280 

CCACATTGCC ATCATGGTTT- GTCTATGAGC TTTACAAGGA CCGGGTCACG GTTCTATTCA 2340 

TTCTTGACGC AAGGCTTGGC CTCCAGTGCC CACCGGAGGA CACTCAGCCT CCAGGGTTCT, 2400 

CAGGGGGCCC CACCCTGCCT TCTGGCAAGA GCTCCCTGTG TCCTGGGGTC TCTGATCCCC 2460 

ACTGCCTATT ACATTGTCCT GTGGTCTGCC ATCCCAGAGA. GCCTGATGAC CCACAGCTAT 2520 

TTGTCCTCTG AAAGAGTCAA CGTGGGTGTG GACGACCCCC ACCAGCATGG GCGAGGTGTC 2580 

GCCCTGGCTG ACTTCAACCG TGATGGCAAA- GTGGACATCG TCTATGGCAA CTGGAATGGC 2640 

CCCCACCCCC TCTATCTGCA AATGAGCACC CATGGGAAGG TCCGCTTCCG GGACATCGCC 2700 

* TCACCCAAGT TCTCCATGCC CTCCCCTGTC ' CGCACGGTCA TCACCGCCGA . CTTTGACAAT 2760 

GACCAGGAGC TGGAGATCTT CTTCAACAAC ATTGCCTACC GCAGCTCCTC AGCCAACCGC 2820 

CTCTTCCGAT GCTCCATCCT GGCTCGTGGC ' TCTTCATCCT TGACAGCTGG TGGGAGGAAC 2880 

GGTCAGGGAG AAGGTTTAAG AATCAGAAGG GGAGGGTTCC CAGGGCCAGG GGGTCAGGCC 2 940 

AAGGTCAACA CAGGTCCCCT GATGAAGAAA CAGAAAGGAA GGAAGGACGA GGACTGGGCA 3000 

AGAGGCTGTG GGAATGCAGG GCAAAGCCTG GCCAAGGAGC CGGCCTCTGC TATTGCAGGG 3060 

AAAGGGAAGG GAAATGTGGC CCAAAGTGTG. CCCAGAACCC AAGCGCCACA AGATACAAAG 3120 

CCACACTACC ACAAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG 3180 

GTCCAATCAC TACCAGGAAA- AGGGGCTACG GGGTCCAATC ACTACCAGGA AAAGGGGCTA 3240 

CGGGGTCCAA TCACTACCAG GAAAAGGGGC' TACGGGGTCC AATCACTACC AGGAAAAGGG 3300 

GCTACGGGCT CCAATCACTA CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA 33 60 

AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA CAGGGTCCAA TCACTACCAC 3420 

AGAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGGG GCTACGGGGT- CCAATCACTA 3480 

CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC 3540 

ACTACCAGGA AAAGGGGCTA CGGGCTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC 3600 

AATCACTACC AGGAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACAGG 3660 

GTCCAATCAC TACCACAGAA AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA 3720 

CGGGGTCCAA- TCACTACCAG GAAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGAG 3780 

GCTATGGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG GTCCAATCAC TACCAGGAAA 3840 

AGGGGCTATG GGGTCCAATC ACTACCACAG AAAGGGGCTA CGGGGTCCAA CGTCATCCGT 3900 

AGAGAGCACG GAGACCCCCT CATCGAGGAG CTCAATCCCG GCGACGCCTT GGAGCCTGAG 3 960 

GGCCGGGGCA CAGGGGGTGT GGTGACCGAC TTCGACGGAG ACGGGATGCT GGACCTCATC 4020 

TTGTCCCATG GAGAGTCCAT GGCTCAGCCG CTGTCCGTCT TCCGGGGCAA TCAGGGCTTC 4080 

AACAACAACT GGCTGCGAGT GGTGCCACGC ACCCGGTTTG GGGCCTTTGC CAGGGGAGCT 4140 

AAGGTCGTGC TCTACACCAA GAAGAGTGGG GCCCACCTGA GGATCATCGA CGGGGGCTCA 4200 

GGCTACCTGT GTGAGATGGA GCCCGTGGCA CACTTTGGCC TGGGGAAGGA TGAAGCCAGC 4260 

AGTGTGGAGG TGACGTGGCC AGATGGCAAG ATGGTGAGCC GGAACGTGGC CAGCGGGGAG 4320 
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ATCAACTC^ TCCTCGAGAT CCTCTACCCC CGGGATGAGG ACACACTTCA GGACCCAGCC .4380 

CCACTGGAGT GTGGCCAAGG ATTCTCCCAG CAGGAAAATG GCCATTGCAT GGACACCAAT 4440 

GAATGCATCC AGTTCCCATT CGTGTGCCCT CGAGACAAGC CCGTATGTGT CAACACCTAT 4500 

GGAAGCTACA GGTGCCGGAC CAACAAGAAG TGCAGTCGGG GCTACGAGCC CAACGAGGAT 4560 

GGCACAGCCT GCGTGGGTAC TGAGCTAGGC TCTAGGCATA CAATGACGTG GAAACC AAGG 4620 

CCCAAAAAGG AGCTGCAACT TTCCCAAGGC ATCTGCACCC CCGTCTGGTC CTTTTTCCTG 4680 . 

CCGGGTTGCC GGCTGCTCCT CAAAAGAGCT CAGCTCCAGG CTGCTCCCAG ' CACCCTTCTC 4740 
CAGAAAGCTC CAGGTATTCC AGAAGCCCAA GTGTATGAAC AAGATCAGGA ATAA * 

Seq ID NO: 473 Protein sequence - 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 "51 

MACPGGLPAR CSGWMGLGGP SGSSPASPPH SSSRYNGPNL VLKYDRAQKR LVNIAVDERS 60 

SPYYAKRDRQ GNAIGVTACD- IDGDGREEIY FLNTNNAFSG HSSSAQVPSG LHRNRPVLKP 120 

PPTTPAGLLG LPPLSGRDF5- SSLGQASPDS RQGERVPVPC" CRGGLRPTHE PEPFLLRPKS 180 

GVATYTDKLF KFRNNRWEDI LSDEVNVARG VASLFAGRSV ACVDRKGSGR YSIYIANYAY 240 

GNVGPDALIE MDPEASDLSR GILALRDVAA EAGVSKYTEG- FSKTASPSIG EISGRTEERE 300 

GGDPEEADEE HSGDGSTSQL CRLGWKDGQF KEEAAALVEE QREAGAAGVP RGRVRTALQT 360 

SKSHLADKNL FGPPCYYSVC. APSPAHPFPA RQAPQHYPVA PLVTQLMTHG RLAGKLARSV 420 

PHPRAPGMDP' KCKGRHAEPG LMAEALGAWP ALSTTWPGG LRSWEESRQK- GQAMSRCALR 480 

ELGGPWSQAT QHLPARELYD LGEPPILQRT DGDPGRRRDS PKVTQECHLV ATMPALGGtiE 540 

GPGRVAKREI GRETGAVGRP- LSHPLVPNFP SCLRPLEAGT- VPGAALPGNP GNWVLDMAKA 600 

LAWNQMEKEE GXTHGDHEPR FRLRKAREAE PPPGSSEEPL LQFPSGLRGS PVLQVGLGLA 660 

SATHCGSMSF LGGRGVSVGP ILSSSASDIF CDNENGPNFI* FHNRGDGTFV. DAAASAERRL-. 720 

AFIVELKYHL' CRDFPHSLCH IiAETGPSSSC CPWHARLLQA PHCHHGLSMS FTRTGSRFYS • 7B0 

FLTQGLASSA HRRTbSLQGS GGAPPCLLAR APCVLGSLIP TAYYIVLWSA IPESLMTHSY 840 

LSSERVNVGV- DDPHQHGRGV ALADFNRDGK VDIVYGNWNG PHRI»YLQMST HGKVRFRDIA 900 

SPKFSMPSPV RTVITADFDN DQELEIFFNN IAYRSSSANR- LFRCSILARG SSSLTAGGRN 960 

GQGEGLRIRR GGFPGPGGQA KVNTGPLMKK QKGRKDEDWA RGCGNAGQSL AKEPASAIAG 102O 

KGKGNVAQSV PRTQAPQDTK PHYHKKGLQG. PITTRKRGYG- VQSLPGKGAT GSNHYQEKGL 1080 

RGPITTRKRG YGVQSLPGKG- ATGSNHYQEK GLQGPITTRK RGYGtiQSLPG. KGATGSNHYH 1140 

RKGLRAPITT RKRGYGVQSL FGKGATGSNH- YQEKGLRGPI TTRKRGYGLQ: SLPGKGATGS - 1200 

NHYQEKGLQG PITTRKRGYR VQSLPQKGAT GSNHYQEKGL RGPITTRKRG YGLQSLPGKE 1260 

AMGSNHYQEK GLRAPITTRK RGYGVQSLPQ. KGATGSNVIR REHGDPLIEE LNPGDALEPE 1320 

GRGTGGWTD FDGDGMLDLI LSHGESMAQP LSVFRGNQGF NNNWLRWPR TRFGAFARGA 1380 

KWLYTKKSG AHLRIIDGGS GYLCEMEPVA HFGLGKDEAS. SVEVTWPDGK MVSRNVASGE 1440 

MNSVLEILYP' RDEDTLODPA PLECGQGFSQ. QENGHCMDTN ECIQFPFVCP RDKPVCVNTY 1500 

GSYRCRTNKK CSRGYEPNED GTACVGTELG SRHTMTWKPR PKKELQLSQG ICTPVWSFFIi 1560 
PGCRLLLKRA QLQAAPSTLL QKAPGIPEAQ VYEQDQE 

Seq ID NO: 474 DNA sequence 

Nucleic Acid Accession NM_003661.1 

Coding sequence: 1..1152 

1 11 .21-31 41 SI 

ATGAGTGCAC TTTTCCTTGG TGTGGGAGTG AGGGCAGAGG AAGCTGGAGC GAGGGTGCAA 60 

CAAAACGTTC CAAGTGGGAC AGATACTGGA GATCCTCAAA GTAAGCCCCT CGGTGACTGG 120 

GCTGCTGGCA CCATGGACCC AGAGAGCAGT ATCTTTATTG AGGATGCCAT TAAGTATTTC 180 

AAGGAAAAAG TGAGCACACA GAATCTGCTA CTCCTGCTGA- CTGATAATGA GGCCTGGAAC 240 

GGATTCGTGG CTGCTGCTGA ACTGCCCAGG AATGAGGCAG A7GAGCTCCG TAAAGCTCTG 300 

GACAACCTTG CAAGACAAAT GATCATGAAA GACAAAAACT GGCACGATAA AGGCCAGCAG 360 

TACAGAAACT GGTTTCTGAA AGAGTTTCCT. CGGTTGAAAA GTGAGCTTGA GGATAACATA 420 

AGAAGGCTCC GTGCCCTTGC AGATGGGGTT CAGAAGGTCC ACAAAGGCAC CACCATCGCC 480 

AATGTGGTGT CTGGCTCTCT CAGCATTTCC TCTGGCATCC TGACCCTCGT CGGCATGGGT- 540 
CTGGCACCCT. TCACAGAGGG AGGCAGCCTT GTACTCTTGG AACCTGGGAT GGAGTTGGGA • 600 

ATCACAGCCG CTTTGACCGG GATTACCAGC AGTACCATGG ACTACGGAAA GAAGTGGTGG 660 

ACACAAGCCC AAGCCCACGA CCTGGTCATC AAAAGCCTTG ACAAATTGAA GGAGGTGAGG 720 

GAGTTTTTGG - GTGAGAACAT ATCCAACTTT CTTTCCTTAG CTGGCAATAC TTACCAACTC 790 

ACACGAGGCA TTGGGAAGGA CATCCGTGCC CTCAGACGAG CCAGAGCCAA TCTTCAGTCA 840 

GTACCGCATG CCTCAGCCTC ACGCCCCCGG GTCACTGAGC CAATCTCAGC TGAAAGCGGT 900 

GAACAGGTGG AGAGGGTTAA TGAACCCAGC ATCCTGGAAA TGAGCAGAGG AGTCAAGCTC 960 

ACGGATGTGG CCCCTGTAAG" CTTCTTTCTT GTGCTGGATG TAGTCTACCT CGTGTACGAA 1020 

TCAAAGCACT TACATGAGGG GGCAAAGTCA GAGACAGCTG AGGAGCTGAA GAAGGTGGCT 1080 

CAGGAGCTGG AGGAGAAGCT AAACATTCTC AACAATAATT ATAAGATTCT GCAGGCGGAC 1140 
CAAGAACTGT GA 

Seq ID NO: 475 Protein sequence 
Protein -Accession 8: NP_003652.1 

1 * 11 21 - 31 41 51 

i r i i i i 

- MSALFLGVGV RAEEAGARVQ QNVPSGTDTG DPQSKPLGDW AAGTMDPESS IFIEDAIKYF -60 

KEKVSTQNLL tiLLTDNEAWN GPVAAAELPR NEADBLRKAL DNLARQMIMK DKNWHDKGQQ 120 

YRNWFLKEFP RLKSELEDNI RRLRALADGV QKVHKGTTIA NWSGSLSIS SGILTLVGMG 180 

IAPFTEGGSL VLLEPGMELG ITAALTGITS STMDYGKKWW TQAQAHDLVI KSLDKLKBVR 240 

• EFLGENISNF LSLAGNTYQL TRGIGKDIRA LRRARANLQS VPHASASRPR VTEPISAESG — 
EQVERVNEPS ILEMSRGVKL TDVAPVSFFL VLDWYLVYE SKHLHEGAKS ETAEELKKVA 
QELEEKLNIL NNNYKTtiQAD QEI* 

' Seq ID NO: 476 DMA sequence 
Nucleic Acid Accession 8': NM_014452.1 
Coding sequence :" 1 .. 1 96 8 
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ATGGGGACCT CTCCGAGCAG CAGCACCGCC CTCGCCTCCT GCAGCCGCAT CGCCCGCCGA 60 

GCCACAGCCA CGAK3ATCGC GGGCTCCCTT CTCCTGCTTG GATTCCTTAG CACCACCACA . 120 

GCTCAGCCAG AACAGAAGGC CTCGAATCTC ATTGGCACAT ACCGCCA7GT TGACCGTGCC 180 

ACCGGCCAGG TGCTAACCTG TGACAAGTGT CCAGCAGGAA CCTATGTCTC TGAGCATTGT 240 

ACCAACACAA' GCCTG CGCGT CTGCAGCAGT TGCCCTGTGG GGACCTTTAC CAGGCATGAG 300 

AATGGCATAG AGAAATGCCA TGACTGTAGT CAGCCATGCC CATGGCCAAT GATTGAGAAA -«360. 

TTACCTTGTG CTGCCTTGAC TGACCGAGAA TGCACTTGCC CACCTGGCAT GTTCCAGTCT 420 

AACGCTACCT GTGCCCCCCA TACGGTGTGT CCTGTGGGTT GGGGTGTGCG GAAGAAAGGG " 480 

ACAGAGACTG AGGATGTGCG GTGTAAGCAG TGTGCTCGGG GTACCTTCTC AGATGTGCCT 540 

TCTAGTGTGA TGAAATGCAA AGCATACACA GACTGTCTGA GTCAGAACCT GGTGGTGATC " 600 

AAGCCGGGGA CCAAGGAGAC AGACAACGTC TGTGGCACAC TCCCGTCCTT CTCCAGCTCC 660 

ACCTCACCTT CCCCTGGCAC AGCCATCTTT CCACGCCCTG AGCACATGGA AACCCATGAA .. 720 

GTCCCTTCCT CCACTT ATGT TCCCAAAGGC ATGAACTCAA CAGAATCCAA CTCTTCTGCC 7 BO 
TCTGTTAGAC CAAAGGTACT GAGTAGCATC CAGGAAGGGA CAGTCCCTGA CAACACAAGC . 840 

TCAGCAAGGG GGAAGGAAGA CGTGAACAAG ACCCTCCCAA ACCTTCAGGT AGTCAACCAC 900 

CAGCAAGGCC CCCACCACAG ACACATCCTG AAGCTGCTGC CGTCCATGGA GGCCACTGGG 960 

GGCGAGAAGT CCAGCACGCC CATCAAGGGCT CCCAAGAGGG GACATCCTAG ACAGAACCTA 1020 

CACAAGCATT TTGACATCAA TGAGCATTTG \ CCCTGGATGA TTGTGCTTTT CCTGCTGCTG 1080 

GTGCTTGTGG TGATTGTGGT GTGCAGTATC CGGAAAAGCT CGAGGACTCT GAAAAAGGGG 1140 

CCCCGGCAGG ATCCCAGTGC CATTGTGGAA AAGGCAGGGC TGAAGAAATC CATGACTCCA 1200 

ACCCAGAACC GGGAGAAATG GATCTACTAC TGCAATGGCC ATGGTATCGA TATCCTGAAG 1260 

CTTGTAGCAG CCCAAGTGGG AAGCCAGTGG AAAGATATCT- ATCAGTTTCT TTGCAATGCC 1320 

AGTGAGAGGG AGGTTGCTGC TTTCTCCAAT ' GGGTACACAG. CCGACCACGA GCGGGCCTAC .1380 

GCAGCTCTGC AGCACTGGAC CATCOGGGGC CCCGAGGCCA- GCCTCGCCCA GCTAATTAGC 1440 

GCCCTGCGCC AGCACCGGAG AAACGATGTT- GTGGAGAAGA TTGGTGGGCT GATGGAAGAC 1500. 

ACCACCCAGC TGGAAACTGA CAAACTAGCT CTCCCGATGA GCCCCAGCCC GCTTAGCCCG 1560 

AGCCCCATCC CCAGCCCCAA CGCGAAACTT GAGAATTCCG CTCTCCTGAC GGTGGAGCCT. 1620 

TCCCCACAGG ACAAGAACAA GGGCTTCTTC GTGGATGAGT CGGAGCCCCT TCTCCGCTGT 1680 

GACTCTACAT CCAGCGGCTC CTCCGCGCTG AGCAGGAACG GTTCCTTTAT TACCAAAGAA 1740 

AAGAAGGACA CAGTGTTGCG GCAGGTACGC CTGGACCCCT GTGACTTGCA GCCTATCTTT 1800 

GATGACATGC TCCACTTTCT. AAATCCTGAG GAGCTGCGGG TGATTGAAGA GATTCCCCAG 1860 

GCTGAGGACA AACTAGACCG GCTATTCGAA ATTATTGGAG TCAAGAGCCA GGAAGCCAGC 1920 
CAGACCCTCC TGGACTCTGT TTATAGCCAT CTTCCTGACC TGCTGTAG 

Seq ID NO: 477 Protein sequence 
Protein Accession #: NP_055267.1 

1 .11 21 " 31 41 j 1 

MGTSPSSSTA LASCSRIARR ATATMIAGSL LLLGFLSTTT AQPEQKASNL IGTYRHVDRA 60 

TGQVLTCDKC PAGTYVSEHC TNTSLRVCSS CPVGTFTRHE NGIEKCHDCS\ QPCPWPHIEK 120 

LPCAALTDRE CTCPPGMFQS-. NATCAPHTVC PVGWGVRKKGl TETEDVRCKQ. CARGTFSDVP 180 

SSVMKCKAYT DCLSQNLWI KPGTKETDNV CGTLPSFSSS TSPSPGTAIF PRPEHMETHE 240 

VPSSTYVPKG- MNSTESKSSA SVRPKVIiSSI QEGTVPDMTS SARGKEDVNK TLPNLQWNH 300 

QQGPHHRHIL KLLPSMEATG GEKSSTPIKG PKRGHPRQNL HKHFDINEHL PWMIVLFLLL 360 

VLWIWCSI RKSSRTLKKG PRQDPSAIVE;: KAGLKKSMTP TQNREKWIYY CNGHGIDILK 420 

LVAAQVGSQW KDIYQFLOJA SEREVAAFSN- GYTADHERAY AALQHWTIRG PEASLAQLIS 480 

ALRQHRRNDV VEKIRGLMED TTQLETDKLA LPMSPSPbSP SPIPSPKAKL ENSALLTVEP 540 

SPQDKNKGFF VDESEPLLRC DSTSSGSSAL SRNGSFITKE KKDTVLRQVR LDPCDLQPIF 600 
DDMLHFLNPB ELRVIEEIPQ AEOKLDRLFE' IIGVKSQEAS QTLLDSVYSH LPDLL 

Seq ID NO: 478- DMA sequence 
Nucleic Acid Accession #: XM_044533 
Coding sequence: 238.. 2751 

1 u t •< i 1 ' r r 

GCTCTGCCCA AGCCGAGGCT GCGGGGCCGG CGCCGGCGGG AGGACTGCGG TGCCCCGCGG 60 

AGGGGCTGAG TTTGCCAGGG CCCACTTGAC CCTGTTTCCC ACCTCCCGCC CCCCAGGTCC 120 

GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GCGGACCGCG GGGCGGAGCT GCCGCCCGTG 180 

AGTCCGGCCG AGCCACCTGA GCCCGAGCCG CGGGACACCG TCGCTCCTGC TCTCCGAATG 240 

CTGCGCACCG CGATGGGCCT GAGGAGCTGG CTCGCCGCCC CATGGGGCGC GCTGCCGCCT 300 

CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 360 

TGGGCGCTCA GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGCGG CC ATTCCTCAGA 420 

TTCGAAGCTG AACACATCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGGACC 480 

CTGTACGTGG GTGCTCGAGA GGCCCTCTTT GCACTCAGTA 1 GCAACCTCAG CTTCCTGCCA 540 

GGCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAGC 600 

TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CCTGCCGCTC 660 

AGCGGCAGTC ACCTGTTCAC CTGTGGCACA GCAGCCTTCA GCCCCATGTG TACCTACATC 720 

AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGAAGGGGA ATGTCCTCCT GGAAGATGGC 780 

AAGGGCCGTT GTCCCTTCGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGCGAG ' 840 

CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC CGGCCATCTC GCGGAGCCAA 900 

AGCCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TGCAAGACCC AGCTTTTGTG 960 

GCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG GCGATGATGA CAAGATCTAC 1020 

TTTTTCTTCA GCGAGACTGG CCAGGAATTT GAGTTCTTXG AGAACACCAT TGTGTCCCGC 1080 

ATTGCCCGCA TCTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 1140 

TCCTTCCTCA AGGCCCAGCT GCTGTGCTCA CGGCCCGACG ATGGCTTCCC CTTCAACGTG 1200 

CTGCAGGATG TCTTCACGCT GAGCCCCAGC CCCCAGGACT GGCGTGACAC CCTTTTCTAT 1260 

GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 1320 

ACAATGAAGG ATGTGCAGAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CCGTGAGACA 1380 

CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 1440 

AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCAGACCG CGTGCTGAAC 1500 

TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG 1560 

CCCCAGGCTC GCTACCAGCG CGTGGCTGTA CACCGCGTCC CTGGCCTGCA CCACACCTAC 1620 

GATGTCCTCT TCCTGGGCAC TGGTGACGGC CGGCTCCACA AGGCAGTGAG CGTGGGCCCC 1680 

CGGGTGCACA TCATTGAGGA GCTGCAGATC TTCTCATCGG GACAGCCCGT GCAGAATCTG 1740 
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CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCGGCCTCAC ACTCGGGCGT AGTCCAGGTG 1800 

CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC 1860 

TACTGTGCTT GGAGCGGCTC CAGCTGCAAG CACGTCAGCC TCTACCAGCC TCAGCTGGCC 1920 

ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT 1980 

TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 2040 

CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCAACCTGGC GACCCGACTC 2100 

TGGCTACGCA ACGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 2160 

GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACT AGAGGAG : 2220 

GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 2280 

CAAACAGATG AGGGTGGCAG' TGTACCCGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA-; 2340 

GCTGGTGGCA AGGCCAGCTG"' GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG- 2400 

TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 2460 

AACAGCATGA AAGTCTT CCT GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTGC 2520 

CCTGTGGTGC TGCCCCCTGA - GACCCGCCCA CTCAACGGCC. TAGGGCCCCC TAGCACCCCG 2SB0 

CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCCCC CGGGGTCCCG AGTCTTCACT 2640 

GAGTCAGAGA AGAGGCCACT CAGCATCCAA- GACAGCTTCG TGGAGGTATC CCCAGTGTGC 2700 

CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG . ATCCGTGACT CTGTGGTGTG AGAGCTGACT 2760 

TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 2820 

TCCCCTCCGC TCTGCTCTTC- GTGGAACACG: ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 2880 

GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG. TAGOGAAGCT CCTACCACCC AGACACCCAA- 2940 

ACAGCCGTGG* CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 3000 

TGCTCCTTAT: GTAAACTGAG- CCCTTTGTTT * AAAAAACAAT TCCAAATGTG AAACTAGAAT 3060 

GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 3120 

GGGGTGCTGG: GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 3180 

TGGCCTCTTC ACCTTCCACA TTATCCCGCT GCCACCGGCT- GCCCTGTCTC ACTGCAGATT 3240 

CAGGACCAGC TTGGGCTGCG TGCGTTCTGC CTTGCCAGTC AGCCGAGGAT GTAGTTGTTG 3300 

CTGCCGTCGT- CCCACCACCT- CAGGGACCAG AGGGCTAGGT TGGCACTGCG GCCCTCACCA 3360 

GGTCCTGGGC TCGGACCCAA CTCCTGGACC TTTCCAGCCT GTATCAGGCT GTGGCCACAC 3420 

GAGAGGACAG- CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT- 3480 

CAGGGAAGAG ACTGTCGCCT. GCCTTCCTCC GTTGTTGCGT GAGAACCCGT GTGCCCCTTC 3540 

CCACCATATC CACCCTCGCT CCATCTTTGA ACTCAAACAC GAGGAACTAA CTGCACCCTG 3600 

GTCCTCTCCC CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 36G0 

TCAACACTGC CCAGCACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTAATAAG 3720 
ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 

Seq ID NO; 479 Protein sequence 
Protein Accession #: XP_044533.3 

1 11 21 31 41 SI 

MLRTAMGLRS WLAAPWGALP PRPPLLLLLIi LLLLLQPPPP TWALSPRISL PLGSEERPFL 60 

RFEAEHISNY TALLLSRDGR TLYVGARSAL. FALSSNLSFL PGGEYQELLW GADAEKKQQC 120 

SPKGKDPQRD CQNYIKILLP- LSGSHLFTCG TAAFSPMCTY" INMENPTLAR- DEKGNVLLED ; 180 

GKGRCPFDPN FKSTALWDG ELYTGTVSSF QGNDPAISRS QSLRPTKTES. SLNWLQDPAF 240 

VASAYIPESL GSLQGDDDKI YFFFSETGQE FEFFEKTIVS RIARICKGDE GGERVLQQRW 300 

TSFLKAQfcLC SRPDDGFPFN- VLQDVFTLSP SPQDWRDTLF YGVFTSQWHR GTTEGSAVCV 360 

FTMKDVQRVF SGLYKEVNRE TQQWYTVTHP VPTPRPGACI TNSARERKIN SSLQLPDRVL * n 
NFLKDHFUMD GQVRSRMLLL QPQARYQRVA VHRVPGLHHT YDVLFLGTGD GRLHKAVSVG 
PRVHIIEELQ IFSSGQPVQN- LLIiDTHRGLI* YAASHSGWQ; VPMAMCSLYR SCGDCLLARD 

PYCAWSGSSC KHVSLYQPQI* ATRPWIQDIE GASAKDLCSA SSWSPSFVP TGEKPCEQVQ 600 

FQPNTVNTLA CPLLSNIATR LWLRNGAPVN ASASCHVLPT" GDLLLVGTQQ LGEFQCWSLE 660 

EGFQQLVASY CPEWEDGVA DQTDEGGSVP VIISTSRVSA PAGGKASWGA DRSYWKEFLV 720 

MCTLFVLAVL LPVLFLLYRH RNSMKVFLKQ GECASVHPKT CPWLPPETR PLNGLGPPST 780 
PLDHRGYQSL SDSPPGSRVF- TESEKRPLSI QDSFVEVSPV CPRPRVRLGS EIRDSVV 



420 
480 
540 



Seq It) NO: 480 dna sequence 

Nucleic Acid Accession #: NM_004217.1 

Coding sequence: 58.. 1092 

1 11 21 31 41 51 

GGCCGGGAGA GTAGCAGTGC CTTGGACCCC AGCTCTCCTC CCCCTTTCTC TCTAAGGATG 60 

GCCCAGAAGG AGAACTCCTA CCCCTGGCCC TACGGCCGAC AGACGGCTCC ATCTGGCCTG 120 

AGCACCCTGC CCCAGCGAGT CCTCCGGAAA GAGCCTGTCA CCCCATCTGC ACTTGTCCTC 180 

ATGAGCCGCT CCAATGTCCA GCCCACAGCT GCCCCTGGCC AGAAGGTGAT GGAGAATAGC 240 

. AGTGGGACAC CCGACATCTT. AACGCGGCAC TTCACAATTG ATGACTTTGA GATTGGGCGT 300 

CCTCTGGGCA AAGGCAAGTT TGGAAACGTG TACTTGGCTC GGGAGAAGAA AAGCCATTTC 360 

ATCGTGGCGC TCAAGGTCCT CTTCAAGTCC CAGATAGAGA AGGAGGGCGT- GGAGCATCAG 420 

CTGCGCAGAG AGATCGAAAT CCAGGCCCAC CTGCACCATC CCAACATCCT GCGTCTCTAC 480 

AACTATTTTT ATGACCGGAG GAGGATCTAC TTGATTCTAG AGTATGCCCC CCGCGGGGAG 540 

CTCTACAAGG AGCTGCAGAA GAGCTGCACA TTTGACGAGC AGCGAACAGC CACGATCATG 600 

' GAGGAGTTGG CAGATGCTCT AATGTACTGC CATGGGAAGA AGGTGATTCA CAGAGACATA 660 

AAGCCAGAAA ATCTGCTCTT AGGGCTCAAG GGAGAGCTGA AGATTGCTGA CTTCGGCTGG 720 

TCTGTGCATG CGCCCTCCCT GAGGAGGAAG ACAATGTGTG GCACCCTGGA CTACCTGCCC 780 

CCAGAGATGA TTGAGGGGCG CATGCACAAT GAGAAGGTGG ATCTGTGGTG CATTGGAGTG 840 

CTTTGCTATG AGCTGCTGGT GGGGAACCCA CCCTTTGAGA GTGCATCACA CAACGAGACC 900 

TATCGCCGCA TCGTCAAGGT GGACCTAAAG TTCCCCGCTT CTGTGCCCAC GGGAGCCCAG 960 

GACCTCATCT CCAAACTGCT CAGGCATAAC CCCTCGGAAC GGCTGCCCCT GGCCCAGGTC 1020 

TCAGCCCACC CTTGGGTCCG GGCCAACTCT CGGAGGGTGC TGCCTCCCTC TGCCCTTCAA- 1080 

TCTGTCGCCT GATGGTCCCT GTCATTCACT CGGGTGCGTG TGTTTGTATG TCTGTGT ATG *- 1140 

TATAGGGGAA AGAAGGGATC CCTAACTGTT CCCTTATCTG TTTTCTACCT CCTCCTTTGT 1200 
• TTAATAAAGG CTGAAGCTTT TTGT 

Seq ID NO: 481 Protein sequence 
Protein Accession ft: NPJJ04208 

1 11 21 31 41 51 
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I . 1 

MAQKENSYPW PYGRQTAPSG 
SSGTPDILTR HFTIDDFEIG 
QLRREIEIQA- HLHHPNILRL 
MEELADALMY CHGKKVIHRD 
PPEMIEGRMH NEKVDLWCIG 
QDLISKLLRH NPSERLPLAQ 



PCT/US02/12476 



LSTLPQRVLiR KEPVTPSALV 
RPLGKGKFGN VYLAREXKSH 
YNYFYDRRRI YLILEYAPRG 
IKPENLLLGL KGELKIADFG 
VLCYELLVGN PPFESASHNE 
VSAHPWVRAN SRRVLPPSAL 



I I 

LMSRSNVQPT . AAPGQKVMEN 6 0 

FIVALKVLFK SQIEKEGVEH 120 

ELYKELQKSC TFDEQRTATI 180 

WSVEAPStiRR KTMCGTLDYL 240 

TYRRIVKVDL KFPASVPTGA 300 
QSVA 



Seq ID NO: 482 DNA sequence 
Nucleic Acid Accession #: AK055663 
Coding sequence: 38.. 1423 



1 
I 

AGAACGGCTT 
AAAACCACAA 
CCGAAGGTCC 
GCTTATGTGG 
TTTTGATCTT 
TAGCCCTGTC 
AGTCTTGGCA 
ACAGCCCGAG 
CCTGTTCACG 
r TACGAGCTGG 
GGGACTTAGC 
AGCATTTGCT 
CACTGCCTCT 
GTACAGTGGG 
ACTCATCAGA 
GACCCTAGGT 
TGAACAAATG 
TGTTCAAATT 
CAATGTCCTA 
TGATTTGAAC. 
ATTTAACACT 
TTATGGTTTT 
TGGAGTTCCA 
TAGATATGGA 
TTATAAGGAA- 
TGTTTAATCA- 
TATGAAACTA 
GCTTTAAATA 
GTTTTGTAGT. 
AGATCTGTCA 
TGCCACTGTG 
CTTAGTTTTT. 
ATGCAGTGGC 
TGCCTCAGCC 
TGTATTTTTA' 
ACCTCATGAT 
CCTGGCCGAT 
GGGAAAGGGA 
AATTGCTAAA 
TTTTTAGCAG 
GATTTTTGTT 



11' 
I 

CCGGCGGGAG 
AGATCCTTTT 
TGGAAGATAC 
TGCAGTTCTA 
TTTAGTTTAA 
TATTCATTTG 
CAGTTGGGAG 
ATACACACGG 
ATGCT TTCT A 
CTTCAAGAGC 
AGTATCTTCC 
CTTTGTATTA 
GCTATAGCTA 
AAAGTCTTAC 
GAGGTATCTA 
TTTGGCTCAT 
GTTCTTGCTC 
TTCAAGGATG 
AACTTTTCAG 
CCAGTTACAT 
CCTGGGAAAA 
GGTCTCAATC 
GGAATTGGAG 
ACTAATAATA 
TATTGACTCC 
TTTACTCTAA 
TATTTTTGTA 
GGCTTCCTTT 
TGACTGCAGT 
CATTACTAAG 
CCCGGCCAAT 
GTTTTGTTTT 
ATGATCTCAG 
TCCCGAGTAG 
GTAAAGACGG 
CCACCCACCT 
ATTTTCTTTA 
AAAATGTCTG 
TT TTT CTT T G 
AAATTTTGGA 
AAAGTTTCTC 



21 

I - 

CTGTGCAGCT 
TTGGCAAGTT 
TGCTCTTTGG- 
CTAATAGTAT 
TGACATGTTT 
GGTTTGAAAG 
CTCTCTTTAT 
GAAGATTATT 
TTCGGAATAA 
ATGTTG CAGA - 
TTCCCCGAAT 
CATAT ATGCT. 
TTGCCTTGAT 
TCCAGACAAC 
CCTTAGATGG 
TGGCTGGATC 
ATGTGACCAA 
ACTGGATTAG- 
ATCATCAOGT 
CAACTCCAGC 
ATGTGAACCC 
ATGGACACAC 
CAACTCAAGG' 
GAATTGGACA 
TTGGCTTCCA 
ATGTTAGATA 
AAATGTATTT 
AGAAAATGTG 
GTGATGTGAC 
ATACGATATT 
ACATTATTAT 
GTTTTTTGAG 
CTCACTGCAA 
CTGGGATTAC 
GGGATTTCAC 
TAGCCTCCCA 
ATGAAATTTA 
TTCAAAAAGT 
AGGTTCTCCT 
ATACATTCTA 
TCCTTTAAAA 



31 

i 

OCTTATCATG 
GTTACGGGAA 
TGTAATAAAC 
AGCTTTAACT 
AATAAGTTAC 
ATTAGAAGTC. 
ATTAAAAGAA* 
AGTTGGTACT 
ACCTTTTGCT 
TCTTAGTCGA 
GAATCCATTT 
CATTGAAATT 
GACATTTGGC 
ACCACCCCAT 
AGTTTTAGAA- 
AGTGCATGTA 
CAGGCTGTAC 
GCCTGCCTTA : 
AATCCCAATG 
TAAACCTAGT 
AGTTATTCTT 
ACCTTACAGC 
ATTGAGGACT 
ACCAAGACCA 
ATTTATTTAG 
ATAGTAGTCT 
GTGACAGTGA 
TTTCTTTAAA 
CTTACCTTTA 
TCTTTTTTTT 
TAACTTAAGG 
ATGGAGTCTC 
CCTCTGCCTC 
AGGCACCTGC 
CATGTTGGCC 
AAGTGCTGGG 
TAAATATGCT 
AAAGGTCTCT 
GAATTATGTC 
TCTAGCACAA 
ATTTTAGTAC 



41 
I 

GGGACAATTC 
TTTAGACTTG 
TTGATATGTA 
GCCTATACTT 
TGGC3TAACAT 
CTGGCTGTAT 
AGTGCAGAAC 
TTTGTGGCTC 
TATGTCTCAG 
AGCTTGTGTG 
GTTTTGATTG 
AATAATTATT- 
ACTATGTATC 
GTTATTGGTC 
GTCCGAAATG 
AGAATTCGAC 
ACTCTAGTGT 
TTGTCTGGGC 
OCTCTTTTAA 
AGTCCACCTC 
CTAAACACAC 
AGCATGCTTA 
GGTTTTACAA 
TGATAGACTC 
TAATCCAACT 
TGTTCACATT 
AATCCTCGTA 
TTTGGATTTT. 
TAAGAGCCAC 
TCCGAGACGG 
CTGTACTTTA 
ACTCTGTCGC 
CTGAGTTCAA 
CACCACGCCC 
AGGCTGGTCT 
ATTAGGTGTG 
TCTTGAATAA 
TTTATAGCTT 
TTACAAACTA 
TTTGAATTTT 
ATTTGTAAAT 




ACCTGACCAT 
TGAGGAAACC. 
TTGCCTCCAC 
GCTTTTTGGA 
TTTGTTTCAA 
AAGCTGCTAG 
GAATTATTCC 
ATCTTGCTGG. 
TTGCCGTAGA 
CCATGAGTGT • 
AGTTGGACAA 
AACATTTTTG 
GAGATGCCAA 
CTACTCTAAC 
CTGTTGCAGC 
AGGGTACTGA 
CAGAATTTTC 
AAACAAGGCC 
ATCAAGGACT 
ATATACCAAG 
TAACTTATTT 
TTGCATTGAC 
TCATGAAACC 
AATGTTAAAG 
GGTATCTTTG 
TTGATGGAGT 
AGTCTTGCTC 
TTAAGGCTTC 
CCAGGCTGGA V 
ATGATTCTCC 
AGCTAATTTT 
TGAACTCCTG 
AGCCACCGCA 
TACACATTTT 
TTCCAAACTT 
AAAGCAAAAA 
TAATTATCAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
. 1260= 
1320 
13B0 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



Seq ID NO: 483 Protein. sequence 
Protein Accession fh BAB70980.1 



MGTIHLFRKP 
TAYTYLTIFD 
ESAERFLEQP 
RSLCGIIPGL 
GTMYPMSVYS 
VRIRRDANEQ 
MPDLKGTDDL 
SSMttNQGlGV 



11 
I 

QRSFFGKLLR 
LFSLHTCLIS 
EIHTGRLLVG 
SSIFLPRMNP 
GKVLIOTTPP 
MVLAHVTNRIi 
NPVTSTPAKP 
PGIGATQGLR 



21 
I 

EFRLVAADRR 
YWVTLRKPSP 
TFVALCFNLF 
FVLIDLiAGAF 
KVIGQLDKLI 
YTLVSTLTVQ 
SSPPPEFSFN 
TGFTNIPSRY 



31 

! 

SWKILLFGVI 
VYSFGFERLE 
TMLSIRNKPF. 
ALCITYMLIE 
REVSTLDGVL 
IFKDDWIRPA 
TPGKNVNPVI 
GTNNRIGQPR 



41 
I 

NLICTGFLLM 
VLAVFASTVL 
AYVSEAASTS 
INNYFAVDTA 
EVRNEHFWTL 
LLSGPVAANV 
LLNTQTRPYG 
P 



51 
I 

WCSSTNSIAL 
AQLGALPILK 
WLQEHVADLS 
SAIAIALMTF 
GPGSLAGSVH 
LNFSDHHVIP 
FGLNHGHTPY 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 484 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: . 1. .'900 



1 
I 

ATGCCGCCGC 
CCGCGGCGGC 
GCCGTGGGCA 
CGGCCCACTG 
GGCTGCGGCG 
GGACCCCGGG 
CTTCCTAACT 
CCGGTGCGCA 
CTTTGCTACC 
TTTCAAAACA 
GTGCTGCTGG 



11 
I 

GGGAGCTGAG 
GTAGCGCGCC 
AGAGCAGCCT 
CGCTGGACAC 
GGGCTGTGCA 
GAGGAGACTG 
CAGGCTCTCC 
TTGAGCTCTG 
CGGATACCGA 
TCACAGAGAA 
TGGGCACCCA 



21 

I. 

CGAGGCCGAG 
CCCAGAGCTG 
CATCGTCAGC 
CTTCTCTGGT 
CCGGGGAGCT. 
GAGCAGGCCC 
CCGCCCCGCC 
GGACACAGCG 
TGTCTTCCTG 
ATGGCTGCCC 
GGCCGACCTG 



31 
I 

CCGCCCCCGC 
GGCATCAAGT 
TACACCTGCA 
ACGTACGTTC 
GGGGCGGGCG 
CGAGGTGGCG 
CCTGCAGTGC 
GGACAGGAGG 
GCGTGCTTCA 
GAGATCCGCA 
AGGGACGATG 



41 
I 

TCCGGGCCCC 
GCGTGCTGGT 
ATGGGTACCC 
AATCGCCCGT 
TCTCGGCGGG 
CTGGTGCGGC 
AAGTCCTGGT 
ATTTTGACCG 
GCGTGGTGCA 
CGCACAACCC 
TCAACGTACT 



51 
1 

GACCCCTCCC 

GGGCGACGGC 

CGCGCGCTAC 

GCGGCCGCGT 

AGGGCGCAGA 

CCAGGACGCT 

GGATGGAGCT 

ACTTCGTTCC 

GCCCAGCTCC , 

CCAGGCGCCT 

AATTCAGCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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GACCAGGGGG GCCGGGAGGG CCCCGTGCCC CAACCCCAGG CTCAGGGTCT GGCCGAGAAG 720 

ATCCGAGCCT GCTGCTACCT . TGAGTGCTCA GCCTTGACGC. AGAAGAACTT GAAGGAACTA 7B0 

TTTGACTCGG CTATTCTCAG TGCCATTGAG CACAAAGCCC GGCTGGAGAA GAAACTGAAT 840 
GCCAAAGGTG .TGCGCACCCT CTCCCGCTGC CGCTGGAAGA AGTTCTTCTG CTTCGTTTGA 

Seq- ID NO: 485 Protein sequence 
Protein Accession ft: FGENBSH predicted 

1 11 21 31 41- 51 

MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS YTCNGYPARY 60 

RPTALDTFSG TYVQSPVRPR GCGGAVHRGA GAGVSAGGRR GPRGGDWSRP RGGAGAAQDA 120 

LPNSGSPRPA PAVQVLVDGA PVRIELWDTA GQEDFDRLRS LCYPDTDVFL ACFSWQPSS 180 

FQNITEKWLP EIRTHNPQAP VLLVGTQADL RDDVNVLIQL DQGGREGPVP QPQAQGLAEK 240 
IRACCYLECS ALTQKNLKEV. FDSAILSAIE HKARLEKKLN AKGVRTLSRC RWKKFFCFV 

Seq. ID NO: 486 DNA sequence . 
Nucleic Acid Accession 8: XM_063832.2 
Coding sequence: 1..711 

1 .11 '21 31 41 51 

atgccgccgc GGGAGCTGAG CGAGGCCGAG CCGCCCCCGC TCCGGGCCCC GACCCCTCCC 60 

• CCGCGGCGGC GTAGCGCGCC CCCAGAGCTG GGCATCAAGT GCGTGCTGGT GGGCGACGGC 120 

GCCGTGGGCA AGAGCAGCCT CATCGTCAGC TACACCTGCA ATGGGTACCC CGCGCGCTAC 180 

CGGCCCACTG. CGCTGGACAC CTTCTCTGTG . CAAGTCCTGG TGGATGGAGC TCCGGTGCGC 240 • 

ATTGAGCTCT GGGACACAGC GGGACAGGAG GATTTTGACC GACTTCGTTC CCTTTGCTAC 300 

CCGGATACCG* ATGTCTT CCT GGCGTGCTTC AGCGTGGTGC AGCCCAGCTC CTTTCAAAAC 360 

ATCACAGAGA AATGGCTGCC CGAGATCCGC ACGCACAACC CCCAGGCGCC TGTGCTGCTG 420 

GTGGGCACCC AGGCCGACCT GAGGGACGAT GTCAACGTAC TAATTCAGCT GGACCAGGGG 480 

GGCCGGGAGG GCCCCGTGCC CCAACCCCAG GCTCAGGGTC TGGCCGAGAA GATCCGAGCC 540 

TGCTGCTACC TTGAGTGCTC AGCCTTGACG CAGAAGAACT- TGAAGGAAGT ATTTGACTCG 600 

GCTATTCTCA GTGCCATTGA- GCACAAAGCC CGGCTGGAGA AGAAACTGAA TGCCAAAGGT 660 
GTGCGCACCC TCTCCCGCTG CCGCTGGAAG AAGTTCTTCT GCTTCGTTTG A 

Seq ID NO: 487 Protein sequence 
Protein Accession ft: XP_063832.1" 

1 11 21 31 41 51 

JtPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS YTCNGYPARY 60 

RPTALDTFSV QVLVDGAPVR IELWDTAGQE DFDRLRSLCY PDTDVFLACF SWQPSSFQN 120 

ITEKWLPEIR THNPQAPVLL VGTQADLRDD VNVLIQLDQG GREGPVPQPQ AQGLAEKIRA 180 
CCYLECSALT QKNLKEVFDS AILSAIEHKA RLEKKLNAKG VRTLSRCRWK KFFCFV 

Seq ID NO: 488 DNA sequence 

Nucleic Acid Accession 8: NM_014398.1 

Coding sequence: 64.. 1314 

I 11 21 31 |1 51 

GGCACCGATT CGGGGCCTGC CCGGACTTCG CCGCACGCTG CAGAACCTCG CCCAGCGCCC 60 

ACCATGCCCC GGCAGCTCAG CGCGGCGGCC GCGCTCTTCG CGTCCCTGGC CGTAATTTTG 120 

CACGATGGCA GTCAAATGAG AGCAAAAGCA TTTCCAGAAA CCAGAGATTA TTCTCAACCT 180 

ACTGCAGCAG CAACAGTACA GGACATAAAA AAACCTGTCC AGCAACCAGC TAAGCAAGCA 240 

CCTCACCAAA CTTTAGCAGC AAGATTCATG GATGGTCATA TCACCTTTCA AACAGCGGCC 300 

ACAGTAAAAA TTCCAACAAC TACCCCAGCA ACTACAAAAA ACACTGCAAC. CACCAGCCCA 360 

ATTACCTACA- CCCTGGTCAC AACCCAGGCC ACACCCAACA ACTCACACAC AGCTCCTCCA 420 

GTTACTGAAG TTACAGTCGG CCCTAGCTTA GCCCCTTATT CACTGCCACC CACCATCACC 480 

CCACCAGCTC ATACAGCTGG AACCAGTTCA TCAACCGTCA GCCACACAAC TGGGAACACC 540 

ACTCAACCCA GTAACCAGAC CACCCTTCCA GCAACTTTAT CGATAGCACT GCACAAAAGC 600 

ACAACCGGTC AGAAGCCTGA TCAACCCACC CATGCCCCAG GAACAACGGC AGCTGCCCAC 660 

AATACCACCC GCACAGCTGC ACCTGCCTCC ACGGTTCCTG GGCCCACCCT TGCACCTCAG 720 

CCATCGTCAG TCAAGACTGG AATTTATCAG GTTCTAAACG GAAGCAGACT CTGTATAAAA 780 

GCAGAGATGG GGATACAGCT GATTGTTCAA GACAAGGAGT CGGTTTTTTC ACCTCGGAGA • 840 

TACTTCAACA TCGACCCCAA CGCAACGCAA GCCTCTGGGA ACTGTGGCAC CCGAAAATCC 900 

AACCTTCTGT TGAATTTTCA GGGCGGATTT- GTGAATCTCA CATTTACCAA GGATGAAGAA 960 

T CAT ATT AT A TCAGTGAAGT GGGAGCCTAT TTGACCGTCT CAGATCCAGA GACAGTTTAC 1020 

CAAGGAATCA AACATGCGGT GGTGATGTTC CAGACAGCAG TCGGGCATTC CTTCAAGTGC 1080 

GTGAGTGAAC AGAGCCTCCA GTTGTCAGCC CACCTGCAGG TGAAAACAAC CGATGTCCAA 1140 

CTTCAAGCCT TTGATTTTGA AGATGACCAC. TTTGGAAATG TGGATGAGTG CTCGTCTGAC 1200 

TACACAATTG TGCTTCCTGT- GATTGGGGCC ATCGTGGTTG GTCTCTGCCT TATGGGTATG 1260 

GGTGTCTATA AAATCCGCCT AAGGTGTCAA TCATCTGGAT ACCAGAGAAT- CTAATTGTTG 1320 

CCCGGGGGGA ATGAAAATAA TGGAATTTAG AGAACTCTTT- CATCCCTTCC AGGATGGATG 1380 

TTGGGAAATT CCCTCAGAGT- GTGGGTCCTT CAAACAATGT AAACCACCAT CTTCTATTCA 1440 

AATGAAGTGA GTCATGTGTG ATTTAAGTTC AGGCAGCACA TCAATTTCTA AATACTTTTT 1500 

GTTTATTTTA TGAAAGATAT AGTGAGCTGT TTATTTTCTA- GTTTCCTTTA GAATATTTTA 1560 

GCCACTCAAA " GTCAACATTT GAGATATGTT GAATTAACAT. AATATATGTA AAGTAGAATA 1620 

AGCCTTCAAA TTATAAACCA . AGGGTCAATT GTAACTAATA CTACTGTGTG TOCATTGAAG 1680 

ATTTTATTTT ACCCTTGATC TTAACAAAGC CTTTGCTTTG TTATCAAATG GACTTTCAGT 1740 

GCTTTTACTA * TCTGTGTTTT ATGGTTTCAT GTAACATACA TATTCCTGGT CTAGCACTTA 1800 

ACTCCTTTTC CACTTTAAAT TTGTTTTTGT TTTTTGAGAC GGAGTTTCAC TCTTGTCACC 1860 

CAGGCTGGAG TACAGTGGCA CGATCTCGGC TTATGGCAAC CTCCGCCTCC CGGGTTCAAG 1920 

TGATTCTCCT GCTTCAGCTT CCCGAGTAGC TGGGATTACA GGCACACACT ACCACGCCTG 1980 

GCTAATTTTT GTATTTTTAT TATAGACGGG TTTCACCATG TTGGCCAGAC TCCTCTTGAA- 2040 

CTCTTGACCT' CAGGTGATCC ACCCACCTCA GCCTCCCAAA GTGCTGGGAT TACAGGCATG 2100 

AGCCATTGCG CCCGGCCTTA AATGTTTTTT TTAATCATCA AAAAGAACAA CATATCTCAG 2160 
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AAAAAGAACA AATCAGCTTA TATTTTTTAT 2220 

CTAAGAATTA GGTGGCTACA GATGGTAGAA 2280 

GCCCTTAATT ATTAACAAAG. TGCCAGAGTC" 2340 

CATTCTCACA ACTTATAAGT GAATGAGTAA ' 2400 

ATGTCACCTG GCTAACTGAT GGCAGAGCCA 2460 

GGTCTTTGGT CTTCTCCCTA CACCAAGTTA 2520 

TGAAGGCTCA CACCTCATAC CAGCATACGC 2580 

ATCATGAGAC ATTAGGGTAG . ATGAAAGGAG 2640. 

TAATAAATCC TCCACTCTCT GGAAGGAGAC 2700 

CTCATTTTTA TGGGATTGCT TAGCTGGGCT ' 2760 

GTATTTTTAA ATTTTTTTGA TAATAGAGAA 2820 

GTATAGCCCC ATCTTGTGGT AACTTGCTGC 2880 

TTCACTTTAT TCTGTAGAGC AGCCTGCCAA 2940 

TAAAGAAAGG AACTAAGTCA GGATGTTAAC 3000 

GTCAAGGAAT AATTCAAGTC AGCCTAGAGA 3060 

TGACTCAGTA AGTTGGCAAG GTCCTGACTT 3120 
AGGTTTTTGC AATAAAAACT TACTTTGG. 

Seq ID NO: 489 Protein sequence 
Protein Accession #: NPJJ5S213.1 

1 11 21 31 41 SI 

I III I I 

MPRQLSAAAA LFASLAVILH DGSQMRAKAF PETRDYSQPT AAATVQDIKK PVQQPAKQAP 60 

HQTLAARFMD GHITFQTAAT VKIPTTTPAT. TKNTATTSPI TYTLVTTQAT PNNSHTAPPV 120 

TEVTVGPSLA- PYSLPPTITP PAHTAGTSSS TVSHTTGNTT QPSNQTTLPA TLSIALHKST 180 

TGQKPDQPTH APGTTAAAHN TTRTAAPAST, VPGPTLAPQP SSVKTGIYQV LNGSRLCIKA 240 

EMGIQLIVQD KESVFSPRRY' FNIDPNATQA SGNCGTRKSN ' LLLNFQGGFV NLTFTKDEES 300 

YYISEVGAYL TVSDPETVYQ. GIKHAWMFQ. TAVGHSFKCV SEQSLQLSAH LQVKTTDVQL 360 
QAFDFEDDHF, GNVDECSSDY TIVLPVIGAr. WGLCLMGMG VYKIRLRCQS SGYQRI . 

Seq ID NO: 490 DNA 'sequence * 
Nucleic Acid Accession #: NMJ>05409.3 
Coding sequence: 94.. 378 

11 21 31 41 51 

I I I I I 

GTTCAGCATT. TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG - 60 

AGCAGCAACA GCAAAAAACA AAGATGAGTG TGAAGGGCAT GGCTATAGCC 120 

TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180 

TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 

CAAGTAACAA CTGTGACAAA . ATAGAAGTGA TTATTACCCT GAAAGAAAAT 300 

GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT '360 

ATTTTTAAAA- ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420 

AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG . 480 

ATGGTTTTGT GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540 

ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600 

GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 

GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT. ATACATTCAT 720 

GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780 

CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT. 840 

AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 900 

TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960 

TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020 

ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA" 1080 

GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140 

AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 12 00 

GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC. 1320 

ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 

CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 
TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 

Seq ID NO: 491 Protein sequence 
Protein Accession #: NP_O05400-l 

1 li ' . 21 

I I : I . 

MSVKGMAIAL AVILCATWQ GFPMFKRGRC 
EVIITLKENK GQRCLNPKSK QARL1 IKKVE 

Seq ID NO: "492 DNA sequence 

Nucleic Acid Accession #: " NM_000577.l 

Coding sequence: 41.. 520 ! , 

• 1 '11 ' 21 . 31 41 51 

I I - • I - I I I 

• GGCACGAGGG GAAGACCTCC TGTCCTATCA GGCCCTCCCC ATGGCTTTAG AGACGATCTG 60 
CCGACCCTCT GGGAGAAAAT CCAGCAAGAT GCAAGCCTTC AGAATCTGGG ATGTTAACCA 120 
GAAGACCTTC TATCTGAGGA ACAACCAACT AGTTGCCGGA TACTTGCAAG GACCAAATGT 180 
CAATTTAGAA GAAAAGATAG ATGTGGTACC CATTGAGCCT CATGCTCTGT TCTTGGGAAT -240 
CCATGGAGGG AAGATGTGCC TGTCCTGTGT CAAGTCTGGT GATGAGACCA GACTCCAGCT 300 

'GGAGGCAGTT AACATCACTG ACCTGAGCGA .GAACAGAAAG CAGGACAAGC GCTTCGCCTT 360 

CATCCGCTCA GACAGTGGCC CCACCACCAG TTTTGAGTCT GCCGCCTGCC CCGGTTGGTT 420 

CCTCTGCACA GCGATGGAAG CTGACCAGCC .CGTCAGCCTC ACCAATATGC CTGACGAAGG 480 

CGTCATGGTC ACCAAATTCT ACTTCCAGGA GGACGAGTAG TACTGCCCAG GCCTGCCTGT 540 

TCCCATTCTT GCATGGCAAG GACTGCAGGG . ACTGCCAGTC CCCCTGCCCC AGGGCTCCCG 600 



WO 02/086443 ■ •* ' 

GTTGTCTAAG TGTTTTTATG TAAAACCAAC 
CTTGATGACT CCTGCTCCAG AATTGCTAGA 
CTAAACAATA AGCAAGAGAC AATAATAATG 
TAGGCTAAGC ACTTTATCT A TATCTCATTT 
ACTGAGACTT " AAGGGAACTG AATCACTTAA 
GAGCTTGAAT TCATGTTGGT. CTGACATCAA 
CCTACAAGAA CAATGACACC ACACTCTGCC 
TCACCTTACA GGGAAATGGG TTTATCCAGG 
AGCTTTGCAG ATAACAAAAT AGCCTATCCT 
TGAGGGGCTT TGTAAAACAT TAGTCAGTTG 
GTAAAGATGA AGGCATCAAA TAAACTCAAA 
ACTTCGCTAA CCAACTGTTC TTTCTTGAGT 
TTCTGCACTT CATATCCATA TTTCCTATTG 
" GAATTTTATT" TCTGCTGTTT TTTTTGCTGC 
" AGAAAAGTCC ACATAACCCT AGAATTCTTA- 
CCATGTTGAC TTTCCTCATG TGTTTCCTTA 
TAGTCTTAAT AAAACATTGA ATTGTAGTAA 



1 
I 

TTCCTTTCAT 
CAACAGCACC 
TTGGCTGTGA 
TGTCTTTGCA 
ATAATGTACC 
AAAGGACAAC 

, GAAAGAAAGA 
AACCTAGAAC 
AGACTTTTCT 
GGGTGAAAGG 
CTGCCCAAAG 
GGTTACCATC 
GCATTTCTAG 
GAGAACATTT 
CTGTGGTTAC . 
CATCTATGTG. 
CCAAATATCA 
TTTATAACCA . 

. TGGGATACTG 
GATGTTTTTC 
TGTACTTTTT. 
TACAAAATGT 
AATCACTTTT 
TTGTTCATGC 

. GTCATTTTTT 



31 41 51 

I I I 

. LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60 

RKNF 



373 



WO. 02/086443 

GCTATGGGGG " CACTGAGGAC . CAGCCATTGA GGGGTGGACC CTCAGAAGGC GTCACAACAA 6 S 0 

CCTGGTCACA GGACTCTGCC - TCCTCTTCAA CTGACCAGCC TCCATGCTGC CTCCAGAATG 720 

GTCTTTCTAA TGTGTGAATC AGAGCACAGC AGCCCCTGCA CAAAGCCCTT CCATGTCGCC 780 

TCTGCATTCA GGATCAAACC CCGACCACCT- GCCCAACCTG CTCTCCTCTT GCCACTGCCT '840 ■ 

CTTCCTCCCT CATTCCACCT TCCCATGCCC TGGATCCATC AGGCCACTTG ATGACCCCCA 900 

ACCAAGTGGC TCCCACACCC TGTTTTACAA' AAAAGAAAAG ACCAGTCCAT GAGGGAGGTT 960 

TTTAAGGGTT TGTGGAAAAT GAAAATTAGG ATTTCATGAT TTTTTTTTTT .CAGTCCCCGT 1020 

GAAGGAGAGC CCTTCATTTG. GAGATTATGT TCTTTCGGGG" AGAGGCTGAG GACTTAAAAT 1080 

ATTCCTGCAT TTGTGAAATG ATGGTGAAAG TAAGTGGTAG. CTTTTCCCTT. CTTTTTCTTC 1140 . 

TTTTTTTGTG ATGTCCCAAC TTGTAAAAAT TAAAAGTTAT GGTACTATGT. TAGCCCCATA 1200 

ATTTTTTTTT TCCTTTTAAA ACACTTCCAT " AATCTGGACT. CCTCTGTCCA GGCACTGCTG . 1260. 

CCCAGCCTCC AAGCTCCATC TCCACTCCAG - ATTTTTTACA GCTGCCTGCA GTACTTTACC 1320 

TCCTATCAGA AGTTTCTCAG . CTCCCAAGGC TCTGAGCAAA TGTGGCTCCT GGGGGTTCTT. 1380 

TCTTCCTCTG CTGAAGGAAT AAATTGCTCC TTGACATTGT • AGAGCTTCTG GCACTTGGAG 1440 

ACTTGTATGA AAGATGGCTG TGCCTCTGCC TGTCTCCCCC ACCAGGCTGG GAGCTCTGCA 1500 

GAGCAGGAAA CATGACTCGT ATATGTCTCA GGTCCCTGCA GGGCCAAGCA CCTAGCCTCG 1560 

CTCTTGGCAG GTACTCAGCG AATGAATGCT GTATATGTTG GGTGCAAAGT TCCCTACTTC 1620 

CTGTGACTTC AGCTCTGTTT TACAATAAAA'- TCTTGAAAAT GCCTAAAAAA AAAAAAAAAA 1680 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID NO: 493 Protein sequence 
Protein Accession #: NPJ300568.1 

1 11 21 ■ 31 ' 41 51 

MALETICRPS; GRKSSKMQAP RIWDVNQKTF YLRNNQLVAG YLQGPNVNLE EKIDWPIEP 60 

HAKFLGIHGG • KMCLSCVKSG DETRLQLEAV" NITDLSENRK QDKRFAFIRS DSGPTTSFES 12 0 
AACPGWFLCT AMEADQPVSL TNMPDEGVMV- TKFYFQEDE 

Seq ID NO: 494 DNA sequence 

Nucleic Acid Accession NM_002081.1 

Coding sequence: 222.. 1898 

1 11 21 31 41 51 

GGCTGCCCGA GCGAGCGTTC GGACCTCGCA CCCCGCGCGC CCCGCGCCGC CGCCGCCGCC 60 

GGCTTTTGTT GTCTCCGCCT CCTCGGCCGC CGCCGCCTCT GGACCGCGAG CCGCGCGCGC 120 

CGGGACCTTG GCTCTGCCCT TCGCGGGCGG: GAACTGCGCA GGACCCGGCC AGGATCCGAG- 180 

AGAGGCGCGG- GCGGGTGGCC GGGGGCGCCG ; CCGGCCCCGC CATGGAGCTC CGGGCCCGAG 240 

GCTGGTGGCT. GCTATGTGCG GCCGCAGCGC TGGTCGCCTG CGCCCGCGGG GACCCGGCCA 300 

GCAAGAGCCG GAGCTGCGGC GAGGTCCGCC AGATCTACGG AGCCAAGGGC TTCAGCCTGA 3 60. 

GCGACGTGCC CCAGGCGGAG ATCTCGGGTG AGCACCTGCG GATCTGTCCC CAGGGCTACA 420 

CCTGCTGCAC CAGCGAGATG GAGGAGAACC TGGCCAACOG CAGCCATGCC GAGCTGGAGA 4 80 

CCGCGCTCCG GGACAGCAGC CGCGTCCTGC AGGCCATGCT TGCCACCCAG CTGCGCAGCT 540 

TCGATGACCA CTTCCAGCAC CTGCTGAACG: ACTCGGAGCG GACGCTGCAG; GCCACCTTCC 600 

CCGGCGCCTT CGGAGAGCTG TACACGCAGA ACGCGAGGGC CTTCCGGGAC CTGTACTCAG - 660 

AGCTGCGCCT GTACTACCGC GGTGCCAACC TGCACCTGGA GGAGACGCTG. GCCGAGTTCT 720 

GGGCCCGCCT GCTCGAGCGC CTCTTCAAGC AGCTGCACCC CCAGCTGCTG CTGCCTGATG 780 

ACTACCTGGA CTGCCTGGGC AAGCAGGCOG. AGGCGCTGCG GCCCTTCGGG GAGGCCCCGA 840 

GAGAGCTGCG . CCTGCGGGCC ACCCGTGCCT TCGTGGCTGC TCGCTCCTTT GTGCAGGGCC 900 

TGGGCGTGGC CAGCGACGTG GTCCGGAAAG- TGGCTCAGGT CCCCCTGGGC CCGGAGTGCT 960 

CGAGAGCTGT CATGAAGCTG GTCTACTGTG CTCACTGCCT GGGAGTCCCC GGCGCCAGGC 1020 

CCTGCCCTGA CTATTGCCGA AATGTGCTCA AGGGCTGCCT TGCCAACCAG GCCGACCTGG 1080 

ACGCCGAGTG GAGGAACCTC CTGGACTCCA TGGTGCTCAT CACCGACAAG TTCTGGGGTA 1140 

CATCGGGTGT GGAGAGTGTC ATCGGCAGCG TGCACACGTG GCTGGCGGAG GCCATCAACG 1200 

CCCTCCAGGA- CAACAGGGAC AOGCTCACGG CCAAGGTCAT CCAGGGCTGC GGGAACCCCA ; 1260 

AGGTCAACCC CCAGGGCCCT GGGCCTGAGG AGAAGCGGCG CCGGGGCAAG CTGGCCCCGC 1320 

GGGAGAGGCC ACCTTCAGGC ACGCTGGAGA AGCTGGTCTC TGAAGCCAAG GCCCAGCTCC 1380 

GCGACGTCCA GGACTTCTGG ATCAGCCTCC CAGGGACACT GTGCAGTGAG AAGATGGCCC. 1440 

TGAGCACTGC CAGTGATGAC CGCTGCTGGA ACGGGATGGC CAGAGGCCGG TACCTCCCCG 1500 

AGGTCATGGG TGACGGCCTG. GCCAACCAGA TCAACAACCC CGAGGTGGAG " GTGGACATCA 1560 

CCAAGCCGGA CATGACCATC CGGCAGCAGA TCATGCAGCT GAAGATCATG ACCAACCGGC 1620 

TGCGCAGCGC CTACAACGGC AACGACGTGG ACTTCCAGGA CGCCAGTGAC GACGGCAGCG 1680 

GCTCGGGCAG CGGTGATGGC TGTCTGGATG ACCTCTGCGG CCGGAAGGTC AGCAGGAAGA 1740 

GCTCCAGCTC CCGGACGCCC TTGACCCATG CCCTCCCAGG CCTGTCAGAG CAGGAAGGAC 1800 

AGAAGACCTC GGCTGCCAGC TGCCCCCAGC CCCCGACCTT CCTCCTGCCC CTCCTCCTCT 1860 

TCCTGGCCCT TACAGTAGCC AGGCCCCGGT GGCGGTAACT GCCCCAAGGC CCCAGGGACA 1920 

GAGGCCAAGG ACTGACTTTG CCAAAAATAC AACACAGACG ATATTTAATT CACCTCAGCC 1980 

TGGAGAGGCC TGGGGTGGGA CAGGGAGGGC CGGCGGCTCT GAGCAGGGGC AGGCGCAGAG 2040 

GTCCCAGCCC CAGGCCTGGC CTCGCCTGCC TTTCTGCCTT TTAATTTTGT ATGAGGTCCT 2100 

CAGGTCAGCT GGGAGCCAGT- GTGCCCAAAA GCCATGTATT TCAGGGACCT CAGGGGCACC 2160 

TCCGGCTGCC TAGCCCTCCC CCCAGCTCCC TGCACCGCCG CAGAAGCAGC CCCTCGAGGC 2220 

CTACAGAGGA GGCCTCAAAG CAACCCGCTG GAGCCCACAG CGAGCCTGTG CCTTCCTCCC 2280 

CGCCTCCTCC CACTGGGACT CCCAGCAGAG CCCACCAGCC AGCCCTGGCC CACCCCCCAG 2340 

CCTCCAGAGA AGCCCCGCAC GGGCTGTCTG GGTGTCCGCC ATCCAGGGTC TGGCAGAGCC 2400 

TCTGAGATGA TGCATGATGC CCTCCCCTCA GCGCAGGCTG CAGAGCCCGG CCCCACCTCC 2460 

CTGCGCCCTT GAGGGGCCCC AGCGTCTGCA GGGTGACGCC TGAGACAGCA CCACTGCTGA 2520 

GGAGTCTGAG. GACTGTCCTC CCACAGACCC TGCAGTGAGG .GGCCCTCCAT GCGCAGATGA 2580 

GGGGCCACTG ACCCACCTGC GCTTCTGCTG GAGGAGGGGA AGCTGGGCCC. AAAGGCCCAG 2640 

GGAGGCAGCG TGGGCTCTGC CAATGTGGGC TGCCCCTCGC ACACAGGGCT CACAGGGCAG 2700 

GCCTTGCTGG GGTCCAGGGC TGTTGGAGGA CCCCGAGGGC TGAGGAGCAG CCAGGACCCG 276.0 

CCTGCTCCCA- TCCTCACCCA GATCAGGAAC CAGGGCCTCC CTGTTCACGG TGACACAGGT 2820 

CAGGGCTCAG AGTGACCCTC GGCTGTCACC TGCTCACAGG GATGCTGGTG GCTGGTGAGA 2880 

CCCCGCACTG CACACGGGAA. TGCCTAGGTC CCTTCCCGAC CCAGCCAGCT GCACTGCAGG 2940 

GCACGGGGAC CTGGATAGTT AAGGGCTTTT CCAAACATGC ATCCATTTAC TGACACTTCC 3000 

TGTCCTTGTT CATGGAGAGC TGTTCGCTCC TCCCAGATGG CTTOGGAGGC CCGCAGGGCC 3060 

CACCTTGGAC CCTGGTGACC TCCTGTCACT CACTGAGGCC ATCAGGGCCC TGCCCCAGGC 3120 
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■CTGGACGGGC CCTCCTTCCC TCCTGTGCCC* CAGCTGCCAG GTGGCCCTGG GGAGGGGTGG 3180 

TGTGGTGTTG GGAAGGGGTC CTGCAGGGGG AGGAGGACTT GGAGGGTCTG GGGGCAGCTG 3240- 

TCCTGAACCG ACTGACCCTG. AGGAGGCCGC TTAGTGCTGC TTTGCTTTTC ATCACCGTCC 3300 

CGCACAGTGG ACGGAGGTCC CCGGTTGCTG, GTCAGGTCCC CATGGCTTGT TCTCTGGAAC 3360 

CTGACTTTAG ATGTTTTGGG ATCAGGAGCC CCCAACACAG GCAAGTCCAC CCCATAATAA 3420 

CCCTGCCAGT GCGAGGGTGG GCTGGGGACT- CTGGCACAGT GATGCCGGGC GGCAGGACAG 3480 

CAGCACTCCC GCTGCACACA GACGGCCTAG GGGTGGOGCT CAGACCCCAC CCTAGGCTCA 3540 

TCTCTGGAAG GGGCAGCCCT GAGTGGTCAC TGGTCAGGGC AGTGGCCAAG CCTGCTGTGT 3600 

CCTTCCTCCA ■ GAAGGTCCCC CCACCGCTCA GTGTCAGOGG GTGACGTGTG TTCTTTTGAG 3660 
TCCTTGTATG AATAAAAGGC TGGAAACCTA AA 

Seq ID NO: 495 Protein sequence 
Protein Accession 8: NP^002072.1 

1. . 11 - .21 .31 41 .51 

MELRARGWWL LCAAAALVAC ARGDPASKSr" SCGEVRQIYG AKGFSDSDVP QAEISGEHLR 60 

ICPQGYTCCT SEMEENLANR SHAELETALR DSSRVLQAML ATQLRSFDDH FQHLLNDSER 120 

TLQATPPGAF GELYTQNARA FRDLYSELRL YYRGANLHLE ETLAEFWARL LERLFKQLHP 1B0 

QLLLPDDYLD CLGXQAEALR' PFGEAPRELR LRATRAFVAA RSFVQGLGVA SDWRXVAQV 240 

PLGPECSRAV MKLVYCAHCL GVPGARPCPD YCRNVLKGCL ANQADLDAEW RNLLDSMVLI 300 

TDKFWGTSGV- ESVIGSVHTW LAEAINALQD" NRDTLTAKVI QGCGNPKVNP QGPGPEEKRR- 360 

RGKLAPRERP, PSGTLEKLVS EAKAQLRDVQ/ DFWISLPGTL CSEKMALSTA ■ SDDRCWNGMA 420 

RGRYLPEVMG DGLANQINNP EVEVDITKPD MTIRQQIMQL KIMTNRLRSA YNGNDVDFQD 480 
ASDDGSGSGS- 

Seq. ID NO: 496. DNA sequence 

Nucleic Acid Accession &: NM_00165O.2 

Coding sequence: 40.1011 

1 11 i 1 r r r 

GGGGCAGGCA ATGAGAGCTG CACTCTGGCT GGGGAAGGCA TGAGTGACAG ACCCACAGCA 60 

AGGCGGTGGG GTAAGTGTGG- ACCTTTGTGT ACCAGAGAGA ACATCATGGT GGCTTTCAAA 120 

GGGGTCTGGA CTCAAGCTTT CTGGAAAGCA GTCACAGCGG AATTTCTGGC CATGCTTATT 180 

TTTGTTCTCC TCAGCCTGGG ATCCACCATC AACTGGGGTG GAACAGAAAA- GCCTTTACCG - 240 

GTCGACATGG TTCTCATCTCf CCTTTGCTTT GGACTCAGCA TTGCAACCAT. GGTGCAGTGC 300 

TTTGGCCATA TCAGCGGTGG CCACATCAAC CCTGCAGTGA CTGTGGCCAT GGTGTGCACC 360 
AGGAAGATCA GCATCGCCAA GTCTGTCTTC TACATCGCAG 
ATTGGAGCAG GAATCCTCTA- TCTGGTCACA- CCTCCCAGTG 
ACCATGGTTC ATGGAAATCT TACCGCTGGT CATGGTCTCC 
TTTCAATTGG TGTTTACTAT CTTTGCCAGC TGTGATTCCA 
TCAATAGCTT TAGCAATTGG ATTTTCTGTT GCAATTGGAC 

ACTGGTGCCA GCATGAATCC CGCCCGATCC TTTGGACCTG 

GAAAACCATT- GGATAT ATTG . GGTTGGGCCC ATCATAGGAG CTGTC CTCGC TGGTGGCCTT 780 

TATGAGTATG TCTTCTGTCC AGATGTTGAA TTCAAACGTC GTTTTAAAGA* AGCCTTCAGC 840 

AAAGCTGCCC AGCAAACAAA- AGGAAGCTAC ATGGAGGTGG AGGACAACAG • GAGTCAGGTA 900 

GAGACGGATG ACCTGATTCT AAAACCTGGA GTGGTGCATG: TGATTGACGT- TGACCGGGGA 960 

GAGGAGAAGA AGGGGAAAGA CCAATCTGGA GAGGTATTGT. CTTCAGTATG ACTAGAAGAT 1020 

CGCACTGAAA GCAGACAAGA CTCCTTAGAA CTGTCCTCAG ATTTCCTTCC ACCCATTAAG 1080 

GAAACAGATT TGTTATAAAT TAGAAATGTG CAGGTTTGTT GTTTCATGTC ATATTACTCA 1140 

GTCTAAACAA TAAATATTTC ATAATTTACA' AAGGAGGAAC GGAAGAAACC TATTGTGAAT 1200 

TCCAAATCTA- AAAAAAGAAA TATTTTTAAG ATGTTCTTAA GCAAATATAT ACCTATTTTA 1260 

TCTAGTTACC TTTCATTAAC AACCAATTTT AACCGTGTGT CAAGATTTGG TTAAGTCTTG 1320 

CCTGACAGAA CTCAAAGACA CGTCTATCAG CTTATTCCTT CTCTACTGGA ATATTGGTAT 1380 
AGTCAATTCT TATTTGAATA TTTATTCTAT TAAACTGAGT TTAACAATGG C 

Seq ID NO: 497 Protein sequence 
Protein Accession. #: NP_001641.1 

ilSDRPTARRW GKCGPLCTRE NIMVAFKGVW TQAFWKAVTA EFLAMLIFVL LSI/3STINWG 60 

GTEKPIiPVDM VLISLCFGLS IATMVQCFGH. ISGGHINPAV TVAMVCTRKI SIAKSVFYIA 120 

AQCLGAIIGA GILYLVTPPS WGGtfSVTMV HGNLTAGHGL liVELIITFQL VFTIFASCDS 180 

KRTDVTGSIA LAIGFSVAIG HLFAINYTGA SMNPARSFGP AVIMGNWENH WIYWVGPIIG 240 

AVLAGGLYEY. VFCPDVEFKR RFKEAFSKAA QOTKGSYMEV EDNRSQVETD DDILKPGWH 300 
VIDVDRGEEK KGKDQSGEVL SSV : ■ . ; 

Seq ID NO: 498 DNA sequence 

Nucleic Acid. Accession Jh AB0206B4.1 

Coding sequence: 1. .1744 . 

1 • 11 - 21 . 31 . 41 51 

CCCCCTTGTC ATTAATACAT TAAAAAGATT CAATCTTTAC CCTGAGGTAA TTTTGGCCAG 60 

TTGGTACCGG ATTTATACCA AAATAATGGA CTTGATTGGT ATTCAAACCA AGATATGTTG 120 

GACGGTTACC AGAGGAG AAG GACTCAGTCC TATTGAAAGC TGTGAAGGAT TGGGAGATCC 1B0 

TGCTTGCTTT TATGTTGCTG TAATTTTTAT TTTAAATGGA CTAATGATGG CATTATTCTT 240 

CATATATGGC ACATATTTAA GTGGCAGCCG ATTAGGAGGC CTGGTTACAG TGTTGTGCTT - 300 

CTTTTTCAAT CATGGAGAGT GTACCCGTGT. AATGTGGACA CCACCTCTCC GTGAAAGCTT 360 

CTCATATCCA TTTCTTGTTC TTCAGATGTT GCTAGTGACT CATATTCTCA GGGCTACAAA 420 

ACTTTATAGA GGAAGCTTGA TTGCACTCTG CATTTCCAAT GTATTTCTCA TGCTTCCTTG 480 

GCAGTTTGCT CAGTTTGTAC TTCTTACTCA GATTGCATCA TTATTTGCAG TATATGTTGT 540 

CGGGTACATT GATATATGTA AATTACGGAA GATCATTTAT ATACACATGA TTTCTCTTGC 600 

ACTTTGTTTT- GTTTTGATGT TTGGGAACTC AATGTTATTA ACTTCTTATT ATGCTTCTTC 660 

TTTGGTAATT ATTTGGGGTA TTCTGGCAAT GAAACCACAT TTCCTGAAAA TAAATGTATC 720 



CCCAGTGCCT' GGGGGCCATC 420 

TGGTGGGAGG, CCTGGGAGTC 480 

TGGTTGAGTT GATAATCACA 540 

AACGGACTGA- TGTCACTGGC 600 

ATTTATTTGC AATCAATTAT 660 

CAGTTATCAT GGGAAATTGG 720 
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TGAACTTAGT TTATGGGTTA" TTCAAGGATG 
1 ATACTTGACA TCTAAAATTT TTGGTATTGC 
ATCAAAATTC TTTAGTTATA; AGGATTTTGA 
TGACTTTATG GAAAAAGAGA" CTCCACTGAG 
TCTTGTAGTG TTTGTTGCTA. TTGTTAGAAA 
' TAAACAACAG- ACACATGTAA GAAAACACCA 
ATTGCAATTG TTAGCATATA CAGCCCTTGG 
GACACCACAC ATGTGTGTTA' TGGCATCACT 
CTTTTGCAAA- GTACATCCTG GTGCTATTGT 
AGGTTCAGCA AATCTGCAAA CCCAGTGGAA- 
AGAAGAACTT ATAGAATGGA' TCAAATATAG 
CATGCCCACG ATGGCAAGTG TTAAGCTCTC 
TTATGAAGAC GCAGGCTTGA GAGCCAGAAC 
AGCAGCCGAA GAAGTGAAGC GAGAACTGAT 
AGAGTCATGG. TGTGTAAGAA GAT CCAAGCC 
AGAAGATCCT. GCCAATGCTG. GGAAAACTCC 
ACCTCACTTC ACCACTGTAT TCCAGAACAG 
ATGACTGCTA CATGACCTGC TGCCTACGGA 
GCTAAGTCAT GTGTTGTTCA TATCCCAAAA 
AACGTTTTAT. TTGGTCAATT TGAATGTCAT 
CAATTGGTTA CTATTTCAAT GCACCCTTTA 
• TACTTGACTT. TAATATTTGT GCTAAAGTGA' 
GGGTTGTGAC AAGGATGACA TGAAAATACA 
TATAGTGTAA GAACTATTAA- TGCCCCTTGC 
TGGACATTTC AGTGATTGTA AGTTCTTCGG 
ACAGTAGATG GGGCAGACAT GGAGTGTTTG 
CTTGTGCGCT TTTTGTTCTC TGTTCTCTTG 
CCAAACTCTT GGACCTTGTG GTTAGGAAAT 
GTGTCTCTTT CTCTCTCTCT CTTGCTCTCT 
AATAAGTACT GTTTACTCAT TTAGTTGCTT 
ATTAATGGTA ACTGTAT TTT- TCTCATTTTT 
CTTTAAACCA CTTTAAAGTT TTTTCATGTT 
ACAACCCCAA ATATAGTGCA : TCTAGAAACT 
TGGAACAGTA GACTGTAGTA CATGGTAATT 
TGACTAATTT TGCTGTCAAA AATGTAAAGA 
CTTTTAAGAT TGCCTGTCTT TAATAAGACA 1 
TCTAAAAACC ATCATTTCAG TATAAGGAAT 
TTCTTCCTAT TTATTTTTAT TTTGAAAAAT 
ATTTTTGTTT CTTAGAAGTT AATTTGTGTG- 
ATAGCTCTGA GAAAAGGTTT TAGGGTTTTA 
AGACTACACA TTTAATTATA" CAGCTTCTCT 
TGGATTGTCC TTCAGACCTT AGTCCTCAGG 
GCTGTTCCCT TTCTACCTTC TTACCAGAGC 
CAGCTTGCTG ACATAAGTCA GCTGCAAAGG 
CCCCCAGTCT TGTGCCAGCG CCTAGAGCCG 
' CAGCCCAGGA GAGGGACTCT GGCAGC GTTC 
CTGGTTGACT GTTTTTATTT CTTAGGCTTT 
TTGATTTGTG GATTAAAAGC. AACATTTGAG 
CGGTGGACAC TTGAGGCTGA GGATGGGAGT 
TGCTTATCTG TGATTGTTGC TCACCTGAGT 
ATTTTTAAAA ATTATACTTT TACATTTATT- 
TCCAAAGAAG TTCACATGTA ATAAGTAGAA 
CTATTATAAC TGCTTCATTT GCTGGGAACC 
GAAGGATCCT TTTGTAGCAG TGTTTATGAA 
TAGGGGAGCC AGTTTGGAGC AGAGGCCTGA 
TCGCAGCCCA AGCACTGTGG AGCATCCACA 
TTCCATAGGC GTACAAAACA GTATTAAAGC 
AATATTTTTG CTTTAGTATG AGGAAAGTAA- 
TTGCTACAAC ATTTTCGAAA ACAAAGTTGG 
AAAAATGCTT GGCAAAAAAA ATATAGTGTT 
GAAAGTATGT ATCAGGAATA AAGTGATATT 
ATGCCAGTTG TTTACATGTA CTATATATGT 

Seq ID KOt 499 Protein sequence 
Protein Accession 8: BAA749O0.1 



PCT/US02/12476 



TTTTTGGTTA 
AGATGACGCT 
TACTTTATTG 
ATACACAAAG 
GATTATTAGT 
GTTTGATCAT 
TATTTTAATT- 
GATCTGCTCA 
GTTTGCTATA 
TATTGTAGGG 
TACTAAACCA 
TGCACTTCGG 
AAAAATAGTA 
AAAGTTAAAA 
TGGTTGCAGT 
CTTATGTAAC 
TGTTTACAAA* 
GAACTACATC 
ACTTTTATAG 
TCTAATTATA 
AAATTTGCTA 
GCAAAGCTAC. 
GGACAATTCT 
TTCTTTTTTC 
TCATGTCAGC 
CTATATAAAA 
TTAATGAAGC 
TCCCTTAACT 
CTCTTCTCCT 
ATCAAGTACT 
AGCATTATTC 
TAATTATAGT 
AATGTATATT 
TTTCTTTTAC 
ATAATGATAA 
AAGCCTTAAG 
AAGTATATTT 
TTCTACACCT 
AAATGAGATT 
AATTCTAAGC 
TTCTTAACCA 
CATGGTTTCT 
CCAAGGGCAG 
CTGAGGAGTG 
CAGCTCCCAG 
TTCAGATTTG 
TGCTAGTTTT 
CGATGATGCA 
TGACATGAGC 
GTGGCTGATT 
TTATATTTTT 
ATTCTGTATA 
ATTAAAAGTA 
TGTAACCCCC 
AGGTCCCTGC 
CCTTTGATGG 
TCAGTGTTTT 
GGATGGGCAA 
GGCTGTATTT 
AAAATAGGCC 
GCATAGGAGT 
TAAATTAAAA 



TTTGGAACTG 
CATATTGGCA 
TATACCTGTG 
ACATTATTGC 
GATATGTGGG 
GGAGAGCTGG 
ATGAGACTAA 
AGACAGCTAT 
TTAGCAGCAA 
GAGTTCAGCA 
GATGCAGTGT 
CCCATTGTGA 
• TACTCAATGT 
GTGAACTATT 
ATGCCTGAAA 
CTCTTGGTGA 
GTCCTAGAAG 
TGTAATGGTT 
GTAACTGTTT- 
AAAATGACTT 
TGCAAATGAG 
CTGTATAAAG 
GACAATGTAG 
TGCCTCTTGC 
CCCTGTCATC 
CTATCTGTTT 
TTTTCCTGCC 
TCCAGCCATA 
CTTCCCCATA 
TATTCTTGGT 
AAATGTTTAT 
TTTAAGAAAA 
TGATTAGACA 
TATTAAGATA 
ATGGAGTTTT 
CCTTATGTTA 

TCTTTGAATT 
CTTCAAAACG 
AAAGCGTGAC 
CAGGCAGATT 
GGTGCCCACT 
GCCTGGTCCC 
TGCCCTCAGA 
GGATGCTCCT 
TGGCCACTGT 
AGAAAATAGG 
CAACAGTCCA 
AGGGAGAGGG 
GTGTACATCC 
CTCACCCCCA 
GGAAAAAAGC 
ATATAAATTA 
AGCAAAATAT 
TATGCAGCCG 
CAATGCAGAT 
GCATATTGTT 
AGAAGCGATC 
CTTTAAAAAG 
AGTGATATTA 
ATTGTATTTT 
AAAATCATGA 



PLVINTLKRF 
ACFYVAVIFI 
SYPFLVLQML 
GYIDICKDRK 
ELSLWVIQGC 
DPKSKETPLR 
LQLLAYTALG 
GSANLQTQWN 
YEDAGLRART 
EDPANAGKTP 



11 
I 

NLYPEVILAS 
LNGLMMALFF 
LVTHILRATK 
IIYIHMISLA 
PWLFGTVILK 
YTKTLLLPW 
ILIMRLKLFL 
IVGEFSNLPQ 
KIVYSMYSRK 
LCNLLVKDSK 



21 
I 

WYRIYTKIMD 
IYGTYLSGSR 
LYRGSLIAXjC 
LCFVLMFGNS 
YLTSKIFGIA 
LWFVAIVRK 
TPHMCVMASL 
EBLIEWIKYS 
AAEEVKRELI 
PHFTTVFQNS 



31 
I 

LIGIQTKICW 
LGGLVTVLCF 
ISNVFFMLPW 
MLLTSYYASS 
DDAHIGNLLT 
IISDMWGVLA 
ICSRQLFGWL 
TKPDAVFAGA 
KLKVNYYILE 
VYKVLEWKE 



41 
I 

TVTRGEGLSP 
FFNHGECTRV 
QFAQFVLLTQ 
LVIIWGILAM 
SKFFSYKEFD 
KQQTHVRKHQ 
FCKVHPGAIV 
MPTMASVKLS 
ESWCVRRSKP 



TCATACTTAA 
ACTTACTAAC 
CAGCGGAGTT 
TTCCAGTTGT 
GTGTCTTAGC 
TTTACCATGC 
AACTCTTCTT 

ttggatggct" 

tgtcaataca 

atttgcccca 

ttgcgggtgc 

atcatccaca 

atagtcggaa ; 

acattctaga 

tttgggatgt 

aggattccaa 

ttgtaaaaga 

TTAATGTTTT 
TCAAATAGAA 
ACACCTTTAT 
TATATGCTTG 
AAAACACAGT 
GGGCTGATTT 
TCTTGTCTTT 
AACTTGAGTT 
GTTTTACTTC 
CATTATTAAT 
TGGCATTATC 
.TTTTCTGTCA 
TTTAAAAAAA 
ATTTTAATAC 
ACTATTTTGA 
TCATTTATAG- 
CAATAAAACA 
TTATATTTTA 
TAATTTTGGT 
TTTAGTTTTT 
CCTTGTATGA 
ATGAAACCTC 
TATGGCTGAC 
AACCTCATTG . 
CCTGGAAGCC 
GGGGAAGCAG = 
GAAGCACCGC 
TCCCTGGAGG 
TTCTCATTTG 
GAAGCAGCCC 
GGAAAATGGG 
AGGTGCGCGC 
AGCAGTTACA 
GTAATTTCCT 
ATTAAAAATA 
GCTTTTTCCA 
GGCTATATAT 
TGGCCACAGC. 
TGGTAGCAGG 
AGCATTTACA 
AAAATAGCTA 
ATAAGCCTCT 
ATGAGAAAAT 
TATGAATTTT 
GAAATG 



51- 
I 

IESCEGLGDP 
MWTPPLRESF 
IASIiFAVYW 
KPHFLKINVS 
TLLYTCAAEF 
FDHGELVYHA 
FAILAAMSIQ 
ALRPIVNHPH 
GCSMPEIWDV 



780 
840' 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320. 
1360 
.1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 

2 940 
3000 
3060 
3120* 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 

3 960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seq ID NO: 500 DNA sequence 

Nucleic Acid Accession ft: NM_00i276.1 

Coding sequence: 12 7.; 127 8 



41 



51 



1 • ix- ' ' .21 .31 

] I* II I I 

AGTGGAGTGG GACAGGTATA TAAAGGAAGT ACAGGGCCTG GGGAAGAGGC CCTGTCTAGG 
TAGCTGGCAC CAGGAGCCGT GGGCAAGGGA AGAGGCCACA CCCTGCCCTG CTCTGCTGCA 



60 
120 



376 



W,0 02/086443 /. . " ■ • 1 

GCCAGAATGG GTGTGAAGGC GTCTCAAACA GGCTTTGTGG TCCTGGTGCT GCTCCAGTGC - 180 

TGCTCTGCAT ACAAACTGGT CTGCTACTAC ACCAGCTGGT CCCAGTACCG GGAAGGCGAT 240 

GGGAGCTGCT TCCCAGATGC CCTTGACCGC TTCCTCTGTA CCCACATCAT CTACAGCTTT 300 ' 

GCCAATATAA GCAACGATCA CATCGACACC TGGGAGTGGA ATGATGTGAC GCTCTACGGC 360 

ATGCTCAACA" CACTCAAGAA CAGGAACCCC AACCTGAAGA CTCTCTTGTC TGTCGGAGGA 420 

TGGAACTTTG GGTCTCAAAG ATTTTCCAAG. ATAGCCTCCA ACACCCAGAG TCGCOGGACT 480 

TTCATCAAGT CAGTACCGCC ATTCCTGCGC ACCCATGGCT TTGATGGGCT GGACCTTGCC 540 

TGGCTCTACC CTGGACGGAG AGACAAACAG CATTTTACCA CCCTAATCAA GGAAATGAAG 600 

GCCGAATTTA TAAAGGAAGC CCAGCCAGGG . AAAAAGCAGC TCCTGCTCAG OGCAGCACTG 660 

rCTGCGGGGA AGGTCACCAT TGACAGCAGC TATGACATTG OCAAGATATC CCAACACCTG 720 

GATTTCATtA GCATCATGAC CTACGATTTT CATGGAGCCT GGCGTGGGAC CACAGGCCAT 780 

CACAGTCCCC TGTTCCGAGG TCAGGAGGAT GCAAGTCCTG. ACAGATTCAG CAACACTGAC B40 

TATGCTGTGG ' GGTACATGTT GAGGCTGGGG GCTCCTGCCA GTAAGCTGGT GATGGGCATC 900 

CCCACCTTCG GGAGGAGCTT CACTCTGGCT TCTTCTGAGA CTGGTGTTGG AGCCCCAATC 960 

TCAGGACCGG GAATTCCAGG CCGGTTCACC AAGGAGGCAG GGACCCTTGC CTACTATGAG 1020 

ATCTGTGACT " TCCTCCGCGG AGCCACAGTC -CATAGAACCC TCGGCCAGCA GGTCCCCTAT 1080 

GCCACCAAGG GCAACCAGTG GGTAGGATAC GACGACCAGG AAAGCGTCAA AAGCAAGGTG 114,0 

CAGTACCTGA AGGATAGGCA GCTGGCAGGC GCCATGGTAT GGGCCCTGGA CCTGGATGAC 1200 

TTCCAGGGCtf CCTTCTGCGG CCAGGATCTG. CGCTTCCCTC TCACCAATGC CATCAAGGAT 1260 

GCACTCGCTG- CAACGTAGCC CTCTGTTCTG CACACAGCAC. GGGGGCCAAG • GATGCCCCGT 132 0 

CCCCCTCTGG: CTCCAGCTGG CCGGGAGCCT GATCACCTGC CCTGCTGAGT CCCAGGCTGA 1380 

GCCTCAGTCT CCCTCCCTTG GGGCCTATGC AGAGGTCCAC AACACACAGA TTTGAGCTCA 1440 

GCCCTGGTGG GCAGAGAGGT AGGGATGGGG CTGTGGGGAT AGTGAGGCAT. CGCAATGTAA 1500 

GACTCGGGAT TAGTACACAC TTGTTGATGA TTAATGGAAA TGTTTACAGA TCCCCAAGCC 1560 

TGGCAAGGGA ATTTCTTCAA- CTCCCTGCCC CCTAGCCCTC CTTATCAAAG, GACACCATTT 1620 - 

TGGCAAGCTC TATCACCAAG GAGCCAAACA TCCTACAAGA" CACAGTGACC . ATACTAATTA- 1680- 

TACCCCCTGC AAAGCCAGCT TGAAACCTTC ACTTAGGAAC GTAATCGTGT CCCCTATCCT 1740 

ACTTCCCCTT CCTAATTCCA' CAGCTGCTCA ATAAAGTACA AGAGTTTAAC AGTGTGTTGG 1B00 

CGCTTTGCTT TGGTCTATCT ■ TTGAGCGCCC ACTAGACCCA CTGGACTCAC CTCCCCCATC 1860 

TCTTCTGGGT TCCTTCCTCT GAGCCTTGGG ACCCCTGAGC TTGCAGAGAT GAAGGCCGCC 1920 
ATGTT 

Seq ID NO: 501 Protein sequence 
Protein Accession #: NP_001267:i- 

1 11 21 1 ■ 31 |1 ^ 

IgVKASQTGF VVLVhLQCCS AYKLVCYYTS VtSQYREGDGS CFPDALDRFL CTHIIYSFAN 60 

ISNDHIDTWB WNDVTLYGML NTLKNRNPNIi KTLLSVGGWN FGSQRFSKIA SNTQSRRTFI 120 

KSVPPFLRTH GFDGLDLAWL YPGRRDKQHF TTLIKEMKAE FIKEAQPGKK QLLLSAALSA 180 

GKVTIDSSYD IAK1SQHLDF ISIMTYDFHG. AWRGTTGHHS PIiFRGQEDAS PDRFSNTDYA 240 

VGYMLRLGAP ASKLVMGIPT FGRSFTLASS BTGVGAPISG PGIPGRFTKE AGTLAYYBIC 300 

DFLRGATVHR TLGQQVPYAT KGNQWVGYDD QESVKSKVQY LKDRQLAGAM VWALDLDDFQ 360 
GSFCGQDLRF PLTNAIKDAL AAT 

Seq ID NO: 502 DNA sequence 

Nucleic Acid Accession #: NM_006474.1 

Coding sequences 181.. 66 9 

1 11 -21" 31 41 51 

GCTGCCTAGG GTCTGGAAAG CTCGGGCACC CTCCCTCTCC GGGGCTCCTG CTCCCACCCC 60 

TCCGGCCCCC CCACCGTCX3C GCTCCTCCAG GCTGGGCCTG TGGCCGCGGT GCTTTTAATT 120 

TTCCCCCAGC fCAGAATCTT GCTGCTCGGC CCCCAGGAGA GCAACAACTC AACGGGAACG 180 
ATGTGGAAGG TGTCAGCTCT GCTCTTCGTT TTGGGAAGCG 
GAAGGAGCCA GCACAGGCCA GCCAGAAGAT GACACTGAGA 
GTTGCCATGC CAGGTGCCGA AGATGATGTG GTGACTCCAG 
AAGTCTGGCT TGACAACTCT. GGTGGCAACA AGTGTCAACA 
GAGGATCTGC CAACTTCAGA AAGCACAGTC CACGCGCAAG 

GCCTCAAACG TGGCCACCAG TCACTCCACG GAGAAAGTGG ATGGAGACAC ACAGACAACA 540 

GTTGAGAAAG ATGGTTTGTC AACAGTGACC CTGGTTGGAA TCATAGTTGG GGTCTTACTA 600 

GCCATCGGTT TCATTGGTGG AATCATCGTT GTGGTTATGC GAAAAATGTC GGGAAGGTAC 660 

TCGCCCTAAA GAGCTGAAGG GTTACGCCCT GCTTGCCAAC GTGCTTTAAA AAAAGACCGT 720 

TTCTGACTCT GTGGCCCTGT CCCTGAGCTC GTGGGGAGAA GATGACCCTG GGAACATTTG 780 

CGGGCCCATT CAGATTCCAC GGTGACTTTC CGTTTGCCAA ATTAACCGAG GAAAGACCTT B40 
TCACCAGATT TGGTTCTTAA ACTTT 

Seq ID NO: 503 Protein sequence 
Protein Accession #» NP__006 465.1 

1 11 21 31 41 51 

MWKVSALLFV LGSASLWVLA EGASTGQPED DTETTGLEGG VAMPGAEDDV VTPGTSEDRY 60 

KSGLTTLVAT SVNSVTGIRI' EDLPTSESTV HAQEQSPSAT ASNVATSHST EKVDGDTQTT 120 
VEKDGLSTVT LVGIIVGVLL .. AIGFIGGIIV WMRKMSGRY SP 

Seq ID NO: '504 DNA sequence . 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 62.-895 

1 11 * 21 31 41 51 

CACTGCTCTG AGAATTTGTG AGCAGCCCCT AACAGGCTGT TACTTCACTA CAACTGACGA 60 

TATGATCATC TTAATTTACT TATTTCTCTT GCTATGGGAA GACACTCAAG GATGGGGATT 120 

CAAGGATGGA ATTTTTCATA ACTCCATATG GCTTGAACGA GCAGCCGGTG TGTACCACAG 180 

AGAAGCACGG -TCTGGCAAAT ACAAGCTCAC CTACGCAGAA GCTAAGGCGG TGTGTGAATT 240 

TGAAGGCGGC CATCTCGCAA CTTACAAGCA GCTAGAGGCA GCCAGAAAAA TTGGATTTCA 300 



CGTCGCTCTG GGTCCTGGCA 240 

CTACAGGTTT GGAAGGCGGC 300 

GAACCAGOGA AGACCGCTAT 360 

GTGTAACAGG CATTCGCATC 420 

AACAAAGTCC AAGCGCCACA 480 



377 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
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TGTCTGTGCT GCTGGATGGA TGGCTAAGGG CAGAGTTGGA TACCCCATTG TGAAGCCAGG 360 

GCGCAACTGT GGATTTGGAA AAACTGGCAT TATTGATTAT GGAATCCGTC TCAATAGGAG .420 

- TGAAAGATGG GATGCCTATT GCTACAACCC ACAOGCAAAG GAGTGTGGTG GCGTCTTTAC 480 . 

AGATCCAAAG CAAATTTTTa' AATCTCCAGG CTTCCCAAAT GAGTACGAAG ATAACCAAAT 540 

CTGCTACTGG CACATTAGAC TCAAGTATGG TCAGCGTATT CACCTGAGTT TTTTAGATTT 600 

TGACCTTGAA GATGACCCAG . GTTGCTTGGC TGATTATGTT GAAATATATG ACAGTTACGA 660 

TGATGTCCAT. GGCTTTGTGG GAAGATACTG TGGAGATGAG CTTCCAGATG ACATCATCAG 720 

■ TACAGGAAAT GTCATGACCT TGAAGTTTCT AAGTGATGCT- TCAGTGACAG CTGGAGGTTT 780 

■ CCAAATCAAA TATGTTGCAA TGGATCCTGT ATCCAAATCC AGTCAAGGAA AAAATACAAG 840 
TACTACTTCT ACTGGAAATA AAAACTTTTT AGCTGGAAGA TTTAGCCACT TATAAAAAAA - 900 

' AAAAAAAGGA TGATCAAAAC ' ACACAGTGTT TATGTTGGAA TCTTTTGGAA CTCCTTTGAT 960 

CTCACTGTTA TTATTAACAT TTATTTATTA TTTTTCTAAA TGTGAAAGCA ATACATAATT 1020 

TAGGGAAAAT TGGAAAATAT AGGAAACTTT AAACGAGAAA ATGAAACCTC TCATAATCCC 1080 

ACTGCATAGA AATAACAAGC GTTAACATTT TCATATTTTT TTCTTTCAGT CATTTTTCTA 1140 

TTTGTGGTAT ATGTATATAT GTACCTATAT GTATTTGCAT TTGAAATTTT GGAATCCTGC 1200 

TCTATGTACA GTTTTGTATT ATACTTTTTA AATCTTGAAC TTTATAAACA TTTTCTGAAA 1260 

TCATTGATTA TTCTACAAAA ACATGATTTT AAACAGCTGT AAAATATTCT ATGATATGAA 1320: 

TGTTTTATGC ATTATTTAAG CCTGTCTCTA TTGTTGGAAT TTCAGGTCAT TTTCATAAAT 1380 
ATTGTTGCAA TAAATATCCT- TGAACACACA AAAAAAAAAA AA 

Seq ID NO:- 505 Protein "sequence 
Protein Accession # :; Eos sequence 

1 11 21 31 41 51 

I 1 I I I I . 

• MIILIYLFLL LWEDTQGWGF. KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEP 60 

EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNCGFGKTGI IDYGIMiNRS 120 

ERWDAYCYNP HAKECGGVFT DPKQIFKSPG 1 FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 180 

DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 240 
QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRPSHL 

Seq ID NO: 506 DNA sequence 

Nucleic Acid Accession NM__007115.1 

Coding sequence .* 69.. 902 
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i 
I 

GAATTCGCAC 
CTGACGATAT 
GGGGATTCAA' 
ACCACAGAGA 
GTGAATTTGA 
GATTTCATGT. 
AGCCAGGGCC 
ATAGGAGTGA 
TCTTTACAGA 
ACCAAATCTG 
TAGATTTTGA 
GTTACGATGA 
TCATCAGTAC 
GAGGTTTCCA 
ATACAAGTAC 
AAAAAAAAAA 
TTGATCTCAC 
TAATTTAGGG 
ATCCCACTGC 
TTGTATTTGT 
. CCTGCTCTAT 
TGAAATCATT 
ATGAATGTTT 
TAAATATTGT 



11 

I 

TGCTCTGAGA 
GATCATCTTA 
GGATGGAATT 
AGCACGGTCT 
AGGCGGCCAT 
CTGTGCTGCT. 
CAACTGATGA 
AAGATGGGAT 
TCCAAAGCGA 
CTACTGGCAC 
CCTTGAAGAT 
TGTCCATGGC 
AGGAAATGTC 
AATCAAATAT 
TACTTCTACT 
AAGGATGATC 
TGTTATTATT 
AAAATTGGAA 
ATAGAAATAA 
GGTATATGTA 
GTACAGTTTT. 
GATTATTCTA 
TATGCATTAT 
TGCAATAAAT 



21 

I 

ATTTGTGAGC 
ATTTACTTAT 
TTTCATAACT 
GGCAAATACA 
CTCGCAACTT- 
GGATGGATGG 
TTTGGAAAAA 
GCCTATTGCT 
ATTTTTAAAT 
ATTAGACTCA 
GACCCAGGTT 
TTTGTGGGAA 
ATGACCTTGA 
GTTGCAATGG 
GGAAATAAAA 
AAAACACACA 
AACATTTATT 
AATATAGGAA 
CAAGCGTTAA 
TATATGTACC 
GTATTATACT 
CAAAAACATG 
TTAAGCCTGT 
ATCCTTCGGA 



31 

I. 

AGCCCCTAAC 
TTCTCTTGCT. 
CCATATGGCT 
AGCTCACCTA 
ACAAGCAGCT 
CTAAGGGCAG 
CTGGCATTAT 
ACAACCCACA 
CTCCAGGCTT 
AGTATGGTCA 
GCTTGGCTGA 
GATACTGTGG 
AGTTTCTAAG 
ATCCTGTATC 
ACTTTTTAGC 
GTGTTTATGT 
TATTATTTTT 
ACTTTAAACG 
CATTTTCATA 
TATATGTATT 
TTTTAAATCT 
ATTTTAAACA 
CTCTATTGTT 
ATTC 



41 
I 

AGGCTGTTAC 
ATGGGAAGAC 
TGAACGAGCA 
CGCAGAAGCT 
AGAGGCAGCC 
AGTTGGATAC 
TGATTATGGA 
CGCAAAGGAG 
CCCAAATGAG 
GCGTATTCAC 
TTATGTTGAA 
AGATGAGCTT 
TGATGCTTCA 
CAAATCCAGT 
TGGAAGATTT 
TGGAATCTTT 
CTAAATGTGA 
AGAAAATGAA 
TTTTTTTCTT 
TGCATTTGAA 
TGAACTTTAT 
GCTGTAAAAT 
GGAATTTCAG 



51 
I 

TTCACTACAA 
ACTCAAGGAT 
GCCGGTGTGT 
AAGGCGGTGT 
AGAAAAATTG 
CCCATTGTGA 
ATCCGTCTCA 
TGTGGTGGCG 
TACGAAGATA 
CTGAGTTTTT 
ATATATGACA 
CCAGATGACA 
GTGACAGCTG 
CAAGGAAAAA 
AGCCACTTAT 
TGGAACTCCT 
AAGAAATACA 
ACCTCTCATA 
TCAGTCATTT 
ATTTTGGAAT 
GAACATTTTC 
ATTCTATGAT 
GTCATTTTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



Seq ID NO:" 507 Protein sequence 
Protein Accession #: NP_009046.1 



11 
I 



21 



31 



41 



51 



MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNXXFGKTGI IDYGIRLNRS 
ERWDAYCYNP HAKECGGVFT DPKRIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 
DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 
QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRFSHL 



60 
120 
180 
240 
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Seq ID NO: 508 DNA sequence 

Nucleic Acid Accession #: NM_001044. 

Coding sequence: 12 9.. 1991 



ACCGCTCCGG 
AAAGCCCAGG 
GTGTGCCCAT 
CT AAGG AGCC 
ACGGAGTGCA 
AGGATCGGGA 
TGGACCTGGC 



11 
1 

AGCGGGAGGG 
CCCGGGCGGC 
GAGTAAGAGC 
CAATGCCGTG 
GCTCACCAGC 
GACCTGGGGC 
CAACGTCTGG 



21 
I 

GAGGCTTCGC 
CAGACCAAGA 
AAATGCTCCG 
GGCCCGAAGG 
TCCACCCTCA 
AAGAAGATCG 
CGGTTCCCCT 



31 
I 

GGAACGCTCT 
GGGAAGAAGC 
TGGGACTCAT 
AGGTGGAGCT 
CCAACCCGCG 
ACTTTCTCCT 
ACCTGTGCTA 



41 
I 

CGGCGCCAGG 
ACAGAATTCC 
GTCTTCCGTG 
CATCCTTGTC 
GCAGAGCCCC 
GTCCGTCATT 
CAAAAATGGT 



51 

I 

ACTCGCGTGC 
TCAACTCCCA 
GTGGCCCCGG 
AAGGAGCAGA 
GTGGAGGCCC 
GGCTTTGCTG 
GGCGGTGCCT 



60 
120 
180 
240 
300 ■ 
360 
420 



378 
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* TCCTGGTCCC CTACCTGCTC TTCATGGTCA TTGCTGGGAT GCCACTTTTC TACATGGAGC 480 
"TGGCCCTCGG CCAGTTCAAC AGGGAAGGGG CCGCTGGTGT CTGGAAGATC TGCCCCATAC 540 

. TGAAAGGTGT .GGGCTTCACG GTCATCCTCA TCTCACTGTA TGTCGGCTTC TTCTACAACG 600 

. TCATCATCGC CTGGGCGCTG CACTATCTCT TCTCCTCCTT CACCACGGAG CTCCCCTGGA 660 

TCCACTGCAA CAACTCCTGG AACAGCCCCA ACTGCTCGGA TGCCCATCCT GGTGACTCCA 720 

GTGGAGACAG- CTCGGGCCTC. AACGACACTT TTGGGACCAC ACCTGCTGCC GAGTACTTTG 780 

AACGTGGCGT. GCTGCACCTC CACCAGAGCC: ATGGCATCGA CGACCTGGGG CCTCCGCGGT 840 

GGCAGCTCAC AGCCTGCCTG .GTGCTGGTCA TCGTGCTGCT CTACTTCAGC CTCTGGAAGG 900 

GCGTGAAGAC CTCAGGGAAG. GTGGTATGGA TCACAGCCAC CATGCCATAC GTGGTCCTCA 960. 

CTGCCCTGCT CCTGCGTGGG; GTCACCCTCC CTGGAGCCAT. AGACGGCATC AGAGCATACC 102 0 

• TGAGCGTTGA- CTTCTACCGG CTCTGCGAGG CGTCTGTTTG. GATTGACGCG GCCACCCAGG 1080 
TGTGCTTCTC CCTGGGCGTG GGGTTCGGGG TGCTGATCGC CTTCTCCAGC TACAACAAGT 1140 
TCACCAACAA- CTGCTACAGG GACGCGATTG TCACCACCTC CATCAACTCC CTGACGAGCT 1200 
TCTCCTCCGG CTTCGTCGTCL TTCTCCTTCC TGGGGTACAT GGCACAGAAG CACAGTGTGC 1260 
CCATCGGGGA CGTGGCCAAG- GACGGGCCAG- GGCTGATCTT CATCATCTAC CCGGAAGCCA 1320 
TCGCCACGCT CCCTCTGTCC. TCAGCCTGGG CCGTGGTCTT CTTCATCATG CTGCTCACCC 1380 
TGGGTATCGA-; CAGCGCCATG GGTGGTATGG: AGT CAGTGAT- CACCGGGCTC ATCGATGAGT . 1440 
TCCAGCTGCT- GCACAGACAC CGTGAGCTCT. TCACGCTCTT CATCGTCCTG" GCGACCTTCC 1500 
TCCTGTCCCT- GTTCTGCGTC ACCAACGGTG GCATCTACGT CTTCACGCTC CTGGACCATT 1560 
TTGCAGCCGG- CACGTCCATC CTCTTTGGAG TGCTCATCGA AGCCATCGGA- GTGGCCTGGT 1620 
TCTATGGTGT TGGGCAGTTC AGCGACGACA TCCAGCAGAT • GACCGGGCAG CGGCCCAGCC 1680 
TGTACTGGCG GCTGTGCTGG AAGCTGGTCA GCCCCTGCTT TCTCCTGTTC GTGGTCGTGG 1740 
TCAGCATTGT GACCTTCAGA CCCCCCCACT ACGGAGCCTA CATCTTCCCC GACTGGGCCA 1800 
ACGCGCTGGG. CTGGGTCATC GCCACATCCT CCATGGCCAT GGTGCCCATC TATGCGGCCT 1860 
ACAAGTTCTG CAGCCTGCCT GGGTCCTTTC GAGAGAAACT GGCCTACGCC ATTGCACCCG 1920 
AGAAGGACCG . TGAGCTGGTG GACAGAGGGG AGGTGCGCCA GTTCACGCTC CGCCACTGGC. 1980 
TCAAGGTGTA GAGGGAGCAG AGACGAAGAC CCCAGGAAGT CATCCTGCAA TGGGAGAGAC _ 2040 
ACGAACAAAC CAAGGAAATC TAAGTTTCGA GAGAAAGGAG GGCAACTTCT. ACTCTTCAAC 2100 
CTCTACTGAA: AACACAAACA ACAAAGCAGA AGACTCCTCT CTTCTGACTG. TTTACACCTT ■ 2160 
TCCGTGCCGG GAGCGCACCT CGCCGTGTCT- TGTGTTGCTG TAATAACGAC GTAGATCTGT 2220 
GCAGCGAGGT CCACCCCGTT GTTGTCCCTG CAGGGCAGAA AAACGTCTAA CTTCATGCTG 2280 
TCTGTGTGAGl GCTCCCTCCC TCCCTGCTCC CTGCTCCCGG CTCTGAGGCT GCCCCAGGGG 2340 
CACTGTGTTC TCAGGCGGGG ATCACGATCC TTGTAGACGC ACCTGCTGAG. AATCCCCGTG 2400 
CTCACAGTAG ' CTTCCTAGAC CATTTACTTT GCCCATATTA AAAAGCCAAG TGTCCTGCTT 2460 
GGTTTAGCTG TGCAGAAGGT- GAAATGGAGG AAACCACAAA TTCATGCAAA GTCCTTTCCC 2520 
GATGCGTGGC TCCCAGCAGA GGCCGTAAAT TGAGCGTTCA GTTGACACAT TGCACACACA 2580 
GTCTGTTCAG AGGCATTGGA GGATGGGGGT CCTGGTATGT CTCACCAGGA AATTCTGTTT - 2640 
ATGTTCTTGC AGCAGAGAGA AATAAAACTC. CTTGAAACCA GCTCAGGCTA CTGCCACTCA • 2700 
GGCAGCCTGT- GGGTCCTTGT GGTGTAGGGA ACGGCCTGAG AGGAGCGTGT CCTATCCCCG ■■ 2760 
GACGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG GACGCATGCA GGGCCCCCAC 2820 
AGGAGCATGT CCTATCCCTG GACGCATGCA GGGCCCCCAC. AGGAGCGTGT- ACTACCCCAG 2880 
AACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC 2940 
TGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG 3000 
GACCGGACGC ATGCAGGGCC CCCACAGGAG . CGTGTACTAC CCCAGGACGC ATGCAGGGCC 3060 
CCCACAGGAG CGTGTACTAC CCCAGGATGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC 3120 
CCCAGGACGC ATGCAGGGCC CCCATGCAGG CAGCCTGCAG ACCAACACTC TGCCTGGCCT 3180 
TGAGCCGTGA CCTCCAGGAA GGGACCCCAC TGGAATTTTA TTTCTCTCAG' GTGCGTGCCA 3240 
CATCAATAAC AACAGTTTTT ATGTTTGCGA ATGGCTTTTT AAAATCATAT TTACCTGTGA 3300 
ATCAAAACAA ATTCAAGAAT GCAGTATCCG CGAGCCTGCT TGCTGATATT GCAGTTTTTG .3360 
TTTACAAGAA TAATTAGCAA TACTGAGTGA AGGATGTTGG CCAAAAGCTG CTTTCCATGG 3420 
CACACTGCCC TCTGCCACTG ACAGGAAAGT GGATGCCATA GTTTGAATTC ATGCCTCAAG 3480 
TCGGTGGGCC TGCCTACGTG CTGCCCGAGG GCAGGGGCCG- TGCAGGGCCA. GTCATGGCTG 3540 
TCCCCTGCAA GTGGACGTGG GCTCCAGGGA CTGGAGTGTA" ATGCTCGGTG GGAGCCGTCA 3600 
GCCTGTGAAC. TGCCAGGCAG CTGCAGTTAG CACAGAGGAT GGCTTCCCCA TTGCCTTCTG- 3660 
GGGAGGGACA CAGAGGACGG CTTCCCCATC GCCTTCTGGC CGCTGCAGTC AGCACAGAGA 3720 
GCGGCTTCCC CATTGCCTTC TGGGGAGGGA CACAGAGGAC AGTTTCCCCA TCGCCTTCTG 3780 
GTTGTTGAAG ACAGCACAGA GAGCGGCTTC CCCATCGCCT TCTGGGGAGG GGCTCCGTGT 3840 
AGCAACCCAG GTGTTGTCCG TGTCTGTTGA CCAATCTCTA TTCAGCATCG TGTGGGTCCC 3 900 
TAAGCACAAT AAAAGACATC CACAATGGAA AAAAAAAAAG GAATTC 

Seq ID NO; 509 Protein sequence 
Protein Accession #: NP_001035.1 

1 11 21 31 41 51 

| || I I 1 

MSKSKCSVGL MSSWAPAKE PNAVGPKEVE LILVKEQNGV QLTSSTLTNP RQSPVEAQDR 60 

ETWGKKIDFL LSVIGFAVDL ANVWRFPYLC YKNGGGAFLV PYLLFMVIAG MPLFVMELAL 120 

GQFNREGAAG VWKICPIUCG VGFTVII,ISL YVGFFYNVII AWALHYLFSS FTTEIiPWIHC 180 

NNSWNSPNCS DAHPGDSSGD SSGLNDTFGT TPAAEYFERG VLHLHQSHGI DDIX3PPRWQL 240 

TACLVLVIVL liYFSLWKGVK TSGKWWITA TMPYWLTAL- LLRGVTLPGA IDGIRAYLSV 300 

DFYRLCEASV. WIDAATQVCP SLGVGFGVIiI AFSSYNKFTN NCYRDAIVTT SINSLTSFSS 360 

GFWFSFLGY MAQKHSVPIG DVAKDGPGIjI FIIYPEAIAT fcPLSSAWAW FFIMIrLTLGI 420 

DSAMGGMESV ITGLIDBFQL LHRHRELFTL FIVLATFLLS LFCVTNGGIY VFTLLDHFAA 480 

GTSILFGVLI EAIGVAWFYG VGQFSDDIQQ MTGQRPSLYW RLCWKLVSPC FLLFWWSI 540 

VTFRPPHYGA YIFPDWANAL GWVIATSSMA MVPIYAAYKF CSLPGSFREK LAYAIAPEKD 600 
RELVDRGEVR QFTLRHWLKV 

Seq ID NO: 510 DNA sequence 

Nucleic Acid Accession ft: NM_001216.1 

Coding sequence:' 43.. 1422 

1 11 21 31 41 51 

| | i 1 I I 

GCCCGTACAC ACCGTGTGCT GGGACACCCC ACAGTCAGCC GCATGGCTCC CCTGTGCCCC 60 

AGCCCCTGGC TCCCTCTGTT GATCCCGGCC CCTGCTCCAG GCCTCACTGT GCAACTGCTG 120 

CTGTCACTGC TGCTTCTGAT GCCTGTCCAT CCCCAGAGGT TGCCCCGGAT GCAGGAGGAT 180 

TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA GATGACCCAC TGGGCGAGGA GGATCTGCCC 240 
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AGTGAAGAGG ATTCACCCAG AGAGGAGGAT CCACCCGGAG AGGAGGATCT ACCTGGAGAG 300 

G^GGATCTAC CTCGAGAGGA GGATCTACCT GAAGTTAAGC CTAAATCAGA AGAAGAGGGC 360 

^C^AGT TAGAGGATCT ACCTACTGTT. GAGGCTCCTG GAGATCCTCA AGAACCCCAG 420 

££1^ ACAGGGACAA AGAAGGGGAT OACCAGAGTC ATTGGCGC£ TKftGGCGK ™ 

CCGCCCTGGC CCCGGGTGTC CCCAGCCTGC GCGGGCCGCT TCCAGTCCCC GGTGGj^ATC 540 

CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC CTGCGCCCCC. TGGAACTCCT ^C™^ 600 

rrrCCGCGGC TCCCAGAACT GCGCCTGCGC AACAATGGCC ACAGTGTGCA ACTGACCCTG 660 

CC^^GC TaS^^GGC TCTGGGTCCC GGGCGGGAGT ACCGGGCTCT GCAGCTGCAT 720 

C^CAC^gI GGGCTGCAGG TCGTCCGGGC TCGGAGCACA CTGTGGAAGG CCACCGTTTC 180 

CCTGCCGAGA TCCACGTGGT TCACCTCAGC ACCGCCTTTG CCAGAGTTGA CGAGGCCTTG 840 

g^Scg^ Sggcctggc cgtgttggcc gcctttctgg aggagggccc ^gaaaac 9 0 

AGTGCCTATG AGCAGTTGCT GTCTCGCTTG GAAGAAATCG CTGAGGAAGG CTCAGAGACT 960 

GACTGGACAT ATCTGCACTC CTGCCCTCTG ACTTCAGCCG OHICnca* 1020 

TATGAGGGGT CTCTG ACT AC ACCGCCCTGT GCCCAGGGTG TCATCTGGAC TGTGTTTAAC 1080 

CAGACAGTGA TGCTGAGTGC T AAGCAGCTC CACACCCTCT CTGACACCCT GTGGGGACCT 1140 

GGTGACTCTC GGCTACAGCT GAACTTCCGA GCGACGCAGC CITTGAATGG GCGAGTGATT 1200 

GAGGCCTCCT TCCCTGCTGG AGTGGACAGC AGTCCTCGGG CTGCTGAGCC AGTCCAGCTG 1260 

AATTCCTGCC TGGCTGCTGG TGACATCCTA GCCCTGGTTT TTGGCCTCCT ^™CTGTC 1320 

ACCAGCGTCG CGTTCCTTGT GCAGATGAGA AGGCAGCACA GAAGGGGAAC CAAAGGGGGT 1380 

GTGAGCTACC GCCCAGCAGA- GGTAGCCGAG ACTGGAGCCT AGAGGCTGGA TCTT^AGAA 1440 

TGTGAGAAGC CAGCCAGAGG: CATCTGAGGG GGAGCCGGTA ACTGTCCTGT CCTGCTCATT 1500 
ATGCCACTTC CTTTTAACTG CCAAGAAATT TTTTAAAATA AATATTTATA AT 

Seq ID NO:' 511 Protein sequence 
Protein Accession ft: NP_D0l2O7.l 



11 21. 



31 41 51 



360 
420 



jtAPLCPSPWL PLLIPAPAPG ItVQLLLSLL IlMPVHPQRL PRMQEDSPLG GGSSGEDDPL 60 
G^LPSE^ SPREEDPPGE EDLPGEEDLP GEEDLPEVKP KSEEEGSLKI, EDLPTVEAPG. 120 
DPQEPQNNAH- RDKEGDDQSH^- WRYGGDPPWP RVSPACAGRF QSPVDIRPQL AAFCPALRPL 180 
S^FQLPPL- PSLRLRKNGH SVQLTLPPGL. EMALGPGRSY RALQLKLHWG AAGRPGSEHT.. 240 
VEGHRFPAEI HWELSTAFA RVDEALGRPG GLAVLAAFLE EGPEENSAYE QLLSRLEEIA 300 
EEGSETQVPG ^IfsD' FSRYFQYEGS. LTTPPCAQGV IWTVFNQTVM LSAKQLHTLS 
DTLWGPGDSR LQLNFRATQP LNGRVIEASF PAGVDSSPRA AEPVQLNSCL AAGDILALVF 
GLLFAVTSVA FLVQMRRQHR RGTKGGVSYR" PAEVAETGA 

Seq ID NO i 512 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence : 1 3 9 7 8 

U . ' 21 31 41 51 

Itggtgggtg AAGGACCCTA CCTTATCTCA GATCTGGACC AGCGAGGCCG GCGGAGATCC 60 

T^GCAGAAA GATATGACCC CAGCCTGAAG . ACCATGATCC CAGTGCGACC CTGTGCAAGG 120 

TTAGCACCCA' ACCCGGTGGA TGATGCCGGG CTACTCTCCT TCGCCACATT. TTCCTGGCTC 180 

ACGCCGGTGA TGGTGAAAGG CTACCGGCAA- AGGCTGACCG TAGACACCCT GCCCCCATTG- 240 

TCGACATATG ACTCATCTGA CACCAATGCC. AAAAGATTTC GAGTCCTTTG GGATGAAGAG - 300 

GTJ^CAAGGG; TGGGTCCTGA GAAGGCCTCT CTGAGCCACG TGGTGTGGAA ATTCCAGAGG 360 

ACACGCGTGT. TGATGGACAT- . CGTGGCCAAC ATCCTGTGCA TCATCATGGC AGCCATAGGG 420 

CCGACAGTTC TCATTCACCA; AATCCTCCAG CAGACTGAGA GGACCTCTGG GAAAGTCTGG 480 



CTTCTTTTGG 540 

GCTCTCCACC 600 

TGGCGAGGTG • 660 

TTGTCCTTTG. 720 

CATTCTGGGG 780 

GATGTTTATG. 840 

GCGAGTTCAG 900 

GGAGAAATCT - 960 

AAAAGCTGGA 1020 

CATCGTGCTG 1080 

ATTTAGTGTG 1140 



COliAuUiixA., luiwwi^.^ 

GTTGGCATTG GACTGTGCAT. AGCCCTTTTT- GCCACCGAGT TTACCAAAGT 
GCCCTTGCCT GGGCCATCAA • CTACCGCACG GCCATCCGGT TGAAGGTGGC 
TTGGTTTTTG AAAACCTAGT GTCCTTCAAG ACATTGACCC ACATCTCTGT 
CTCAATATAC TGTCAAGTGA- TAGCTATTCT TTGTTTGAAG CTGCCTTGTT 
CCAGCCACCA TCCCGATCCT - AATGGTCTTT. TGTGCGGCGT ACGCCTTTTT 
CCCACAGCTC TCATCGGGAT ATCAGTGTAT- GTCATATTCA TACCCGTCCA 
GCCAAGCTCA ATTCAGCTTT. CCGAAGGTCA GCAATTTTGG TGACAGACAA 
ACAATGAATG AGTTTCTGAC CTGCATCAGG CTGATCAAAA TGTATGCCTG 
TTTACCAACA CTATCCAAGA TATAAGAAGG AGGGAAAGAA AATTACTGGA 
TTTGTCCAAA GTGGAAACTC TGCCCTGGCC. CCCATCGTGT CCACCATAGC 
araTTATCrT GCCACATCCT CCTGAGACGC AAACTCACCG CACCCGTGGC 

ATOC^CT WMTGTART GAAGTTTTCC ATTCCAATCT TGCCCTTCTC CATCAAAGCA 1200 

CGAATGTCTC TCTAAGGAGA ATGAAGAAAA TTCTCATAGA TAAAAGCCCC 1260 

CcScA S£££a£c IgLgACCCA GATACTGTCT TGCTTTTAGC AAATGCCAGC 1320 

TTGaStGGG AGCATGAAGC CAGCAGGAAA AGTACOCCAA AGAAATTGCA GAACCAGAAA 1380 

AGGCATTTAT GCAAGAAACA GAGGTCAGAG. GCATACAGTG AGAGGAGTCC ACCAGCCAAG 1440 

GGAGCCACTG GCCCAGAGGA'-. GCAAAGTGAC AGCCTCAAAT CGGTTCTGCA CAGCATAAGC ISOO 

^3^A GAAAGTTATG TCGTIATCCC GAAGCCCAGC TCCTGGCTTG-GAGGTGGCCA 1560 

SSto TTGGGAGAAT CATCAGAGGA TACAGGCCTC ATGGATTTTC TGCTAAAGAC 1620 

aSgIS CWGAAGGCT TCTTACTTGG CCCCAAGAAG TGGATAGGAC TCAAAGGGCA 1680 

GCCAAATACC TGGGGAAGAT CTTGGGAATA TGTGGGAATG TGGGAAGTGG AAAGAGCTCC 1740 

CTCCTTCCAG CTCTCCTAGG ACAGATGCAG CTGCAGAAAG GGGTGGTGGC AGTCAATGGA 1800 

ACm^OT ACGTTTCACA GCAGGCATGG ATCTTTCATG GAAATGTGAG AGAAAACATA 1860 

AAAAGTWGA TCACCAAAGG TATCAGCACA CAGTCCGCGT CTGTGGCCTC 1920 

CAGAAGGACC TGMCAACCT CCCCTATGGA GACCTGACTG AGATTGGGGA GCGGGGCCTC 1980 

S^CT^ GgSgAG GCAGAGGATT AGCCTGGCGC GCGCTGTCTA CTCCGACCGT 2040 

CAGCTCTACC TGCTGGACGA CCCCCTGTCG GCCGTGGACG CCCACGTGGG GAAGCACGTC 2100 

SSSaGT GCATTAAGAA GACGCTCAGG GGAAAGACAG TCGTCCTGGT GACCCACCAG 2160 

?^StS tgatgaagtt atttiattag AAGATGGAGA GATTTGTGAA 2220 

AAGGGAACCC ACAAGGAGTT AATGGAGGAG AGAGGGCGCT ATGCAAAACT. GATTCACAAC 2280 

^SSt SSa^TCAA GGATCCTGAA CACCTTTACA ATGCAGCAAT- GGTC^CC 2340 

TTCAAGGAGA GCCCTGCTGA GAGAGAGGAA GATGCTGGTA TAATCGGGTA CCTCCTITCT 2400 

rTOTCASG TGTTCCTCTT CCTCCTGATG ATTGGCAGCG CTGCCTTCAG CAACTGGTGG 2460 

GGTTGGACAA GGGCTCACGG ATGACCTGK3 GGCCCCAGGG CAACAGGACC 2S20 

SgTCTGAGG ?EgGCGCGGT GCIGGCAGAC ATCGGTCAGC ATGTGTACCA GTGGGTGTAC 2580 

aSgCAM« TGGTGTTCAT GCTGGTGTTT GGCGTCACCA AAGGCTTCGT CTTCACCAAG 2640 
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ACCACACTGA TGGCATCCTC CTCTCTGCAT GACACGGTGT TTGATAAGAT CTTAAAGAGC 2700 

CCAATGAGTT TCTTTGACAC GACTCCCACT GGCAGGCTAA TGAACCGTTT TTCCAAGGAT 2760 

ATGGACGAGC TGGATGTGAG GCTGCCGTTT CACGCAGAGA ACTTTCTGCA GCAGTTTTTT 2820 

ATGGTGGTGT TTATTCTOGT GATCTTGGCT GCTGTGTTTC CTGCTGTCCT TTTAGTCGTG 2880 

GCCAGCCTTG CTGTAGGCTT- CTTCATTCTG TTACGCATTT TCCACAGAGG AGTCCAGGAG 2940 

CTCAAGAAGG TGGAGAATGT CAGCCGGTCA CCCTGGTTCA CCCACATCAC CTCCTCCATG 30 00 

CAGGGCCTGG GCATCATTCA . CGCCTATGGC AAGAAGGAGA GCTGCATCAC CTATACTTCA 3060 

TC^AAGGCC tgtcattgtc atacatcatc cagctgagcg gactgctcca AG7GTGTGTG 3120 

CGAACGG^A CAGAGACGCA AGCCAAATTC ACCTCCGTGG AGCIGCTCAG GGAATACATT 3180 

TCGACCTGTG TTCCTGAATG. CACTCATCCC CTCAAAGTGG GGACCTGTCC CAAGGACTGG 3240- 

CCCAGCTGTG GGGAGATCAC CTTCAGAGAC TATCAGATGA GATACAGAGA CAACACCCCC 3300 

CTTGTTCTCG ACAGCCTGAA- CTTGAACATA' CAAAGTGGGC AGACAGTCGG GATTGTTGGA 3360 

AGAACAGGTT CCGGAAAGTC ATCGTTAGGA ATGGCTTTGT TTCGTCTGGT. GGAGCCAGCC 3420 

AGTGGCACAA TCTTTATTGA TGAGGTGGAT ATCTGCATTC TCAGCTTGGA AGACCTCAGA 3480 

ACCAAGCTGA CTGTGATCCC- ACAGGATCCT GTCCTGTTTG TAGGTACAGT AAGGTACAAC 3540 

TTGGATCCCT TTGAGAGTCA CACCGATGAG ATGCTCTGGC AGGTTCTGGA GAGAACATTC 3600 

ATGAGAGACA CAATAATGAA ACTCCCAGAA AAATTACAGG CAGAAGTCAC AGAAAATGGA 3660 

GAAAACTTCT CAGTAGGGGA ACGTCAGCTG CTTTGTGTGG CCCGAGCTCT TCTCCGTAAT 3720 

^TtZ SScCtSS* TGAAGCCACC GCCTCTATGG ACTCCAAGAC TGACACCCTG 3780 

GTTCAGAACA CCATCAAAGA TGCCTTCAAG GGCTGCACTG TGCTGACCAT CGCCCACCGC 3840 

CTCAACACAG TTCTCAACTG CGATCACGTC CTGGTTATGG AAAATGGGAA GGTGATTGAG 3900 

TTTGACAAGC CTGAAGTCCT TGCAGAGAAG CCAGATTCTG CATTTGCGAT GTTACTAGCA 3 960 
GCAGAAGTCA GATTGTAG 

Seq ID NO: 513 Protein sequence 
Protein Accession 8: Eos sequence 

1 11 * 21 31 41 51 

ivGBGPYLIS DLDQRGRRRS FAERYDPSLK TMIPVRPCAR LAPNFVDDAG IiSFATFSWL 60 

TPVMVKGYRO RLTVDTLPPl/ STYDSSDTNA KRFRVLWDEE VARVGPEKAS LSHVWKFQR. 120 

TRVLMD I VAN: ILCIIMAAIG PTVLIHQILQ QTERTSGKVW- VGIGLCIALF ATEFTKVFFW, 180 

ALAWAINYRT AIRLKVALST- LVFENLVSFK TLTHISVGEV LNILSSDSYS LFEAALFCPL 240 

S5 CAAYAPFILG:-PTALIGISV^- VIFIPVQMFM- AKLNSAFRRS AILVTDKRVQ . 300 

TMNEFLTCIR- LIKMYAWEKS - FTNTIQDIRR' RERKLLEKAG FVQSGNSALA PIVSTIAIVL 360 

SaFSV IAMFNVMKFS IAILPFSIKA MAEANVSLRR MKKILIDKSP^ 420 

PSYITQPEDP DTVLLLANAT LTWEHEASRK STPKKLQNQK RHLCKKQRSE AYSERSPPAK 480 

gItcPEEQSD SLKSVLHSIS FWRKLCRYP EAQUAWRWP AVFVGRIIRG YRPHGFSAKD 540. 

pqevdrtqra- akylgkilgi cgnvgsgkss llaallgqmq wkgvvavng oo 

TLAYVSQQAW IFHGNVREN I " LFGEKYDHQR" ' YQHTVRVCGL QKDLSNLPYG DLTEIGERGL 660 

NLSGGQRQRI slaravysdr qlyllddpls -avdahvgkhv FEECIKKTLR GKTWLVTHQ 720 

^FLESCDEV ILLEDGEICE KGTHKELMEE RGRYAKLIHN LRGLQFKDPE HLYNAAMVEA 780 

FKESPAEREE DAGIIGYLLS • LFTVFLFLLM IGSAAFSNWW LGLWLDKGSR MTCGPQGNRT 840 

MCEVGAVLAD IGQHVYQWVY- TASMVFMLVF GVTKGFVFTK TTLMASSSLH DTVFDKILKS 900 

pSFFDTTPT GRLMNRFSKD- MDELDVRLPF HAENFLQQFF MWFILVILA AVFPAVLLVV 960 

ASLAVGFFIL LRIFHRGVQB LKKVENVSRS PWFTHITSSM QGLGIIHAYG KKESCITYTS 1020 

SKGLSLSYII QLSGLLQVCV RTGTETQAKF- TSVELliREYr STCVPECTHP LKVGTCPKDW 1080 

PSCGEITFRD YQMRYRDNTP LVLDSLNLNI*- QSGQTVG1VG RTGSGKSSLG MALPRLVEPA 1140 

SGTIFIDEVD. ICILSLEDLR TKLTVIPQDP- VLFVGTVRYN LDPFESHTDE MLWQVLERTF 1200 

MRDTIMKLPE KLQAEVTENG ENFSVGERQL LCVARALLRN SKIILLDEAT ASMDSKTDTL 1260 

VQNT I KDAFK GCTVLTIAHR LNTVLNCDHV LVMENGKVIE FDKPEVLAEK PDSAFAMLLA 1320 
AEVRL 



Seq ID NO: 514 DNA sequence 
Nucleic Acid Accession #: Z31560 
Coding sequence: 1-966 

1. .11 21 31 - S1 

CACAGCGCCC GCATGTACAA CATGATGGAG ACGGAGCTGA AGCCGCCGGG CCCGCAGCAA 60 

ACTTCGGGGG GCGGCGGCGG CAACTCCACC GCGGCGGCGG CCGGCGGCAA CCAGAAAAAC 120 

AGCCCGGACC GCGTCAAGCG GCCCATGAAT- GCCTTCATGG TGTGGTCCCG CGGGCAGCGG 180 

CGCAAGATGG CCCAGGAGAA CCCCAAGATG CACAACTCGG AGATCAGCAA GCGCCTGGGC 240 

GCCGAGTGGA AACTTTTGTC GGAGACGGAG AAGCGGCCGT TCATCGACGA GGCTAAGCGG 300 

CTGCGAGCGC TGCACATGAA GGAGCACCCG GATTATAAAT ACCGGCCCCG GCGGAAAACC 360 

AAGACGCTCA TGAAGAAGGA TAAGTACACG OGCCCGGCG GGCTGCTGGC CCCCGGCGGC 420 

AA TAG CAT GG CGAGCGGGGT CGGGGTGGGC GCCGGCCTGG GCGCGGGCGT GAACCAGCGC 480 

A^GGaSgTT ACGCGCACAT. GAACGGCTGG AGCAACGGCA GCTACAGCAT GATGCAGGAC 540 

CAGCTGGGCT ACCCGCAGCA CCCGGGCCTC AATGCGCACG GCGCAGCGCA GATGCAGCCC 600 

ATGCACCGCT ACGACGTGAG. CGCCCTGCAG TACAACTCCA TGACCAGCTC GCAGACCTAC 660 

ATGAACGGCT CGCCCACCTA CAGCATGTCC TACTCGCAGC AGGGCACCCC TGGCATGGCT 720 

CTTGGCTCCA TGGGTTCGGT GGTCAAGTCC GAGGCCAGCT CCAGCCCCCC TGTGGTTACC 7 B0 

TCTTCCTCCC ACTCCAGGGC GCCCTGCCAG GCCGGGGACC TCOGGGACAT GATCAGCATG 840 

tmSccccg GCGCCGAGGT gccggaaccc GCCGCCCCCA GCAGACTTCA CATGTCCCAG 900 

CACTACCAGA GCGGCCCGGT GCCCGGCACG GCCATTAACG GCACACTGCC CCTCTCACAC 960 

ATGTGAGGGC CGGACAGCGA ACTGGAGGGG GGAGAAATTT TCAAAGAAAA ACGAGGGAAA .1020 

TGGGA^GGT GCAAAAGAGG AGAGTAAGAA ACAGCATGGA GAAAACCCGG TACGCTCAAA 1080 
AAAAA 

Seq ID NO: 51S Protein sequence 
Protein Accession #: CAA83435 ' . . 

1 U ' 21 |1 » f 

HfifiRMYNMME TELKPPGPQQ TSGGGGGNST AAAAGGNQKN SPDRVKRPHH AFMVWSRGQR 60 

RKHAQHNPKM HNSEISKRLG AEHKLLSETE KRPFIDEAKR LRALHMKEHP O^^KT 120 

KT^KKDKYT LPGGLLAPGG NSMASGVGVG AGLGAGVNQR MDSYAHM13GW SNGSYSMMQD 180 
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QI/3YPQKPGL NAHGAAQKQP" KHRYDVSALQ VUSMTSSQTY MNGSPTYSMS VSQQGTPGMA 
LGSHGSWKS EASSSPPWT SSSHSRAPCQ AGDLRDKISM YLPGAEVPEP AAPSRLHMSQ 
HYQSGPVPGT AIKGTI.PI.Sa. M - 

Seq ID NO: 516 DNA sequence 
Nucleic Acid Accession 8: U91618 
Coding sequences- 29- .541 . 



PCT/US02/12476 



240 
300 



1 

I 

CGGACTTGGC 
CATGCTACTC 
AGCATTAGAA 
TCCCTCTTGG. 
AGCTGAGGAA 
TGCTTTAGAT. 
TCACAGCAGG 
TGACAAAAAT 
GCTGTATGAG 
AGAGAATAAA 
ATTATATTTG 
ATTGAATGTG 
TCTTCAAAAA 



11 

1, 

TTGTTAGAAG 
CTGGCTTTCA' 
GCAGATTTCT 
AAGATGACTC 
ACAGGAGAAG; 

ggctttagct 
gcttttcaac 

GGAAAGGAAG 
AATAAACCCA 
TCATTTATTT 
TGTGAAAATG 
TTTTTCTGCA 
AAAAAAAAAA 



21 
I 

GCTGAAAGAT 
GCTCCTGGAG 
TGACCAATAT 
TGCTAAATGT 
TTCATGAAGA 
TGGAAGCAAT, 
ACTGGGAGTT 
AAGTCATAAA 
GAAGACCCTA 
ACATGTGATT; 
TGACAAACAC 
CTAATAGAAA" 
AAATGGGGCC 



31 
I 

GATGGCAGGA 
TCTGTGCTCA 
GCATACATCA 
TTGCAGTCTT 
GGAGCTTGTT- 
GTTGACAATA 
AATCCAGGAA- 
GAGAAAAATT 
CATACTCAAA 
GTGATTCATC 
ACTTATCTGT 
TTAGACTAAG 
GCAATT 



41 
I 

ATGAAAATCC 
GATTCAGAAG 
AAGATTAGTA 
GTAAATAATT 
GCAAGAAGGA 
TACCAGCTCC. 
GATATTCTTG 
CCTTATATTC 
AGAGATTCTT- 
ATCCCTTAAT 
CTCTTCTACA 
TGTTTTCAAA 



SI 
I 

AGCTTGTATG 
AGGAAATGAA 
AAGCACATGT ■ 
TGAACAGCCC 
AACTTCCTAC 
ACAAAATCTG 
ATACTGGAAA 
TGAAACGGCA 
ACTATTACTG 
TAAATAT CAA ; 
ATTGTGGTTT - 
TAAATCTAAA 



Seq ID NO: 517 Protein sequence 
Protein Accession #: AAB50564 

l' 11 21 31 ' 41 SI 

MMAGMKIQLV- CMLLLAFSSW SLCSDSEEEM KALEADFLTN MHTSKISKAH VPSWKMTLLN 
VCSLVNNLNS PAEETGEVHE EBLVARRKLP- TALDGFSLEA MLTIYQLHKI CHSRAFQHWE 
LIQEDILDTG. NDKNGKEEVI KRKIPYILKR QLYENKPRRP YILKRDSYYY 

Seq ID NO: 518 DNA sequence 

Nucleic Acid Accession #: NM_006536.2 

'Coding sequence: 109. .2940 



ACCTAAAACC 
ATGTATGCAG 
AGCATTGCAG. 
GAACTCCCAT 
ATTGCAATTA- 
ATAACTGAAG 
ATAAAGATTT- 
TCATATGAAA 
TACACCCTAC 
TTCCTACTGA*' 
GAATGGGCCC 
ATAAATGGGC; 
GTGTGTGAAA : 
GGATGCACCT 
AGTTTATCTT' 
CTACAGAACC. 
TTTCACCACA; 
GT ACAGGCTG 
GCTGACAGAC 
ATTCATACCT 
CACCAAATTA 
TCAGCTAAAA 
AAACTGAATG 
CTTCTTGGCA 
CTGGGTTCAT- 
TTCTTTGTTC 
TCTGGAACTG 
" AAACCTCACC 
ATGTTTCTAG 

ggacgaaaat 
. tggattccag: 
tctctgcaag 

GCCACTGTGG 
TATGCCAATG 
GAGCCAGAGA 
GTTATAAAAA 
TATAGCTTGA 
* CCAGGGAGTC 
GCTCCAAGGA 
AGCTCAGGAG 
CCACCATGCA 
TGGACAGCAC 
AGTAAAAGTC 
AAGCGAAATC 
ACGAATGGAC 
GCAATACGAG 
CCTCTGTTTA 
GGAGTTTTAA 



11 
I 

TTGCAAGTTC 
CAGGCT CAGT 
GTCCTATTTG: 
TCCTGGGAGC 
ATCCTCAGGT 
CTTCATTTTA 
TAATACCTGC 
AGGCAAATGT 
AATACAGAGG 
ATGATAACTT- 
ACCTCCGTTG 
AAAATCAAAT 
AAGGTCCTTG 
TTATCTACAA' 
CTGTGGTTGA 
AGATGTGCAG 
GCTTTCCCAT- 
GTGACAAAGT- 
TCCTTCAACT 
TCGTGGGCAT 
ACAGCAATGA* 
CAGACATCAG 
GAAAAGCTTA 
ATTGCTTACC 
CTGCAGCCCC 
CAGATATATC 
GAGACATTTT 
■ATCAATTGAA 
TTACGTGGCA 
ACTACACAAA 
GAACAGCTAA 
CCCTGAAAGT 
AAGCCTTTGT- 
TGAAACAGGG 
CTGGAGATCC 
ATGATGGAAT 
AAGTGCATGT- 
ATGCTATGTA 
AATCAGTAGG 
GCTCCTTTTC 
AAATTATTGA 
CTGGAGAAGA 
TACAGAATAT 
CTCAGCAAGC 
■ CTGAACATCA 
CAATGGATAG 
TTCCCCCCAA 
CAGCAATGGG 



21 
I 

AGGAAGAAAC 
GTGAGTGAAC 
CAACCTGAAG. 
TGGAGTACAG 
ACCTGAGAAT. 
CCTATTTAAT 
CACATGGAAA 
CATAGTGACT 
GTGTGGAAAA 
AACAGCTGGC 
GGGTGTGTTC 
TAAAGTGACA 
CCCCCAAGAA 
TAGCACCCAA 
ATTTTGTAAT 
CCTCAGAAGT 
GAATGGGACT. 
GGTCTGTTTA 
ACAACAAGCC 
TGCCAGTTTC 
TGATCGAAAG • 
CATTTGTTCA 
TGGCTCTGTG 
CACTGTGCTC 
AAATCTGGAG 
AAACTCCAAT 
CCAGCAACAT 
AAACACAGTG 
GGCCAGTGGT 
TAATTTTATC 
GCCTGGGCAC 
GACAGTGACC 
GGAAAGAGAC. 
ATTTTATCCC. 
TGTTACGCTG 
TTACTCGAGG 
• CAATCACTCT 
TGTACCAGGT 
CAGAAATGAG 
AGTGCTGGGA 
CCTGGAAGCT 
CTTTGATCAG 
CCAAGATGAC 
TGGCATCAGG 
GCCAAATGGA 
GAACTCCTTA 
TTCTGATCCT 
TTTGATAGGA 



31 
I 

CATCTGCATC 
TGGAGGCTTC. 
TTTGTGACTC 
CTTCAAGACA 
CAGAACCTCA 
GCTACCAAGA 
GCTAATAATA 
GACTGGTATG 
GAGGGAAAAT 
TACGGATCAC 
GATGAGTATA 
AGGTGTTCAT 
AACTGTATT A ■* 
AATGCAACTG. 
GCAAGTACCC 
GCATGGGATG 
GAGCTTCCAC 
GTGCTGGATG 
GCAGAATTTT 
GACAGCAAAG 
TTGCTGGTTT 
GGGCTTAAGA 
ATGATATTAG 
AGCAGTGGTT 
GAATTATCAC 
AGCATGATTG 
ATTCAGCTTG 
ACTGTGGATA 
CCTCCTGAGA 
ACCAATCTAA 
TGGACTTACA 
TCTCGCGCCT- 
AGCCTCCATT 
ATTCTTAATG 
AGACTCCTTG 
TATTTTTTCT 
CCCAGCATAA 
TACACAGCAA 
GAGGAGCGAA 
GTTCCAGCTG 
GTAAAAGTAG 
GGCCAGGCTA 
TTTAACAATG 
GAGATATTTA 
GAAACACATG 
CAGTCTGCTG 
GTACCTGCCA 
ATCATTTGCC 



41 

I 

CATATTGAAA 
TCTACAACAT 
TCCTGGTTGC 
ATGGGTATAA 
TCTCAAACAT 
GAAGAGTATT 
ACAGCAAAAT 
GGGCACATGG 
ACATTCATTT 
GAGGCCGAGT 
ACAATGACAA 
CTGACATCAC 
TTAGTAAGCT - 
CATCAATAAT 
ACAACCAAGA 
TAATCACAGA 
CTCCTCCCAC 
TGTCCAGCAA 
ATTTGATGCA 
GAGAGATCAG 
CATATCTGCC 
AAGGATTTGA 
TGACCAGCGG 
CAACAATTCA 
GTCTTACAGG 
ATGCTTTCAG 
AAAGTACAGG 
ATACTGTGGG 
TTATATTATT 
CTTTTCGGAC 
CCCTGAACAA 
CCAACTCAGC 
TTCCTCATCC 
CCACTGTCAC 
ATGATGGAGC 
CCTTTGCTGC 
GCACCCCAGC 
ACGGTAATAT 
AGTGGGGCTT 
GCCCCCACCC 
AAGAGGAATT 
CAAGCTATGA 
CTATTTTAGT 
CGTTCTCACC 
AAAGCCACAG 
TArCTAACAT- 
GAGATTATCT 
TTATTATAGT 



51 
1 

ACCTGACACA 

GACCCAAAGG ' 

CTTAAGTTCA 

TGGATTGCTC 

TAAGGAAATG 

TTTCAGAAAT 

AAAACAAGAA 

AGATGATCCA 

CACACCTAAT 

GTTTGTCCAT 

ACCTTTCTAC 

AGGCATTTTT 

TTTTAAAGAA 

GTTCATGCAA 

AGCACCAAAC 

CTCTGCTGAC 

ATTCTCGCTT 

GATGGCAGAG. 

GATTGTTGAA 

AGCCCAGCTA 

CACCACTGTA 

GGTGGTTGAA 

AGATGATAAG. 

CTCCATTGCC 

AGGTTTAAAG 

TAGAATTTCC 

TGAAAATGTC 

CAACGACACT 

TGATCCTGAT 

AGCTAGTCTT- 

TACCCATCAT' 

TGTGCCCCCA 

TGTGATGATT 

TGCCACAGTT 

AGGTGCTGAT 

AAATGGTAGA 

CCACTCTATT 

TCAGATGAAT 

TAGCCGAGTC 

TGATGTGTTT 

GACCCTATCT 

AATAAGAATG 

AAATACATCA 

CCAGATTTCC 

AATTTATGTT 

TGCCCAGGCG 

TATATTGAAA 

TGTGACACAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 

120 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
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720 
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900 
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1020 
1080 
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1200 
1260 
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1560 
1620 
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1920 
1980 
2040 
2100 
2160 
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2400 
2460 
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2640 
2700 
2760 
2820 
2880 



382 
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WO 02/086443 

CATACTTTAA GCAGGAAAAA 
ATAAATATCC AAAGTGTCTT 
CATACTAACA AAGTCAAATT 
ATACAGATAA GATTTTTACA- 
CCTTACACTT TGGCTATGAA- 
GCAAAGGGAA GGGTAAAGTC 
AATAGCCCCA AGCAGAGAAA 
TCATTTAGTT " ACTTTG A TTA 
TTTACATGAA GATCATGCTA 
CTTGCTATTT TGTTATATAT. 
TTTCACTGTA- AGAGGTAACC 
TTTATGACAA AGGTCTATTG 
TTTCTAAGTT- TATTGCCTTG 
TACCTAGGAA A • 



Seq ID NO : 519 Protein sequence. 
Protein Accession 8: NPJ006527.1 



PCTYUS02/12476 



GAGAGCAGAC 
CCTTCTTAGA 
AACATCAAAA 
TGGTAGATCA 
CAAATAATAA 
GGACCAGTGT 
AGGAGGGTAG 
ATTTTTCTTT 
TATTTTATAT 
ATTTCAGATG 
TTTAACAAT A 
AATTTATTTG 
GGTTATTATG 



AAGAAAGAGA 
TATAAGACCC 
CTGTATTAAA 
ACAATTCTTT 
AAATTATTCT 
CAAGGAAAGT 
GTCTGCATTA 
TCTCCTTATC 
ATGTAGCCCC 
ACATCTCCCT 
TGGGTATTAC 
TNTGTAAGTT 
GAATGATAGT 



ATGGAACAAA 
ATGGCCTTCG 

. ATGCATTGAG 
TTGGGGGTAG 
TTAAAGTAAT 
TTGTTTTATT 

. TAACTGTCTG 
TGTGCAGTAC 
TAATGCAAAG 

' GCTAATGCTC 
CTTTGTCTCT 
TCTACTCCCA 
TATAGCCCCN 



ATTATTAT AA 
ACTACAAAAA 
TTTTTGTACA 
ATTAGAAAAC 
GTCTTTAAAG 
GAGGTGGAAA 
TGTGAAGCAA 
AGGTTGCTTG 
CTCTTTACCT 
AGAGATCTTT 
TCATACCGGT 
TCAAAGCAGC 
TATAATGCCT 



1 

I ; " 
MTQRSIAGPI 
IKEMITEASP 
GDDPYTI.QYR 
KPFYINGQNQ 
MFMQSIjSSW 
TFSLVQAGDK 
RAQLHQINSN 
GDDKLLGNCL 
SRISSGTGDI 
FDPDGRKYYT 
AVPPATVEAF 
AGADVIKNDG: 
IQMNAPRKSV 
LTLSWTAPGE 
PQISTNGPEH. 
liILKGVLTAM 



11 
I 

CNLKFVTIiLV 
YLFNATKRRV 
GCGKEGKYIH 
IKVTRCSSDI 
EFCNASTHNQ 
WCLVLDVSS. 
DDRKLLVSYL 
PTVLSSGSTI: 
FQQHIQLEST 
NNF I TNIiTFR 
VERDSLHFPH; 
IYSRYFFSFA 
GRNEEERKWG 
DFDOGQATSY 
QPNGETHESH 
GLIGIICLII 



21 

I . 

ALSSELPFLG 
FFRNIKIIjIP 
FTPNFLLNDN 
TGIFVCEKGP 
EAPNLQNQMC 
KMAEADRLLQ 
PTTVSAKTDI- 
HSIALGSSAA 
GENVKPHHQL 
TASLWIPGTA; 
PVMIYANVKQ. 
ANGRYSLKVH-- 
FSRVSSGGSF 
EIRMSKSLQN 
RIYVAIRAMD' 
WTHHTLSRK 



31 
I 

AGVQLQDNGY 
ATWKANNNSK 
LTAGYGSRGR 
CPQENCIISK 
SLRSAWDVIT- 
LQQAAEFYLM 
SICSGLKKGF 
PNLEELSRLT 
KNTVTVDNTV 
KPGHWTYTLN. 
GFYPILNATV 
VNHSPSISTP 
SVLGVPAGPH 
IQDDFNNAIL 
RNSLQSAVSN 
KRADKKENGT 



41 
I 

NGI^IAINPQ 
IKQESYEKAN 
VFVHEWAHLR 
LFKEGCTFIY' 
DSADFHHSFP 
QIVEIHTFVG 
EWEKIiNGKA 
GGLKFFVPDI 
GNDTMFLVTW 
NTHHSLQALK 
TATVEPETGD. 
AHSIPGSHAM 
PDVFPPCKI I '- 
VNTSKRNPQQ 
IAQAPLFIPP 
KLIj . 



Seq ID NOs 520 DNA sequence 

Nucleic Acid Accession ft: NM_000228.1 

Coding sequence: 82.. 3600 



1 

. I 

GCTTTCAGGC 
GGATCACCCC 
CTCCTGCATG 
CTTGTTGGGA 
ACCTACTGCA 
CCTCACAACT 
TGGTGGCAGT. 
TTCCAGCTTC 
GAGCGCT CCT 
ACCTCCACCT 
CAGTCCCTGC 
ATGGATTTAG' 
ATCACAAACT. 
CCTCCCAGCG 
GGCCATGCTG 
CAGGTCCACG 
GCACCCTTCT. 

. CAAAGGTGCG 
GCCAGCCAGG 
AACTGTGAGC 
GAGACCTGCA 
CCAGTGACCG 
AAGCCGGGCT 
AACATCCTGG 
CTGCCCAACG 
AGTGGCCAGG 
CAACCAGTTC 

. GCAGCCATCC 
TGTGACTGTG 
CTCTGCCGCC 
CGCTACCCGG 
GAGCAGGCCC 
GGGCTGGAGG 
ATCCGAGCAG 
GCCATCCTCT 
GAGACGTTGT 
ACTATGTATC 
GCCTTCCGGA 
GACAGCTCGC 
CGGCAGGCGG 
ATGTCTTCGT 
ATGGCTTGCA 
TGTGGCTCGC 
CAGGTGGCTG 



11 

V- 

GATCTGGAGA 
ATTGG CTGAA" 
CCCAACAAGC 
GGACCCGGTT. 
CCCAGTATGG 
ACTACAGTCA 
CCCAGAATGA 
AAGAAGTCAT 
CAGACTTCGG: 
TCCCTCGGGT 
CTCAGAGGCC 
TGTCTGGGAT 
TGAGAGTCAA 
CCTACTATGC 
ATCGCTGCGC 
ATGTCTGTGT 
ACAACAACCG 
ACTGCAATGG 
GGGCATATGG 
GGTGTCAGCT 
TCTCCTGCGA 
GGCAGTGTGT 
TCACTGGACT 
GGTCCCGGAG 
TGGTGGGTCC 
GCTGTGAACC 
ACAGGGCAGT 
GCCAGTGTCC 
ATTTCCGGGG 
CTGGCTTGAC 
TGTGCGTGGC 
TGGGCTTTGG 
ACCGTGGCCT 
TTCTCAGCAG 
CGCTCAGGCG 
CCCTTCCGAG 
AGAGGAAGAG 
TGCTGAGCAC 
GCCTTTTGGA 
GAGGAGGAGG 
TGCCTGACCT 
CCCCAATATC 
GCTGCAGGGG 
AGCAGCTGCG 



21 • 
I 

AAGAACGGCA: 
GATGAGACCA 
CTGCTCCCGT 
TCTCCGAGCT- 
CGAGTGGCAG: 
CCGAGTAGAG 
TGTGAACCCT 
GATGGAGTTC. 
TAAGACCTGG 
CCGCCAGGGT 
TAATGCACGC 
TCCAGCAACT 
TTTCACCAGG 
TGTGTCCCAG 
ACCCAAGCCT 
CTGCCAGCAC 
GCCCTGGAGA 
GCACTCAGAG 

aggtgtgtgt 
gcactatttc 
gtgtgatccg 
gtgcaaggag 
cacctacgcc 
ggacatgccg 
caaatgtgac 
gtgtgcctgc 
gccctgtcgg 
agaccggacc 
aacagagggc. 
cgggccccgc 
ctgccaccct4 
tagactccgc 
ggcctcccgg: 
ccccgcagtc-. 
aactctccag. 
agacctggag 
ggagcagttt 
agcctacgag 
ccagctcagg. 
aggcaccggc. 
gacacccacc 
atgccctggt 
tgtccttccc 
gggcttcaat 



31 
I 

GAACACACAG 
TTCTTCCTCT 

ggggcctgct 
tcatctacct 
atgaaatgct 

AATGTGGCTT 
GTCTCTCTGC 
CAGGGGCCCA 
CGAGTGTACC. 
CGGCCTCAGA 
CTAAATGGGG 
CAAAGTCAAA 
CTGGCCCCTG;. 
CTCCGTCTGC 
GGGGCCTCTG 
AACACTGCCG 
CCGGCGGAGG 
ACATGTCACT- 
GACAATTGCC 
CGGAACCGGC 
GATGGGGCAG 
CATGTGCAGG 
AACCCGCAGG 
TGTGACGAGG 
CAGTGTGCTC 
GACCCGCACA 
GAAGGCTTTG 
TATGGAGACG 
CCGGGCTGCG 
TGTGACCAGT 
TGCTTCCAGA 
AATGCCACCG 
ATCCTAGATG 
ACAGAGCAGG 
GGCCTGCAGC 
AGTCTTGACA- 
GAAAAAATAA' 
CAGTCAGCCC 
GACAGCCGGA 
AGCCCCAAGC 
TTCAACAAGC 
GAGCTATGTC 
AGGGCCGGTG 
GCCCAGCTCC 



41 
I 

CAAGGAAAGG 
TGTGTTTTGC 
ATCCACCTGT 
GTGGACTGAC. 
GCAAGTGTGA 

catcctccgg; 
agctggacct 

TGCCCGCCGG 
AGTACCTGGC 
GCTGGCAGGA- 
GGAAGGTCCA 
AAATTCAAGA 
. TGCCCCAAAG 
AGGGGAGCTG 
CAGGCCCCTC 
GCCCAAATTG 
GCCAGGACGC 
TTGACCCCGC 
GGGACCACAC 
GCCCGGGAGC 
TGCCAGGGGC 
GAGAGCGCTG 
GCTGCCACCG 
AGAGTGGGCG 
CCTACCACTG 
ACTCCCCTCA 
GTGGCCTGAT 
TGGCCACAGG 
ACAAGGCATC 
GCCAGCGAGG 
CCTATGATGC 
CCAGCCTGTG 
CAAAGAGTAA 
AGGTGGCTCA 
TGGATCTGCC 
GAAGCTTCAA 
GCAGTGCTGA 
AGGCTGCTCA 
GAGAGGCAGA 
TTGTGGCCCT 
TCTGTGGCAA 
CCCAAGACAA 
GGGCCTTCTT 
AGOGGACCAG 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 



51 
I 

VPENQNLISN 
VIVTDWYGAH 
WGVFDEYNND • 
NSTQNATASI 
MNGTELPPPP 
IASFDSKGEI- 
YGSVMILVTS 
SNSNSMIDAF 
QASGPPEIIL 
VTVTSRASNS 
PVTLRIiLDDG 
YVPGYTANGN 
DIiEAVKVEEE 
AGIREIFTFS 
NSDPVPARDY 



51 
I 

TCCTTTCTGG 

CCTGCCTGGC 

TGGGGACCTG 

CAAGCCTGAG " 

CTCCAGGCAG 

CCCCATGCGC 

GGACAGGAGA 

CATGCTGATT 

TGCCGACTGC ' 

TGTTCGGTGC 

ACTTAACCTT 

GGTGGGGGAG 

GGGCTACCAC 

CTTCTGTCAC 

CACCGCTGTG 

TGAGCGCTGT 

CCATGAATGC 

TGTGTTTGCC 

CGAAGGCAAG 

TTCCATTCAG 

TCCCTGTGAC 

TGACCTATGC 

CTGTGACTGC 

CTGCCTTTGT 

GAAGCTGGCC 

GCCCACAGTG 

GTGCAGCGCT 

ATGCCGAGCC 

AGGCCGCTGC 

CTACTGCAAT ' 

GGACGTCCGG 

GTCAGGGCCT 

GATTGAGCAG 

GGTGGCCAGT 

CCTGGAGGAG 

TGGTCTCCTT 

TCCTTCAGGA - 

GCAGGTCTCC 

GAGGCTGGTG 

GAGGCTGGAG 

CTCCAGGCAG 

TGGCACAGCC 

GATGGCGGGG 

GCAGATGATT 



60 
i20 
180 
240 
300 
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480 
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WO 02/086443 

AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG 
GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGTCAGAC 
CAGGTCCGGG ACTTCCTAAC AGACCCCGAC ACTGATGCAG 
GAGGCCGTGC TGGCCCTGTG GCTGCCCACA GACTCAGCTA 
GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT 
CAGGACATTG CGCGTGCCCG CCGGTTGCAG GCTGAGGCTG 
• CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC 
CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT 
AGGGTTGCTG AGGTTCAGCA GGTACTGCGG CCAGCAGAAA 
AAGCAGCTGG. GTGACTTCTG GACACGGATG GAGGAGCTCC 
GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GCGGAAGGTG. 
GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGTATGCTG 
CAGAGTTCCA TGCTGGGTGA GCAGGGTGCC CGGATCCAGA 
GAGCTGTTTG GGGAGACCAT GGAGATGATG GACAGGATGA 
CTGOGGGGCA GCCAGGCCAT CATGCTGCGC TCGGCGGACC 
GTGGAGCAGA~ TCCGTGACCA CATCAATGGG CGCGTGCTCT 
TGCTACAGCT TCCAGCCCGT TGCCCCACTC . ATC TGCC GCC 
GATTGGGTTG GAATGCTTTC CATCTCCAGG AGACTTTCAT. 
GACCACCCCT GGTGTGTAGC TAGTAAGATT ACCCTGAGCT 
GGGACAGTTA CACTTGACAG ACAAAGATGG TGGAGATTGG 
CTCTCAAGTC AAGGAAGCTG GGCTGGGCAG TATCCCCCGC 
GGAATCCTGG ACCAAGCACA AAAACTTAAC AAAAGTGATG 
AAAATCTTTG G 

Seq ID NO: 521 Protein sequence 
Protein Accession #: NP_000219.1 



PCT/US02/12476 



i 
i 

MRPFFLLCFA 
EWQMKCCKCD. 
MEFQGPMPAG 
NARLNGGKVQ 
VSQLRLQGSC 
PWRPAEGGW 
HYFRNRRPGA- 
TYANPQGCHR 
CACDPHNSPQ 
TEGPGCDKAS 
RLRNATASLW; 
TLQGLQLDLP 
AYEQSAQAAQ. 
TPTFNKLCGN 
GFNAQLQRTR 
DPDTDAATIQ. 
RLQAEAEEAR 
VLRPAEKLVT 
IKQKYAELKD 
MLRSADLTGL 



11 

I ' 

LPGLLHAQQA 
SRQFHNYYSH 
MLIERSSDFG 
LNLMDLVSGI 
FCHGHADRCA 
HECQRCDCNG 
SIQETCISCE 
CDCNILGSRR 
PTVQPVHRAV 
GRCLCRPGLT- 
SGPGLEDRGL 
LEEETLSLPR 
QVSDSSRLLD 
SRQMACTPIS 
QMIRAAEESA 
EVSEAVLALW 
SRAHAVEGQV 
SMTKQLGDFW- 
RLGQSSMLGE 
EKRVEQIRDH : 



21 
I 

CSRGACYPPV 
RVENVASSSG. 
KTWRVYQYLA 
PATQSQKIQE 
PKPGASAGPS 
HSETCHFDPA 
CDPDGAVPGA 
DMPCDEESGR 
PCREGFGGLM 
GPRCDQCQRG 
ASRILDAKSK 
DLESLDRSFN 
QLRDSRREAE' 
CPGELCPQDN 
SQIQSSAQRL 
LPTDSATVLQ 
EDWGNLRQG 



QGARIQSVKT 
INGRVLYYAT 



31 
I 

GDLIiVGRTRF 
PMRWWQSQND. 
ADCTSTFPRV- 
VGEITNLRVN- 
TAVQVHDVCV' 
VFAASQGAYG. 
PCDPVTGQCV 
CLCLPNWGP 
CSAAAIRQCP 
YCNRYPVCVA 
IEQIRAVLSS 
GLLTMYQRKR- 
RLVRQAGGGG 
GTACGSRCRG 
ETQVSASRSQ 
KMNEIQAIAA- 
TVALQEAQDT 
RQQGAEAVQA 
EAEELFGETM 
CK 



Seq ID NO: 522 DNA sequence 
Nucleic Acid Accession #: NM_001944. 
Coding sequence: 84..30B3 



TTTTCTTAGA 
TTTCACCAGG 
CCATCTTCGT 
ATGATGAAGA 
AATTTGCCAA 
TTACTTCAGA 
ATCAGCCGCC 
CTATAGTCGA 
AAGGACTAGA 
ATCCTCCAGT 
ACTCACTGGT 
AAATTGCCTT 
GAAACACTGG 
ATCGTCTGGT. 
GTAATATTAA 
CAGCACGTAT 
TGGATGAAGA 
GAAATTGGTT 
AGGCTCTAGA 
CTGAATTTCA 
AGGTAATAAA 
AAAAAGGCAT, 
ATGAGGACAC 
GATACCTAAT. 
ATTCTACTTT 
CGGGTAAAAC 
CAACAGCTGT 
CTAGAACACT 
TAAAGTTGCC 
CCCAGGAACA 
ACAATCGGTG 



11 
I 

CATTAACTGC 
GAAATCAGAG 
GGTGGTCATA 
AGAGATGACT- 
ACCCTGCAGA 
TTACCAAGCA 
TTTTGGAATC 
CCGGGAGGAA 
TGTAGAGAAA 
ATTTTCACAA' 
GATGATACTA 
CAAAATTGTC 
GGAAGTCCGT' 
TGTGAGTGGTr 
AGTGAAAGAT 
TGAAGAAAAT 
GTACACAGAT 
TGAAATACAA 
TTATGAACAA 
CCAATCAGTT 
TGT AAGAGAA . 
AAGTAGCAAA 
TAACAAAGCT 
GATTGATTCA 
CATAGTTAAC 
TTCTACAGGC 
CCTCGAAAAA 
GAATAATAGA 
TGCCGTATGG 
GATACCTCCT 
TGAGATGCCA 



21 
I 

AGACGGCTGG 
ACAATGATGG 
TTGGTTCATG 
ATGCAACAAG 
GAAGGAGAAG 
ACCCAGAAAA 
TTTGTTGTTG 
ACTCCAAGCT- 
CCACTTATAC 
CAAATTTTCA 
AATGCCACAG 
TCTCAGGAAC 
ACTTTGACCA 
GCAGACAAAG 
GTCAACGATA 
ATTTTAAGTT 
AATTGGCTTG 
ACTGATCCTA 
CTACAAAGCG 
ATCTCTCGAT 
GGAATTGCAT 
AAATTGGTGG 
GCCTCAAATG 
AAAACTGCTG 
AAAACAATCA 
ACGGTATATG 
GATGCAGTTT 
TACACTGGCC 
AGTATCACAA 
GGAGTATACC 
CGCAGCTTGA 



31 
I 

CAGGATAGAA 
GGCTCTTCCC 
GAGAATTGCG 
CTAAAAGAAG 
ATAACTCAAA 
TCACCTACCG 
ACAAAAACAC 
TCCTGATCAC 
TAACGGTTAA 
TGGGTGAAAT 
ATGCAGATGA 
CAGCAGGCAC 
ATTCTCTTGA 
ATGGAGAAGG 
ACTTCCCAAT 
CTGAATTACT 
CAGTATATTT 
GAACTAATGA 
TGAAACTTAG 
ACCGAGTTCA 
TCCGTCCTGC 
ATTATATCCT 
TCAAATATGT 
AAATCAAATT 
CAGCTGAGGT 
TTAGAGTACC 
GCAGTTCTTC 
CCTATACATT 
CCCTCAATGC 
ACATCTCCCT 
CACTGGAAGT 



CCCAGCGCTT 
GCACACGGCT 
CCACTATCCA 
CTGTTCTGCA 
TGGTGCTGTC 
AGGAAGCCAG 
TGCGGCAGGG 
CCCTTCGGCT 
AGCTGGTGAC 
GCCACCAAGC 
CCAGCGAGCA 
AGTTGAAGGA 
GTGTGAAGAC 
AAGACATGGA 
TGACAGGACT 
ACTATGCCAC 
TTTGCTTTTG 
GCAGCCTAAA 
GCAGCTGAGC 
CATGCCATTG 
CTTTAGTTCT 
TAAAAATGAA 



41 
I 

LRASSTCGLT 
VNPVSLQLDL 
RQGRPQSWQD 
FTRLAPVPQR 
CQHNTAGPNC 
GVCDNCRDHT 
CKEHVQGERC 
KCDQCAPYHW 
DRTYGDVATG 
CHPCFQTYDA 
PAVTEQEVAQ 
EQFEKISSAD 
GTGSPKLVAL 
VLPRAGGAFL 
MEEDVRRTRL 
RLPNVDLVLS 
MQGTSRSLRL 
QQLAEGASEQ 
EMMDRMKDME 



41 
I 

GCAGCGGCTC 
CAGAACTACA 
AATAGAGACT 
GCAAAAACGT 
AAGAAACCCA 
AATCTCTGGA 
TGGAGATATT 
ATGTCGGGCT 
AATTTTGGAT 
TGAAGAAAAT 
ACCAAACCAC 
ACCCATGTTC 
CCGAGAGCAA 
ACTATCAACT 
GTTTAGAGAC 
TCGATTTCAA 
CTTTACCTCT 
AGGCATCCTG 
TATTGCTGTC 
GTCAACCCCA 
TTCCAAGACA 
GGG AACAT AT ' 
CATGGGACGT 
TGTCAAAAAT 
TCTGGCCATA 
CGATTTCAAT 
ACCTTCCGTG 
TGCACTGGAA 
TACCTCGGCC 
GGTACTTACA 
CTGTCAGTGT 



GGAGACCCAG 
CCTAATCCAG 
GGAGGTCAGC 
GAAGATGAAT 
CCAGACCAAG 
GAGCCGAGCC 
GACAGTGGCA 
TATCCAGGAC 
AAGCATGACC 
CCGGCAGCAG 
GGCATTGAGT 
CCGGTTGGGT 
AGAGGCAGAG 
GTTGGAGCTG; 
GGAGAAGCGT 
CTGCAAGTGA 
GTTGGGGGCA- 
GTACAGCCTG 
CTGAGCCAAT 
AAACTAAGAG 
CCACTGGGGA 
AAGCCAAATA 



51 

I 

KPETYCTQYG 
DRRFQLQEVM 
VRCQSLPQRP 
GYHPPSAYYA 
ERCAPFYNNR 
EGKNCERCQL 
DLCKPGFTGL 
KLASGQGCEP 
CRACDCDFRG; 
DLREQALRFG 
VASAILSLRR 
PSGAFRMLST 
RLEMSSLPDL 
MAGQVAEQLR 
LIQQVRDFLT 
QTKQDIARAR. 
IQDRVAEVQQ 
ALSAQEGFER 
LELLRGSQAI 
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ACTTGGACTT 

GGGGCTCTGG 

AAAGGTCAAT 

GAATGGGTGA 

ATTGCCAAGA 

GTGGGAATCG 

AACATAACAG- 

CTAAATGCCC 

ATTAATGATA 

AGTGCCTCAA 

TTGAATTCTA 

CTCCTAAGCA 

GCTAGCAGCT 

CAATGTGAAT 

TCTCAGTATT 

GTAACAGATT 

GGGAATGAAG 

AAAGTGGTGA 

AAAAACAAAG 

GTCACAATTC 

TTTACTGTGC 

CAAGCCATCG 

AACGATGGTG 

ATGAACCGAG 

GATGAATACA 

GACAATTGTC . 

GTTGTCTCCG 

GATCAACCTG 

CTCCTCAGAG 

GACAGTCAGA 

GACAACAGGG 



2700 
2760 
2820 
2880 
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3000 
3060 
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3180 
3240 
3300 
3360 
3420 
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3540 
3600 
3660 
3720 
3780 
3840 
3900. 
3960 
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660 
720 
780 
840 
900 
960 
1020 
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60 
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240 
300 
360 
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600 
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1200 
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1800 
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GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 • 

CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040 

GTGGTTTTAT CCCAGTTCCT GATGGCTCAQ AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100 

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160 

GAjGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220 

TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG. 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT. CAGGAGCTGC TTCAGGATTC ^CAGCCA 2340 

CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG , 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460 

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG' CCAGGAAGCA- AATGACTGCT 2520 

ESSJ- ££ZZ* GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 2580 

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640 

TTAAAAAACT TGCAGAGATA" AGCCTTGGTG TTGATGGTGA AGGCAAAGAA- GTTCAGCCAC ■ 2700 

CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCCAGCAGA- 2760 

CAGGATTTGT TAAGTGCCAG . ACTTTGTCAG-. GAAGTCAAGG AGCTTCTGCT TTGTCCGCCTr 2820 

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT- GCAGCATGGT AACTATTTAG. 2880 

TAACGGAGAC TTACTCGGCT TCTCGTTCCC TCGTGCAACC. TTCCACTGCA GGCTTTGATC 2940 

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3000 

CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC ^TACAGAGG.. 3060 

ATCCTTGCTC CCGTGTAATA- TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 

ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG ™TAATT 3180 

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT- AAATTAAATG TTTGGGTTCA 3240 

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300 
TCTTAAAGTT TTTCAAAACC ' CTAAAATCAT ATTCGC 

Seq ID NO: 523 Protein sequence 
Protein Accession 8: NP_0Q193S.l 

1 . 11 21 31 41 51 

Lglfprttg alaifwvil VHGBLRIBTK gqydeeemtm qqakrrqkrb WVKFAKPCRB 60 

G^MSKRNPI- AKITSDYQAT QKITYRISGV GIDQPPFGIF VVDKNTGDIN l™™*** »» 

PSFMTCRAIf NAQGLDVEKP LILTVKILDI- NDNPPVFSQQ IFMGEIEENS ASNSLVMILN: 180 

ATDADEPNHL NSKIAFKIVS QEPAGTPMFL LSRNTGEVRT LTNSLDREQA SSYRLWSGA 240 

DKDGEGLSTQ CECNIKVKDV, NDNFPMFRDS QYSARIEENI LSSELLRFQV TDLDEEYTDN- 300 

WLAVYFFTSG NEGNWFEIQT- DPRTNEGILK WKALDYEQL QSVKLSIAVK NKAEFHQSVI 360 

SRYRVQSTPV- TIQVINVREG IAFRPASKTF- TVQKGI SSKK LVDYILGTYQ AIDEDTNKAA 420 

SNVKYVMGRN- DGGYLMIDSK- TAEIKFVXNM NRDSTFIVNK TITAEVLAID, EYTGKTSTGT 430 

^RVPDFND NCPTAVLEKD; AVCSSSPSW VSARTLNNRY TCPYTFALED QPVKLPAVWS 540 

ITTLNATSAL -LRAQEQIPPG VYHISLVLTD SQNNRCEMPR SLTLEVCQCD NRGICGTSYP 600 

TTSPGTRYGR PHSGRLGPAA IGLLLLGLLL LLLAPLLLLT CDCGAGSTGG VTGGFIPVPD 660 

GSEGTIHQWG IEGAHPEDKE- ITOICVPPVT ANGADFMESS EVCTNTYARG. TAVEGTSGME • 720 

MTTKLGAATE SGGAAGFATG TVSGAASGFG AATGVGICSS GQSGTMRTRH STGGTNKDYA 780 

DGAISMNFLD SYFSQKAFAC AEEDDGQEAN. DCLLIYDNEG ADATGSPVGS VGCCSFIADD 840 

LDDSFLiDSLG PKFKKLAEIS LGVDGEGKEV- QPPSKDSGYG IESCGHPIEV QQTGFVKCQT 900 

LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS GSLVQPSTAG FDPLLTQNVI 960 
VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRLI 

Seq ID NO: 524 DNA sequence 

Nucleic Acid Accession #: XM_058069.2 

Coding sequence : 1 . . 1413 

1 * 11 21 31 41 51 

ATGAAGTTTC TTCTAATACT GCTCCTGCAG GCCACTGCTT CTGGAGCTCT TCCCCTGAAC 60 

AGCTCTACAA GCCTGGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AG^AATTT 120 

TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTGG AAACTTAATG 180 

AAGGAAAAAA TCCAAGAAAT GCAGCACTTC r TTGGGTCTGA AAGTGACCGG GCAACTGGAC 240 

ACATCTACCC TGGAGATGAT GCACGCACCT CGATGTGGAG TCCCCGATGT CCATCATTTC 300 

AGGGAAATGC CAGGGGGGCC CGTATGGAGG AAACATTATA TCACCTACAG AATCAATAAT 360 

TACACACCTG ACATGAACCG TGAGGATGTT GACTACGCAA TCCGGAAAGC TTTCCAAGTA 420 

TGGAGTAATG TTACCCCCIT GAAATTCAGC AAGATTAACA CAGGCATGGC TGACATTTTG 4 80 

GTGGTTTTTG CCCGTGGAGC TCATGGAGAC TTCCATGCTT TTGATGGCAA AGGTGGAATC 540 

CTAGCCCATG CTTTTGGACC TGGATCTGGC ATTGGAGGGG ATGCACATTT CGATGAGGAC 600 

GAATTCTGGA CTACACATTC AGGAGGCACA AACTTGTTCC TCACTGCTGT TCACGAGATT 660 

GGCCATTCCT. TAGGTCTTGG CCATTCTAGT GATCCAAAGG CCGTAATGTT CCCCACCTAC 720 

AAATATGTTG ACATCAACAC ATTTCGCCTC TCTGCTGATG ACATACGTGG CATTCAGTCC 780 

CTGTATGGAG ACCCAAAAGA GAACCAACGC TTGCCAAATC CTGACAATTC AG^CAGCT 840 

CTCTGTGACC CCAATTTGAG TTTTGATGCT GTCACTACCG TGGGAAATAA GATCTTTTTC 900 

TTCAAAGACA GGTTCTTCTG GCTGAAGGTT TCTGAGAGAC CAAAGACCAG. TGTTAATTTA 960 

ATTTCTTCCT TATGGCCAAC CTTGCCATCT GGCATTGAAG CTGCTTATGA AATTGAAG CC. 1020 

AGAAATCAAG TTTTTCTTTT TAAAGATGAC AAATACTGGT TAATTAGCAA TTTAAGACCA 1080 

GAGCCAAATT ATCCCAAGAG. CATACATTCT TTTGGTTTTC CTAACTTTGT. GAAAAAAATT 1140 

GATGCAGCTG TTTTTAACCC ACGTTTTTAT AGGACCTACT TCTTTGTAGA TAACCAGTAT 1200 

TGGAGGTATG ATGAAAGGAG ACAGATGATG GACCCTGGTT ATCCCAAACT GATTACCAAG 1260 

AACTTCCAAG GAATCGGGCC TAAAATTGAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 1320 

TA^TCTTCC AAGGATCTAA CCAATTTGAA TATGACTTCC TACTCCAACG TATCACCAAA 13 80 
ACACTGAAAA GCAATAGCTG GTTTGGTTGT TGA 

Seq ID NO :-525 Protein sequence 
Protein Accession S: P39900 

1 11 . -21. " ■ |1 I 1 « 

Lfulilluq Itasgauln sstsi^eknnv LFGERYLEKF YGLEINKLPV TKMKYSGNLM 60 

S^Sqhp lglkvtgqld tstlemmhap rcgvpdvhhf rempggpvwr KHYITYRINN 120 
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YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL 
LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI- 
• KYVDINTFRL. SADDIRGIQS LYGDPKENQR LPNPDNSEPA 
FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA 
EPNYPKSIHS PGFPNFVKKI DAAVFNPRFY. RTYFFVDNQY 
NFQGIGPKID AVFYSKNKYY YFFQGSNQFB YDFLLQRITK 

Seq ID NOr 525 UNA sequence 

Nucleic Acid Accession fl: NMJ)24423.1 

Coding sequence* 64.. 2590 



PCT/US02/12476 



i 
I 

GGCAGGTCTC 
CCGATGGCCG 
CTGACCCTCG 
CCTTCTAAAC 
TCTGCAGACC 
TACACAGCCA 
GACAAAAGGA 
TCGAAGACAA- 
ATTCCTTGCT 
GAATCTGATG 
AAAGAACCTT 
CCTGTGGATC 
GGATATTCAG- 
. CACCCTGTTT 
ACTAGAGTGG: 
CTGAAATACA- 
AGCACAGGCG 
TCATTGATAA 
ACTTGTATCA 
TATGAAGCAT 
GATAAGGATT 
GAAAATGGAC 
GTAAAGCCAC 
GAAGCGCCAT. 
GTTCATGTGA 
ATTAAAGAAA 
AATAGAAATG 
ATTGATGAAA' 
CCCAAAAATG 
ACTGGAACAC 
GAATATGTAG 
GATGAACCTG 
AGTAGACTGT 
AATGCTGGAT 
GCAACAAAAT 
ACTTCAAGGA 
ATAGCACTGC 
GGGAAACGTT 
CCTGGAGACG 
AGCCAAGGTT 
GAAATGATGA 
ACCCTGGACT 
GAGTGGCACA 
TAAAAATTAA 
CCAAGATTAT 
CTGCTGCAGT 
ATTTATTACA 
TTTGTCAGAC 
TGTATATGAT 
AGCCAGTTGT 
TCTCAAACTC 
GATATTTTAG 
ATATCACATT 
AGTATCACTA 
ATGTTGCAGC 
TTGGAGGCAA 
GGAAATAAAT 
AACAAAGAGG 
GAAAAAAGAG 
GAGGAAATAG 
TTCTGGTTTC 
CTGTTTCAAG 
GGGGTTCCCT 
ATAACAAAAA 
TTCTCTCTTA 
AGTTTAGAGG 
ATTGTCCTTA 
TTTCATTTTT 
AGGCCTTGTG 
CCTTAAGTGA 
CTTTCTCCAG 
AAAGATCAAG 
TATTTGTACA 
AAGGAATATG 



11 
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GCTCTCGGCA 
CCGCTGGGCC 
TGATCTTCAG 
TAGAGGCAGA- 
TCATCCGGTC 
GGGCTGTTGC 
AACAGACACA 
GACACACTAG. 
CTATGCAAGA 
CAGCACAGAA 
TAAATTTGTT 
GTGAAGAATA 
CAGATCTGCC 
TCACAGAAGC 
GGGTGGTTTG 
GCATTTTGCA 
TAATCACCAC 
TGAAAGTACA- 
TAACAGTAAC 
TTGTAGAGGA- 
TAATTAACAC 
ATTTCAAAAT 
TGAATTATGA 1 
TTGCTAGAGA 
GGGATCTGGA- 
ACTTAGCAGT- 
GCAATGGTTT 
TTTCAGGGTC 
AGTTGTATAA- 
TTGCTGTGAA 
TCATTTGCAA 
TCCATGGAGC 
GGAGCCTCAC 
TTCAAGAATA 
TATTGAGAGT 
GTACAGGAGT 
TCTTTTCTGT 
TTCCTGAAGA- 
ATAGAGTGTG. 
TTTGTGGTAC 
AAGGAGGAAA 
CCTGCAGGGG 
GTTTTACTCA" 
ACATAAAAGA 
GTCCTCACTT^ 
GAAAAGCAGG 
-TTAGCAGAAG 
ATTCTGGAGG 
GATTTTTTTC 
TGCTTATCTT 
CAGCACTGGA 
TAATAAATAT 
ATTATGTATT 
TGTGAAGAAA 
TCATAAAGAA- 
AATGTGTTGA 
GTGTGTGTGT 
AAAATGGTAA 
AGAGCTTCCT* 
TTCCTGTCCA- 
TGTGGGAAGG 
ATTTCTGCAT 
GCTT7TTGGT 
CATTTTAAAA 
TAGTGACCAA 
CTAGAGGGAG 
AACCTAAGCC 
CTCCTCACTG 
GGCCCCCTTC 
CTCCAGGTTT 
AGAAATTTTA 
TTGTCATTTT 
GTCAGAGGGC 
GGTGGGAGTA 



21 * . " 
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CCCTCCCGGC 
CCGGCGCTCC 
TCGTGATGGT 
CAAAATAATT 
AAGTGATCCT 
GCTGTCTGAT 
GAAAGAGGTT 
AGAAACTGTT 
GAATTCCTTG 
CTATACTGTC 
TTATATAGAA 
TGATGTTTTT 
CCTCCCACTA 
AATTTATAAT 
TGCCACAGAC 
GCAGACACCA 
AGTCTCTCAT 
AGACATGGATv 
AGATTCAAAT 
AAATGCATTC 
TGCCAATTGG 
CAGCACAGAC 
AGAAAACCGT 
TATTCCCAGA- 
TGAGGGGCCT- 
GGGGTCAAAG 
AAGGTACAAA 
AATCAT AACT 
TATTACAGTC 
CATTGAAGAT 
ACCA AAAA TG 
TCCATTTTAT- 
CAAAGTTAAT 
TACCATTCCT 
TAATCTGTGT 
AATACTTGGA 
ATTGCTAACT 
TTTAGCACAG 
CTCTGCCAAT 
TATGGGATCA 
CCAGACCTTG 
AGGACACACG 
ACCCCGTCTC 
AATTGCATCG 
ATAACTATGA 
AAGAAGATGG 
CATGCACAAA 
TTTCCAAAAA 
TCAATTTTGA 
TTCCAAAAAG 
ATTAAGGTCT 
GCTGGATAAA 
CACTTTAAGT 
GTTTTGGAAA 
TTGGGACTCA 
AGTGCCCTAT 
ATATTATTAT 
AAACTTGAAA 
AGGCCTGGGC 
ATTTGTGTAA 
AAATAGGGAA 
CCACAAGTTA 
AGCAAGGGTC 
CTTACCTTTA 
CATCTTTTTA 
CTGAGGGGAG 
CCACAAACTT 
CCCTTCTTCT 
TTTCGGCTTT 
TCCACCATCC 
AAATAATAGA 
AGAACAGAGG 
'AACAGGAAGA 
AAAGCAACAT 



31 
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GCCCGCGTTC 
GTGCGCGGAG 
GAAGCCTGCA 
GGCAGAGTTA- 
GATTTCAGAG. 
AAGAAAAGAT- 
ACTGTGCTGC 
CTCAGGCGTG 
GGCCCTTTCC 
TTCTACTCAA- 
AGAGACACTG 
GATTTGATTG. 
CCCATCAGGG, 
TTTGAAGTTT 
AGAGATGAAC 
AGGTCACCTG 
TATTTGGACA 
GGCCAGTTTT 
GATAATGCAC 
AATGTGGAAA 
AGAGTCAATT 
AAAGAAACTA 
CAAGTGAACC 
GTGACAGCCT 
GAATGCACTC 
ATCAACGGCT 
AAATTGCATG 
TCCAAAATCC 
CTGGCAATAG 
GTAAATGATA 
GGGTATACCG 
TTCAGTTTGC 
GATACAGCTG 
ATTACTGTAA 
GAATGTACTC 
AAATGGGCAA 
TTAGTATGTG 
CAAAACTTAA 
GGATTTATGA 
GGAATGAAAA 
GAATCCTGCC 
GAGGTGGACA 
GGTGAAGAAT 
ATGTAATCAG 
GGGAAGAGGA 
CCTTGACTTT 
GAGATAATGT 
TAATATTGTA 
ATTATGCTAC 
TGAAAAATGT 
CTAAAGCATC 
TATTAGTCCA 
GATAGTTTAA 
AGAAACAATG 
CCCCTACTGC 
GAAGTAGCAA 
TAATCAATGC 
TGAGGCTGGG 
TCTTAAATGC 
TTTGTTTAAA 
TCCAATGGAA 
GTAGCAAACT 
CAGAGATGAG 
CTGAAGTTAA 
ATTTAGATCC 
GATCTTACTG 
GACACCTGAT 
GAGTGGCATT 
CTGCTAAAGC 
TTCAGCGTGA 
AGAAATAGAA 
GAACTTTGGG 
TGCAGGCCTT 
CGTCTGCTTC 



WFARGAHGD 
GKSLGLGHSS 
LCDPNLSFDA 
RKQVFLFKDD 
WRYDERRQMM 
TLlKSNSWFGC 
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TCCTGGCCCT 
CCGTCTGCCT 
AAAAGGTGAT 
ATTTGGAAGA 
TTCTAAATGA- 
CATTTACCAT 
TAGAACATCA 
CCAAGAGGAG 
CATTGTTTCT 
TAAGTGGACG 
GAAATCTATT 
CTTATGCGTC 
TAGAGGATGA 
TGGAAAGTAG 
CGGACACAAT 
GGCTCTTTTC 
GAGAGGTTGT 
TTGGATTGAT 
CCACTTTCAG. 
TCTTACGAAT 
TTACCATTTT 
ATGAAGGTGT 
TGGAAATTGG 
TGAACAGAGC 
CTGCAGCCCA 
ATAAGGCATA 
ATCCTAAAGG 
TGGATAGGGA 
ACAAAGATGA 
ATCCACCAGA 
ACATTTTAGC 
CCAATACTTC 
CCCGTCTTTC 
AAGACAGGGC 
ATCCAACTCA 
TCCTTGCAAT 
GAGTTTTTGG 
TTATATCAAA 
CCCAAACTAC 
ATGGAGGGCA 
GGGGGGCTGG 
ACTGCAGATA 
CCATTAGAGG 
AATGAAGACC 
TCTCCAGCTG 
TTAAATAATT 
CACAGTGCTA 
AAGTTCAATT 
TCACCAATTT 
TAAAACAGAC 
TGCTCTTTTT 
ACAATAGCTA 
AAAATAAACA 
AAGACTGAAT 
ACTACCAAAT 
TTTTCTATAG 
AATATTTAAA 
GTATAGTTTG 
TGCATTATAA 
ATTGTAAATA 
CAGTAGCTTT 
GGGGAATACT 
GTGTTTTTTT 
ATCCTCTATT 
AAATAACCAT 
AAAGCACCCT 
CAGGTCTGGG 
GGCCTGAATC 
AACACCTCCA 
ATTAATTTTT 
ATTTTGAATG 
AGAAAGCAGC 
CAAGGGCAAG 
ATACTTTTTC 



FHAFDGKGGI 

dpkavmfpty 
vttvgnkiff 
kywlisnlrp 

DPGYPKLITK 



180 
-240 
300 
360 
420 



AGTAAACAAT- 
CTTGGTTACA" 
ATATGTGCGG 
TGACCCCGAA 
TTGGATCACC 
GGTTGAAACT 
TAGATCATGT 
AATACTTCAA 
TGTTGATCCT 
TCCAGAAATC 
ATATCAGAAA 
CGGCCAAGCT 
GTGTCGTGCG 
ATTACTGGGT 
TGCAACTAAA 
CACAGAAGCA 
CAACAACTCT 
GGAAACCATT 
GCATCATCAT . 
CACTTACTCG 
ACACACTGGT 
GCATGCCATC 
GTTCTGTGGG-- 
TGGAACCCAA 
CAATTAGGTC . 
TCAACATGTA 
ATATTTTTAA 
AACTGGTAAA 
TTTTTTTACG 
AGTTATGCTA 
AGAAATATTG 
TAAATTAAAA 
TCATTTGACT ■ 
GAATATAGTT 
ATGAAATGAG 
TCCTACAATA 
CTGAGTCTAT 
AATTAAACTT 
GCTTTGCAGT 
CGCTGCAGCT 
CGGGGAGCTA 
GCTGTTTCTA 
GTCCTCCTAG^ 

ggggagattg; 
agctacaaaa' 
aaggaaagcc 
gcagagattc 

AATCAGTTTG 
TATAAAAGAA 
CCAAGTAGGT 
GAGAGGCCAC 
CTAGGCTTGG 



60 
120 
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240 
300 
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600 
660 
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900 
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1200 
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2100 
2160 
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2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2680 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
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CACTGCCTTT TCCTTTCTCA GGCCAATGGC 
CCAACCTCTT CTCTATGGCT CACCTTATTT 
TGCATGATGA GTCTGAAGGC ATTTGCAGGA 
AGGCATTCAT GGGAATTGTT GTATTCCTTC 
CTATGAATTA- AATGCCTATC TAAAATTCTG 
TGACCCTAAA ATCTATGTGT TTTAGACTTA 
TTGAGACGGA GTCTCGCTCT GACGCACAGG 
CTGAAAGCTC CGCCTCCCGG GTTCATGCCA 
GACTAGAGGC GCCCACCACC ACGCCCGGCT 
TTCACTGTGT TAGCCAGGAT GGTCTCGATC 
TCCCAAAGTG CTGGGATTAC AGGCATGACC 
GTCGTCTTCT TTTAATGTAA- TCATTTTGAA' 
TCAATCTTGA AATACTCAAC CAAAAGACAG 
GCACAAAATA- TTGGTCTGAG AATGGAATTC 
TGTAACCAGA AGCCAGTTTT. ATCTAACGGC 
CCCACTCACC, GATCAAAACC. TGCTACCTCC 
TCAAAGAGCA ACCAGTATCA- CTTCCCTGTT- 
TGAACATGCT GAAAACCACC TGGTCTGCAT 
AAATGAAAATTTAATTTTAG GGATTCATTT 
TCCTTATATG " TGTAAGGTGA AATTTATGGT 
AGCTTTCATT TTTCCCCCAG TGAATGATTT 
TTTTCTTACT TTTATAAGGA AGCAGCTGTC 
TTTAAACAGA' GTTTTAGTAT TGCTATTAAA- 
CTGCTTAAAA TAAGCAAAAA TTGGATGCAT 
AATAAAACAA TATTAACTTG GCTGCTTAAA 
TTTACAGATG" TGGGGAGATG TAATAAAACA 
TTTAGAGATT - AAATAATTCT AAGATGATCA 
GAAATAGAAA TACTCAATTA TGTCTTTGTT 
TCATTATCAA ATTGTCGACA" TCATTAATAT 
TTGAAGCACA GCTTTACAGA TGAGTATCTA- 
GTATTAAAAG TATTAGAAGG- TGGTTATAAT 
ACAGGGGTTT TACTTTGAGG ACCAGTGTAG 
CAGGCAATAT TGCAGTCTTG ATTCTGCCAC 
GACAAGATGA TCCAACCATA- AAGGTGCTCT 
GGAGTGTGCT CCCCTACAAA CGTTAAGACT 
AAAGCCTTAC ATTTTAATAT AGGTTGAACC 
ACCATTATTT TTGTGTATGT CTTCAAGAAT 
ATACCGGATA CATTTCACGT GTCCTTCAGT 
GTTGAGAAGC ATGGACACTA- GAGCCAGAAT 
ACTTCTGTGT GACCTTTGAA AGGCTACTTA 
ATGAACAATG CCAGCCTCAT GGGGTTGTTG 
ACATAGAACA CTGCCTGCAC ATAGTAAAAG 
ATGTAGTTGG ATATACTACC GAACAATATC 
CATATATATA ATCCCGAAAC ' ATG 

. Seq ID NO: 527 Protein sequence 
Protein Accession #: NP_077741.1 
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AACTGCCATT 
GGAGTGAGAA 
TGAGCCTGAA 
TGCAGCCCTC 
ATTTATTCCT 
GACTTTTTAT 
CTGGAGTGCA 
TTCTCCTGCC 
AATTTTTTGT 
TCCTGACCTC 
CACCGCTCCC 
CATGTGTGAA 
TCGAGAAGCC 
TCTGTAAGCC 
TACTGAAACA 
CCAAGACTTT 
TATAAAACCT 
GTATGCCCGA' 
CTATATTTTC 
ATTTGAGTGT 
AGAATTTTTT 
TAAAATGCAG 
AGAAGTTACT 
AAAGTAATAT 
ATAAGCAAAA 
ATATTAACTT. 
CTTTGCAAAA 
GTATTAATGG 
ATATTGTAAT 
TGATACATAT 
TGCAGAGTAT, 
TCAAGGGAAA 
TTACAGGATA 
GTGCTTCACA 
GATCATTTCA 
AAAATTTCAA 
GTTCATTGGA 
ATTGATTTGG 
GCTTGGATAT. 
TTTCCTCTCT' 
AATGATTAAA 
AATTATAAGT 
TAATCTCTTT 



TGAGTCOGGT 
ATCAAGGAGA 
CTGGTTGTGC 
CTTCTGGGCA 
ACATTTTCTG 
TGCCCCCCCC 
GTGGCTCCGA 
TCAGCCTCCT 
ATTTTTAATA 
GTGATCCGCC 
GGCCTTGTTT 
AGTTGATCAT 
AGGGGGAGAA 
TAGTTGCTGA 
CCCACTGTGT 
ACTAGTGCCG 
CTAACCATCT 
ATTTGTAATT 
ACATATGTAG 
GCAAGAAAAT 
ATGTAAATAT 
TGGGGTTTGT 
TTGCTTTTAA 
TTACAGATGT 
ATTGGATGCA 
GGTTTCTTGT 
TTATGCTTAT 
GGAATATTTT 
GTTGGGAAGA 
GTATAATAAA 
TCCATGAATA 
ACATGAGTTA 
GATAATGCCT 
GTGAATCTTT 
AAAATCTATT 
TTCCAGTAAC 
TTTTTGTTTG 
TTGAATATTG 
GAATCCTGGA 
TAGCTTTCTC 
TTAGTTAATA 
GTGAGGTAGT 
TTAGGGAAAT 



GAGGGATCAG 
CAGAGCTGAC 
AGAACAAACA- 
CTAAGAAGGT 
TTTTCTAATT 



TCTCTGCTCA 
GAGTAGCTGG- 
GAGACGGGGT' 
TGCCTCGGCC 
TCCGTTTAAA- 
ACGAATTGGA 
AGAACTCAGG 
AATTTCCTGC 
TTTGCTCACT 
ATAAACTTTC 
CTTTGTTCTT. 
CTTTTCTCTC 
TATTATTATT 
ATATTTTT AA 
ACAGAATGTT 
TTTGCAATGT 
AGAAACTTGG 
GGGGAGATGT 
TAAAGTAATA 
TTTTGCTGTA 
GGCTGGCATG ! 
GGACAATGTT 
GATCACTATT 
TTTTGATCGG' 
GTACACTGAC 
AAAAGAAAAG. 
GAACTTTAAT. 
TCCCCATGCA' 
AGCTATATCA 
TTCTATTGTA 
TAATAGTAAA 
GGTCATAATG 
TCTGTCACTT 
ATTAAAATCA 
TACCTAAAGT 
TGGTAAAATT 
AAAGTTTGTG 



4500 
4560 
4620 - 
•4680 
'4740 
4800 
4860 
4920 - 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5320 
5880 
5940 
6000 
6060 
6120 
61B0 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 



MAAAGPRRSV. 
ADLIRSSDPD 
KTRHTRETVL 
EPLNLFYIER 
PVFTEAIYNF 
TGVITTVSHY 
EAFVEENAFN 
KPLNYEENRQ 
KENLAVGSKI 
KNELYNITVL 
BPVHGAPFYF 
TKLLRVNLCE 
KRFPEDLAQQ 
MMKGGNQTLE 



11 

I . 

RGAVCLHLLL 
PRVLNDGSVY 
RRAKRRWAPI* 
DTGN1*FCTRP 
EVLES SRPGT-' 
LDREWDKYS 
VEILRIPIED. 
VNLEIGVNNE 
NGYKAYDPEN- 
AIDKDDRSCT- 
SLPNTSPEIS 
CTHPTQCRAT 
NLIISNTEAP 
SCRGAGHHHT 



21 
I 

TLVIFSRDGE 
TARAVALSDK 
PCSMQENSLG 
VDREEYDVFD 
TVGWCATDR- 
LIMKVQDMDG" 
KDLINTANWR 
APFARDIPRV- 
RNGNGLRYKK 
GTLAVNIEDV 
RLWSLTKVND 
SRSTGVILGK 
GDDRVCSANG 
LDSCRGGHTE 



31 

1 

ACKKVILNVP 
KRSPTIWLSD 
PFPLFIjQQVE 
LIAYASTADG 
DEPDTMHTRL 
QFFGLIGTST 
VNFTILKGNE 
TAXiNRALVTV 
LHDPKGWITI V 
NDNPPEILQE 
TAAPXSYQKN: 
WAILAILLGI 
FMTQTTNNSS 
VDNCRYTYSE 



41 

I 

SKLEADKIIG 
KRKQTQKEVT 
SDAAQNYTVF 
YSADLPLPLP 
KYSILQQTPR 
CIITVTDSND 
NGHFKISTDK 
HVRDLDEGPE 
DEISGSIITS 
YWICKPKMG 
AGFQEYTIPI 
ALLFSVLLTL 
QGFCGTMGSG 
WHSFTQPRLG 



51 
I 

RVNLEECFRS 
VLLEHQKKVS 
YSISGRGVDK 



SPGLFSVHPS 
NAPTFRQNAY 
ETNEGVLSW 
CTPAAQYVRI 
KILDREVETP 
YTDILAVDPD 
TVKDRAGQAA 
VCGVFGATKG 
MKNGGQETIE 
EESIRGHTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



Seq ID NO: 528 DNA sequence 

Nucleic Acid Accession #: NM_001941.2 

Coding sequence: 64.-2754 



GGCAGGTCTC 
CCGATGGCCG 
CTGACCCTCG 
CCTTCTAAAC 
TCTGCAGACC 
TACACAGCCA 
GACAAAAGGA 
. TCGAAGACAA 
ATTCCTTGCT 
GAATCTGATG 
AAAGAACCTT 
CCTGTGGATC 
GGATATTCAG 
CACCCTGTTT 
ACTACAGTGG 
CTGAAATACA 



11 
I 

GCTCTCGGCA 
CCGCTGGGCC 
TGATCTTCAG 
TAGAGGCAGA 
TCATCCGGTC 
GGGCTGTTGC 
AACAGACACA 
GACACACTAG. 
CTATGCAAGA 
CAGCACAGAA 
TAAATTTGTT 
GTGAAGAATA 
CAGATCTGCC 
TCACAGAAGC 
GGGTGGTTTG 
GCATTTTGCA 



21 
I 

CCCTCCCGGC 
CCGGCGeTCC 
TCGTGATGGT 
CAAAATAATT 
AAGTGATGCT 
GCTGTCTGAT 
GAAAGAGGTT 
AGAAACTGTT. 
GAATTCCTTG 
CTATACTGTC 
TT AT AT AGAA 
TGATGTTTTT. 
CCTCCCACTA 
AATTTATAAT 
TGCCACAGAC 
GCAGACACCA 



31 
i 

GCCCGCGTTC 
GTGCGCGGAG 
GAAGCCTGCA 
GGCAGAGTTA 
GATTTCAGAG 
AAGAAAAGAT 
ACTGTGCTGC 
CTCAGGOGTG 
GGCCCTTTCC 
TTCTACTCAA 
AGAGACACTG 
GATTTGATTG 
CCCATCAGGG 
TTTGAAGTTT 
AGAGATGAAC 
AGGTCACCTG 



41 
I 

TCCTGGCCCT 
CCGTCTGCCT 
AAAAGGTGAT 
ATTTGGAAGA 
TTCTAAATGA 
CATTTACCAT 
TAGAACATCA 
CCAAGAGGAG 
CATTGTTTCT 
TAAGTGGACG 
GAAATCTATT 
CTTATGCGTC 
TAGAGGATGA 
TGGAAAGTAG 
CGGACACAAT 
GGCTCTTTTC 



■51 
I 

GCCCGGCATC 
GCATCTGCTG 
ACTTAATGTA 
GTGCTTCAGG 
TGGGTCAGTG 
ATGGCTTTCT- 
GAAGAAGGTA 
ATGGGCACCT 
TCAACAAGTT 
TGGAGTTGAT 
TTGCACTCGG 
AACTGCAGAT 
AAATGACAAC 
TAGACCTGGT 
GCATACGCGC 
TGTGCATCCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720- 
780 
840 
900 
960 
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WO 02/086443 

AGCACAGGCG TAATCACCAC ; AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 
TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA .1080 
ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC "CCACTTTCAG ACAAAATGCT 1140 
TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 
GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 
GAAAATGGAC ATTTCAAAAT " CAGCACAGAC AAAGAAACTA' ATGAAGGTGT. TCTTTCTGTT' 1320 
GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 
GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 
GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG . 1500 
ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 
AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG -ATCCTAAAGG TTGGATCACC ' 1620 
ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 
CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 
ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ' ATCCACCAGA AATACTTCAA 1800 
GAATATGTAG TCATTTGCAA ACCAAAAATG' GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 
. GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 
AGTAGACTGT GGAGCCTCAC CAAAGTTAAT. GATACAGCTG CCCGTCTTTC. ATATCAGAAA 1980 
AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT; 2040 
GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA- GTGTCGTGCG 2100 
ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 
ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA ' 2220 
GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA' TTATATCAAA CACAGAAGCA 2280 
CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 
AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 
GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 
ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG .2520 
GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAAAAT . TGCATCGATG TAATCAGAAT 2580 
GAAGACCGCA TGCCATCCCA- AGATTATGTC CTCACTTATA" ACTATGAGGG AAGAGGATCT 2640 
CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2700 
AATAATTTGG AACCCAAATT TATTACATTA GCAGAAGCAT GCACAAAGAG ATAATGTCAC 2760 
AGTGCTACAA TTAGGTCTTT. GTCAGACATT CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2820 
TTCAATTTCA' ACATGTATGT ATATGATGAT TTTTTTCTCA ATTTTGAATT ATGCTACTCA " 2880 
CCAATTTATA TTTTTAAAGC CAGTTGTTGC TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2940 
AACAGACAAC TGGTAAATCT CAAACTCCAG CACTGGAATT- AAGGTCTCTA AAGCATCTGC 3000 
TCTTTTTTTT TTTTACGGAT ATTTTAGTAA TAAATATGCT GGATAAATAT TAGTCCAACA 3060 
ATAGCTAAGT TATGCTAATA' TCACATTATT ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120. 
ATAAACAAGA AATATTGAGT ATCACTATGT GAAGAAAGTT TTGGAAAAGA AACAATGAAG 31 BO 
ACTGAATTAA ATTAAAAATG TTGCAGCTCA TAAAGAATTG GGACTCACCC CTACTGCACT 3240 
ACCAAATTCA TTTGACTTTG GAGGCAAAAT GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3300 
TCTATAGGAA TATAGTTGGA AATAAATGTG TGTGTGTATA" TTATTATTAA TCAATGCAAT 3360 
ATTTAAAATG AAATGAGAAC AAAGAGGAAA ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420 
TAGTTTGTCC TACAATAGAA AAAAGAGAGA GCTTCCTAGG CCTGGGCTCT TAAATGCTGC 3480 
ATTATAACTG AGTCTATGAG GAAATAGTTC CTGTCCAATT TGTGTAATTT GTTTAAAATT 3540 
GTAAATAAAT TAAACTTTTC TGGTTTCTGT GGGAAGGAAA TAGGGAATCC AATGGAACAG 3600 
TAGCTTTGCT TTGCAGTCTG TTTCAAGATT TCTGCATCCA CAAGTTAGTA GCAAACTGGG 36S0 
GAATACTCGC TGCAGCTGGG GTTCCCTGCT TTTTGGTAGC AAGGGTCCAG AGATGAGGTG 3720 
TTTTTTTCGG GGAGCTAATA ACAAAAACAT TTTAAAACTT ACCTTTACTG AAGTTAAATC 3780 
CTCTATTGCT GTTTCTATTC TCTCTTATAG TGACCAACAT CTTTTTAATT TAGATCCAAA 3 840 
TAACCATGTC CTCCTAGAGT TTAGAGGCTA GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3 900 
GCACCCTGGG GAGATTGATT GTCCTTAAAC CTAAGCCCCA CAAACTTGAC AOCTGATCAG 3960 
GTCTGGGAGC TACAAAATTT CATTTTTCTC CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020 
CTGAATCAAG GAAAGCCAGG CCTTGTGGGC CCCCTTCTTT CGGCTTTCTG CTAAAGCAAC 4080 
ACCTCCAGCA GAGATTCCCT TAAGTGACTC CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140 
AATTTTTAAT CAGTTTGCTT TCTCCAGAGA AATTTTAAAA TAATAGAAGA AATAGAAATT 4200 
TTGAATGTAT AAAAGAAAAA GATCAAGTTG. TCATTTTAGA ACAGAGGGAA CTTTGGGAGA 4260 
AAGCAGCCCA AGTAGGTTAT TTGTACAGTC AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 4320 
GGGCAAGGAG AGGCCACAAG. GAATATGGGT GGGAGTAAAA GCAACATCGT CTGCTTCATA 4380 
CTTTTTCCTA GGCTTGGCAC TGCCTTTTCC TTTCTCAGGC CAATGGCAAC TGCCATTTGA 4440 
GTCCGGTGAG GGATCAGCCA ACCTCTTCTC TATGGCTCAC CTTATTTGGA GTGAGAAATC 4500 
AAGGAGACAG AGCTGACTGC ATGATGAGTC TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560 
GTTGTGCAGA ACAAACAAGG CATTCATGGG AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 4620 
CTGGGCACTA AGAAGGTCTA TGAATTAAAT GCCTATCTAA- AATTCTGATT TATTCCTACA * 4680 
TTTTCTGTTT TCTAATTTGA CCCTAAAATC TATGTGTTTT AGACTTAGAC TTTTTATTGC 4740 
CCCCCCCCCC TTTTTTTTTG AGACGGAGTC TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 4800 
GCTCCGATCT- CTGCTCACTG AAAGCTCCGC CTCCCGGGTT CATGCCATTC TCCTGCCTCA 4860 
GCCTCCTGAG TAGCTGGGAC TACAGGCGCC CACCACCACG CCCGGCTAAT TTTTTGTATT 4920 
TTTAATAGAG ACGGGGTTTC ACTGTGTTAG CCAGGATGGT CTCGATCTCC TGACCTCGTG 4980 
ATCCGCCTGC CTCGGCCTCC CAAAGTGCTG GGATTACAGG CATGACCCAC CGCTCCCGGC 5040 
CTTGTTT TCC GTTTAAAGTC GTCTTCTTTT AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100 
TGATCATACG AATTGGATCA ATCTTGAAAT ACTCAACCAA AAGACAGTCG AGAAGCCAGG 5160 
GGGAGAAAGA ACTCAGGGCA CAAAATATTG GTCTGAGAAT GGAATTCTCT GTAAGCCTAG 5220 
TTGCTGAAAT TTCCTGCTGT AACCAGAAGC CAGTTTTATC TAACGGCTAC TGAAACACCC 5280 
ACTGTGTTTT GCTCACTCCC TCACTCACCG ATCAAAACCT GCTACCTCCC CAAGACTTTA 5340 ' 
CTAGTGCCGA TAAACTTTCT CAAAGAGCAA CCAGTATCAC TTCCCTGTTT ATAAAACCTC ' 5400 
TAACCATCTC TTTGTTCTTT GAACATGCTG AAAACCACCT GGTCTGCATG TATGCCCGAA 5460 
TTTGTAATTC TTTTCTCTCA AATGAAAATT TAATTTTAGG GATTCATTTC TATATTTTCA ' 5520 
CATATGTAGT ATTATTATTT CCTTATATGT GTAAGGTGAA ATTTATGGTA TTTGAGTGTG ' 5580 
CAAGAAAATA TATTTTTAAA GCTTTCATTT TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640 
TGTAAATATA CAGAATGTTT TTTCTTACTT TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700 
GGGGTTTGTT TTGCAATGTT TTAAACAGAG TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760 . 
TGCTTTTAAA GAAACTTGGC TGCTTAAAAT AAGCAAAAAT TGGATGCATA AAGTAATATT 5820 . 
TACAGATGTG GGGAGATGTA ATAAAACAAT ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880 
TAGAGATTAA ATAATTCTAA GATGATCACT TTGCAAAATT ATGCTTATGG CTGGCATGGA 5940 
AATAGAAATA CTCAATTATG TCTTTGTTGT ATTAATGGGG AATATTTTGG ACAATGTTTC 6000 
ATTATCAAAT TGTCGACATC ATTAATATAT ATTGTAATGT TGGGAAGAGA TCACTATTTT 6060 . 
GAAGCACAGC TTTACAGATG AGTATCTATG ATACATATGT A7AATAAATT TTGATCGGGT 6120 
ATTAAAAGTA TTAGAAGGTG GTTATAATTG CAGAGTATTC CATGAATAGT ACACTGACAC 6180 



388 



AGC^^^^^GAC CAGTGTAGTC AAGGGAAAAC ATGAGTTAAA AAGAAAAGCA 6240 

GGCAATATTG CAGTCTTGAT TCTGCCACTT ACAGGATAGA TAATGCCTGA ACTTTAATGA 6300 

CAAGATGATC CAACCATAAA GGTGCTCTGT GCTTCACAGT GAATCTTTTC CCCATGCAGG 636.0 

AGTGTGCTCC CCTACAAACG TTAAGACTGA TCATTTCAAA AATCTATTAG CTATATCAAA 6420 

AGCCTTACAT TTTAATATAG- GTTGAACCAA- AATTTCAATT CCAGTAACTT CTATTGTAAC 6480 

CATTATTTTT GTGTATGTCT TCAAGAATGT TCATTGGATT- TTTGTTTGTA ATAGTAAAAT 6540 

ACCGGATACA TTTCAGGTGT CCTTCAGTAT. TGATTTGGTT GAATATTGGG TCATAATGGT 6600 

TGAGAAGCAT GGACACTAGA GCCAGAATGC TTGGATATGA ATCCTGGATC TGTCACTTAC 6660 

TTCTGTGTGA CCTTTGAAAG GCTACTTATT TCCTCTCTTA GCTTTCTCAT TAAAATCAAT 6720 

GAACAATGCC AGCCTCATGG. GGTTGTTGAA TGATTAAATT AGTTAATATA- CCTAAAGTAC 6780 

ATAGAACACT GCCTGCACAT AGTAAAAGAA- TTATAAGTGT GAGGTAGTTG GTAAAATTAT 6840 

GTAGTTGGAT ATACTACCGA ACAATATCTA ATCTCTTTTT AGGGAAATAA AGTTTGTGCA 6900 
TATATATAAT CCCGAAACAT G 

Seq ID NO: 529 Protein sequence 
Protein Accession #: NP_001932.1* 

1 IX 21 : 31 41 51 

MAAAGPRRSV RGAVCLHLLL TLVIESRDGE ACKKVILNVP SKIiEADKIIG RVNLEECFRS 60 

ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSPTIWLSD- KRKQTQKEVT VLLEHQKKVS. 120 

KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE- SDAAQNYTVF YSISGRGVDK- 180 

EPLNLFYIER- DTGNLFCTRP VDREEYDVFD I»I AYASTADG YSADLPLPLP IRVED2NDNH* 240 

PVFTEAIYNP EVLESSRPGT TVGWCATDR- DEPDTMHTRL. KYSILQQTPR SPGLFSVHPS; 300 

TGVITTVSHY LDREWDKYS LIMKVQDMDG QFFGLIGTST- CIITVTDSND- NAPTFRQNAY 360 

EAFVEENAFN VBILRIPIED KDLINTANWR VNFTILKGNS NGHFKISTDK ETNEGVLSW 420 

KPLNYEENRQ - VNLEIGVNNE APFARDIPRV- TALNRALVTV HVRDLDEGPE. CTPAAQYVRI- 480 

• KENLAVGSKL NGYKAYDPEN RNGNGLRYKK" bHDPKGWITl DE1SGSIITS "KILDREVETP 540 

KNELYNITVL AIDKDDRSCT GTIAVNIEDV NDNPPEILQE YWICKPKMG YTDILAVDPD 600 

EPVHGAPFYF SLPNTSPEIS RLWSLTKVND . TAARLSYQKN- AGFQEYTIPI TVKDRAGQAA 660 

TKLLRVNLCE CTHPTQCRAT SRSTGVILGR WAILAILLGI ALLFSVLLTL VCGVFGATKG 720 

KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS-' QGFCGTMGSG MKNGGQETIE 780 

MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EKLHRCNQNE 840 
DRMPSQDYVL TYNYEGRGSP AGSVGCCSEK QEEDGLDFLN NUEPKFITLA EACTKR 



Seq ID NO: -530 DNA sequence 

Nucleic Acid Accession #x NM_016583.2 

Coding sequence : 72 . . 842 

1 11' • 21 . 31 |1 51 

GGAGTGGGGG AGAGAGAGGA GACCAGGACA GCTGCTGAGA CCTCTAAGAA GTCCAGATAC 60 

TAAGAGCAAA. GATGTTTCAA ACTGGGGGCC TCATTGTCTT CTACGGGCTG TTAGCCCAGA- 120 

CCATGGCCCA GTTTGGAGGC CTGCCCGTGC CCCTGGACCA GACCCTGCCC TTGAATGTGA 180 

ATCCAGCCCT GCCCTTGAGT- CCCACAGGTC TTGCAGGAAG. CTTGACAAAT GCCCTCAGCA 240 

ATGGCCTGCT GTCTGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTGGACATCC 300 

TGAAGCCTGG AGGAGGTACT TCTGGTGGCC TCCTTGGGGG ACTGCTTGGA AAAGTGACGT 360 

CAGTGATTCC TGGCCTGAAC AACATCATTG ACATAAAGGT CACTGACCCC CAGCTGCTGG 420 

AACTTGGCCT TGTGCAGAGC CCTGATGGCC ACCGTCTCTA TGTCACCATC CCTCTCGGCA 480 

TAAAGCTCCA AGTGAATACG CCCCTGGTCG GTGCAAGTCT GTTGAGGCTG GCTGTGAAGC 540 

TGGACATCAC TGCAGAAATC TTAGCTGTGA GAGATAAGCA GGAGAGGATC CACCTGGTCC 600 

TTGGTGACTG CACCCATTCC CCTGGAAGCC TGCAAATTTC TCTGCTTGAT GGACTTGGCC 660 

CCCTCCCCAT TCAAGGTCTT CTGGACAGCC TCACAGGGAT CTTGAATAAA GTCCTGCCTG 720 

AGTTGGTTCA GGGCAACGTG TGCCCTCTGG TCAATGAGGT TCTCAGAGGC TTGGACATCA- 780 

CCCTGGTGCA TGACATTGTT AACATGCTGA TCCACGGACT ACAGTTTGTC ATCAAGGTCT 840 

AAGCCTTCCA GGAAGGGGCT GGCCTCTGCT GAGCTGCTTC CCAGTGCTCA CAGATGGCTG 900 

GCCCATGTGC TGGAAGATGA CACAGTTGCC TTCTCTCCGA GGAACCTGCC CCCTCTCCTT 960 

TCCCACCAGG CGTGTGTAAC ATCCCATGTG CCTCACCTAA TAAAATGGCT CTTCTTCTGC 1020 
AAAAAAAAAA AAAAAAAAAA .AAAAAAAAA 

Seq ID NO: 531 Protein sequence 
Protein Accession &: NP_057667.1 

1 11 • 21 - 31 41 51 

MFQTGGLIVF YGLLAQTMAQ FGGLPVPLDQ TLPLNVNPAL PLSPTGLAGS LTNALSNGLL 60 

SGGLLGILEN LPbLDILKPG GG1SGGLLGG LIX5KVTSVIP GLNNI IDIKV TDPQLLEU3L 120 

VQSPDGHRLY VTIPLGIKLQ VNTPLVGASL LRLAVKLDIT AEILAVRDKQ ERIHLVLGDC 180 

THSPGSLQIS LLDGLGPLPI QGLLDSLTGI LiNKVLPELVQ GNVCPLVNEV LRGLDITLVH 24 0 
DIVNMLIHGL QFVIKV 



Seq ID NOs 532 DNA sequence -: 

Nucleic Acid Accession 8: NM_0 043 63.1 • 
Coding sequence : 115.. 2223 

1 11 21 31 .41 51 

CTCAGGGCAG AGGGAGGAAG GACAGCAGAC CAGACAGTCA CAGCAGCCTT GACAAAACGT 60 

TCCTGGAACT CAAGCTCTTC TCCACAGAGG AGGACAGAGC AGACAGCAGA GACCATGGAG 120 

TCTCCCTCGG CCCCTCCCCA CAGATGGTGC ATCCCCTGGC AGAGGCTCCT GCTCACAGCC 180 

TCACTTCTAA CCTTCTGGAA CCCGCCCACC ACTGCCAAGC TCACTATTGA ATCCACGCCG 240 

TTCAATGTCG CAGAGGGGAA : GGAGGTGCTT CTACTTGTCC ACAATCTGCC CCAGCATCTT 300 

TTTGGCTACA GCTGGTACAA AGGTGAAAGA GTGGATGGCA ACCGTCAAAT TATAGGATAT 360 

GTAATAGGAA CTCAACAAGC TACCCCAGGG CCCGCATACA GTGGTCGAGA GATAATATAC 420 

CCCAATGCAT CCCTGCTGAT CCAGAACATC ATCCAGAATG ACACAGGATT CTACACCCTA 480 
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CACGTCATAA AGTCAGATCT- TGTGAATGAA GAAGCAACTG GCCAGTTCCG GGTATACCOG 
GAGCTGCCCA AGCCCTCCAT CTCCAGCAAC AACTCCAAAC CCGTGGAGGA CAAGGATGCT 
GTGGCCTTCA CCTGTGAACC TGAGACTCAG GACGCAACCT ACCTGTGGTG GGTAAACAAT. 
CAGAGCCTCC CGGTCAGTCC CAGGCTGCAG CTGTCCAATG GCAACAGGAC CCTCACTCTA 
TTCAATGTCA CAAGAAATGA CACAGCAAGC TACAAATGTG AAACCCAGAA CCCAGTGAGT 
GCCAGGCGCA GTGATTCAGT; CATCCTGAAT GTCCTCTATG GCCCGGATGC CCCCACCATT 
TCCCCTCTAA ACACATCTTA CAGATCAGGG : GAAAATCTGA ACCT CTCCTG- CCACGCAGCC : 
TCTAACCCAC CTGCACAGTA CTCTTGGTTT GTCAATGGGA CTTTCCAGCA ATCCACCCAA 
GAGCTCTTTA' TCCCCAACAT CACTGTGAAT AATAGTGGAT CCTATACGTG CCAAGCCCAT 
AACTCAGACA- CTGGCCTCAA TAGGACCACA GTCACGACGA TCACAGTCTA TGCAGAGCCA 
CCCAAACCCF TCATCACCAG CAACAACTCC AACCCCGTGG AGGATGAGGA" TGCTGTAGCC 
TTAACCTGTG AACCTGAGAT TCAGAACACA ACCTACCTGT- GGTGGGTAAA TAATCAGAGC 
CTCCCGGTCA GTCCCAGGCT, GCAGCTGTCC AATGACAACA GGACCCTCAC TCTACTCAGT 
GTCACAAGGA ATGATGTAGG- ACCCTATGAG TGTGGAATCC AGAACGAATT AAGTGTTGAC 
CACAGCGACC CAGTCATCCT. GAATGTCCTC TATGGCCCAG' ACGACCCCAC CATTTCCCCC 
TCATACACCT ATTACCGTCC AGGGGTGAAC CTCAGCCTCT- CCTGCCATGC AGCCTCTAAC 
CCACCTGCAC AGTATTCTTG" GCTGATTGAT GGGAACATCC AGCAACACAC ACAAGAGCTC 
TTTATCTCCA" ACATCACTGA - GAAGAACAGC GGACTCTATA CCTGCCAGGC CAATAACTCA 
GCCAGTGGCC' ACAGCAGGAC TACAGTCAAG ACAATCACAG TCTCTGCGGA GCTGCCCAAG- 
CCCTCCATCT CCAGCAACAA CTCCAAACCC GTGGAGGACA AGGATGCTGT GGCCTTCACC 
TGTGAACCTG- AGGCTCAGAA CACAACCTAC CTGTGGTGGG TAAATGGTCA GAGCCTCCCA 
GTCAGTCCCA- GGCTGCAGCT- GTCCAATGGC AACAGGACCC TCACTCTATT- CAATGTCACA 
AGAAATGACG: CAAGAGCCTA* TGTATGTGGA ATCCAGAACT CAGTGAGTGC AAACCGCAGT. 
GACCCAGTCA CCCTGGATGT CCTCTATGGG CCGGACACCC CCATCATTTC - CCCCCCAGAC 
TCGTCTTACC TTTCGGGAGC GAACCTCAAC CTCTCCTGCC ACTCGGCCTC TAACCCATCC 
CCGCAGTATT- CTTGGCGTAT CAATGGGATA CCGCAGCAAC ACACACAAGT, TCTCTTTATC 
GCCAAAATCA CGCCAAATAA TAACGGGACC TATGCCTGTT TTGTCTCTAA CTTGGCTACT 
GGCCGCAATA ATTCCATAGT CAAGAGCATC ACAGTCTCTG CATCTGGAAC TTCTCCTGGT 
CTCTCAGCTG' GGGCCACTGT CGGCATCATG - ATTGGAGTGC TGGTTGGGGT TGCTCTGATA 
TAGCAGCCCT GGTGTAGTTT CTTCATTTCA GGAAGACTGA" CAGTTGTTTT GCTTCTTCCT. 
TAAAGCATTT GCAACAGCTA CAGTCTAAAA TTGCTTCTTT ACCAAGGATA TTTACAGAAA 
AGACTCTG AC ' CAGAGATCGA GACCATCCTA GCCAACATCG TGAAACCCCA TCTCTACTAA 
AAATACAAAA ATGAGCTGGG CTTGGTGGCG CGCACCTGTA GTCCCAGTTA- CTCGGGAGGC 
TGAGGCAGGA GAATCGCTTG AACCCGGGAG GTGGAGATTG. CAGTGAGCCC; AGATCGCACC 
ACTGCACTCC AGTCTGGCAA CAGAGCAAGA CTCCATCTCA AAAAGAAAAG AAAAGAAGAC 
TCTGACCTGT ACTCTTGAAT ACAAGTTTCT GATACCACTG CACTGTCTGA GAATTTCCAA 
AACTTTAATG" AACTAACTGA CAGCTTCATG AAACTGTCCA CCAAGATCAA GCAGAGAAAA 
TAATTAATTT" CATGGGACTA AATGAACTAA TGAGGATTGC TGATTCTTTA AATGTCTTGT 
TTCCCAGATT TCAGGAAACT- TTTTTTCTTT TAAGCTATCC ACTCTTACAG CAATTTGATA 
AAATATACTT. TTGTGAACAA AAATTGAGAC ATTTACATTT TCTCCCTATG TGGTCGCTCC 
AGACTTGGGA AACTATTCAT GAATATTT AT ATTGTATGGT AATATAGTTA TTGCACAAGT 
TCAATAAAAA- TCTGCTCTTT GTATAACAGA AAAA 
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Seq ID NO: 533 Protein sequence 
Protein Accession #: NP_004354.1 



MESPSAPPHR 
HLFGYSWYKG: 
TLHVIKSDLV 
NNQSLPVSPR' 
TISPLNTSYR 
AHNSDTGLNR 
QSLPVSPRLQ: 
SPSYTYYRPG 
HSASGHSRTT 
LPVSPRLQLS, 
PDSSYLSGAN 
ATGRNNSIVK 



11 

I 

WCIPWQRLLL 
ERVDGNRQII 
NEEATGQFRV. 
LQLSNGNRTL 
SGENLNLSCH 
TTVTTITVYA 
LSNDNRTLTL 
VNLSLSCHAA 
VKTITVSAEL 
NGNRTLTIiFN 
LNLSCHSASN 
SITVSASGTS 



21 

I . 

TASLLTFWNP 
GYVIGTQQAT 
YPELPKPSIS 
TLFNVTRNDT 
AASKPPAQYS 
EPPKPFITSN 
LSVTRNDVGP 
SNPPAQYSWL 
PKPSISSNNS 
VTRNDARAYV 
PSPQYSWRIN 
PGLSAGATVG 



31 

I 

PTTAKLTIES 
PGPAYSGREI 
SNNSKPVEDK 
ASYKCETQNP 
WFVNGTFQQS 
NSMPVEDEDA 
YECGIONELS 
IDGNIQQHTQ 
KPVEDKDAVA 
CGIQNSVSAN 
GIPQQHTQVL 
IMIGVLVGVA 



41 

I 

TPFNVAEGKE 
IYPKASLLIQ ; 
DAVAFTCEPE 
VSARRSDSVI- 
TQELFIPNIT 
VALTCEPEIQ 
VDHSDPVIliN' 
ELFISNITEK 
FTCEPEAQNT. 
RSDPVTLDVL 
FIAKITPNNN 
LI 



51 
I 

VLLLVHNLPQ 
NIIQNDTGFY 
TQDATYLWflV 
LNVLYGPDAP 
VNNSGSYTCQ 
NTTYLWWVNN 
VLYGPDDPTI 
NSGLYTCQAN 
TYLWWVNGQS 
YGPDTPIISP 
GTYACFVSNL 



Seq ID NO: 534 DNA sequence 

Nucleic Acid Accession ft; NM_006952.1 

Coding sequences 11.. 793 



AATCCCGACA 
TGGAAATGTG 
ATCTGACCAA 
GGCTGCCTGG 
TGTAGGCATC. 
AGTATATGCC 
ACCCAACCTC 
TGATGACCAG' 
CAATTGCTGT 
TGAGAATAAT 
AGAACCTCTC 
CTGCTATGAA 
ATTTGCCATT 
AATTGAATAT 



11 
I 

ATGGCGAAAG 
ATTATTGGTT 
CACAGCCTCT 
ATCGGCATAT 
ATGAAGTCCA 
TTTGAAGTGG 
TTCCTGAAGC 
TGGAAAAACA 
GGCGTAAATG 
GATGCTGACT 
AACCTGGAGG 
CTGATCTCTG 
CTCTG CTGG A 
TAAGAA • 



21 
I 

ACAACTCAAC 
GTTGCGGCAT 
ACCCACTGCT 
TTGTGGGCAT 
GCAGGAAAAT 
CATCTTGTAT 
AGATGCTAGA 
ATGGAGTCAC 
GTCCATCAGA 
ATCCCTGGCC 
CTTGTAAACT 
GTCCAATGAA 
CTTTTTGGGT 



31 
I 

TGTTCGTTGC 
TGCCCTGACT 
TGAAGCCACC 
CTGCCTCTTC 
TCTTCTGGCG 
CACAGCAGCA 
GAGGTACCAA 
CAAAACCTGG 
CTGGCAAAAA 
TCGTCAATGC 
AGGCGTGCCT 
CCGACACGCC 
TCTCCTGGGT 



41 
I 

TTCCAGGGCC 
GCGGAGTGCA 
GACAACGATG 
TGCCTGTCTG 
TATTTCATTC 
ACACAACGAG 
AACAACAGCC 
GACAGGCTCA 
TACACATCTG 
TGTGTTATGA" 
GGTTTTTATC 
TGGGGGGTTG 
ACCATGTTCT 



Seq ID NO: 535 Protein sequence 
Protein Accession' # :' NP_008B83 .1 
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51 
I 

TGCTGATTTT 
TCTTCTTTGT 
ACATCTATGG 
TTCTAGGCAT 
TGATGTTTAT 
ACTTTTTCAC 
CTCCAAACAA 
TGCTCCAGGA 
CCTTCCGGAC 
ACAATCTTAA 
ACAATCAGGG 
CCTGGTTTGG 
ACTGGAGCAG 
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MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT BNDDIYGAAN 60 
IGIFVGIO.F CLSVLGIVGI MKSSRKILIA YFILMFIVYA FEVASCITAA TQRDFFTPNL 120 

FLiKQMLlERYQ nnsppnnddq- wknngvtktw drlmlqdncc-gvngpsdwqk ytsafrtenn. 180 

DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNHKA WGVAWFGFAI 240 
LCWTFWVLLG TMFYWSRIEY 

Seq ID NO; 536 DNA sequence 

Nucleic Acid Accession §: NM_002638.1 

Coding sequence: 120.;. 473 - . 

• 1 11 . 21 ; 31 |1 51 

CAATACAGCT AAGGAATTAT CCCTTGTAAA TACCACAGAC CCGCCCTGGA GCCAGGCCAA 60 
GCTGGACTGC ATAAAGATTG GTATGGCCTT AGCTCTTAGC CAAACACCTT CCTGACACCA 120 
TGAGGGCCAG " CAGCTTCTTG ATCGTGGTGG TGTTCCTCAT. CGCTGGGACG CTGGTTCTAG 180 
AGGCAGCTGT. CACGGGAGTT CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTGTTCCAT 240 
TCAATGGACA AGATCCCGTT AAAGGACAAG TTTCAGTTAA ■ AGGTCAAGAT AAAGTCAAAG 300 
CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC ATTATCTTGA 
TCCGGTGCGC CATGTTGAAT CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 
TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA" TGGCCTGTTT CGTTCCCCAG- TGAAGGGAGC 
CGGTCCTTGC TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT CCTAAGTCCC 540 
TGCTGCCCTT." CCCCTTCCCA. CACTGTCCAT TCTTCCTCCC ATT CAGGATG CCCACGGCTG 600 
GAGCTGCCTC TCTCATCCAC TTTCCAATAA A 

Seq ID NO: 537 Protein sequence 
Protein Accession fh NP_002 629.1 r 

1 11 21 .31. ' 51 

LaSSFLIW VFLIAGTLVL LaVTGVPVK GODTVKGRVP ' FNGQDPVKGO VSVKGQDKVK 60 
AQEPVKGPVS TKPGSCPIIL IRCAMLNPPN- RCLKDTDCPG IKKCCEGSCG KACFVPQ 

Seq ID NO:. 53 B DNA sequence 

Nucleic Acid Accession NM_001793.2 

Coding sequence: 71. .2560 

1 11 . 21 31 |1 51 

Laggggcaa Lgctgagcg gaacaccggc CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 
CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 
CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG, CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 
CTTGGAGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 
CTGCCCTGGG- CAAGAGCCAG CTCTGTTTAG, CACTGATAAT GATGACTTCA CTGTGCGGAA 
TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 

ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420 

TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAG ATAG ~ ' " 
AGACACCAAG ATTTTCTACA" GCATCACGGG: GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 
CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 

GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 660 

CCCCATGAAC ATCTCCATCA TCGTGACCGA CCAGAATGAC CACAAGCCCA" AGTTTACCCA. 720 

GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 7B0 

GACAGCCACG GATGAGGATG ATGCCATCTA" CACCTACAAT GGGGTGGTTG CTTACTCCAT 840 

CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 900 

CACCATCAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT- ACACACTGAC- 960 

CATCCAGGCC A CAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG ■ CAGTAGTGGA 1020 

GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT ■ 1080 

GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 1140 

CAACTCACCA GCGTGGCGTG. CCACCTACCT TATCATGGGC GGTGACGACG GGGACCATTT 1200 

TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGJ 1260 

TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG. ACCAACGAGG CCCCTTTTGT-' 1320 

GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATGAGGC 1380 

ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 1440 

GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 1500 

CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 1560 

TGTGGGCACC CTCGACCGTG. AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620 

GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680 

ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 1740 



60 
120 
1B0 
240 
300 
360 



480 
540 
600 



CCAAAGCCCT GTGCGCCAGG TGCTGAACAT 
CCCTTTCCAG GCCCAGCTCA CAGATGACTC 
GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA 
GCACCTTTCT CTGTCTGACC ATGGCAACAA 
GTGCGACTGC CATGGCCATG TCGAAACCTG 



CACGGACAAG GACCTGTCTC CCCACACCTC 1800- 

AGACATCTAC TGGACGGCAG AGGTCAACGA 1860 

GAAGTTCCTG AAGCAGGATA CATATGACGT 1920 

AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980 

CCCTGGACCC TGGAAGGGAG GTTTCATCCT 2040 

CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 2100 

GAGAAAGAAG . CGGAAGATCA- AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160 

CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 2220 

GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 2280 

CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TCGGCAACTT 2340 

TATAATTGAG AACCTGAAGG- CGGCTAACAC AGACCCCACA GCCCOGCCCT ACGACACCCT 2400 

CTTGGTGTTC GACTATGAGG" GCAGCGGCTC CGACGCCGCG TCCCTGAGCT CCCTCACCTC 2460 

CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520 

GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 2580 

GGGACCAAAC GTCAGGCCAC AGAGCATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2640 

GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC •TATGAGTCTG 2700 

ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 2760 

AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 2820 

TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 2880 

TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2940 



391 
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TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 3000 

GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 3060 • 

TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT T ATTTTCC CT 3120 

GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180 

TAAAGAAACT TTTCCCAGAA AAAAA , . 

Seq ID NO: 539 Protein sequence 
Protein Accession ft: NP_001784.2 

1 11 21 31 41" 51 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVPKGCPG 60 

Q2PALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV. VAPISVPENG 120 

KGPFPQRLiNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK ISO 

YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTP RGSVLEGVLP GTSVKQVTAT 240 

DEDDAIYTYN GWAYSIHSQ EPKDPHDLMP: TIHRSTGTIS VISSGLDREK VPEYTLTIGA 300 

TDMDGDGSTT, TAVAWEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPN5P" 360 

AWRATYLIMG- GDDGDHFTIT THPESNQGIL TTRXGLDFEA KNQHTLYVEV. TNEAPFVLKL 420 

PTSTATIWH VEDVNEAPVF VPPSKWEVQ EGIPTGEPVC VYTAEDPDKE NQKI SYRILR- 480 

DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 

VNDHGPVPEP RQITICNQSP VRQVLNITDK- DLSPHTSPFQ" AQLTDDSDIY 4 WTAEVNEEGD 600 

TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 

GAVLAIiLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR- 720 

GliEARPEWL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLM EWGSRFKKLA DMYGGGEDD 

.Seq ID NO: 540 DNA sequence 
Nucleic Acid Accession Eos sequence 
Coding sequence: 1..672 

1 11 21 31 41 51 

ATGAGGCTCC AAAGACCCCG ACAGGCCCCG- GCGGGTGGGA GGCGCGCGCC CCGGGGCGGG 60 

CGGGGCTCCC CCTACCGGCC AGACCCGGGG AGAGGCGCGC. GGAGGCTGCG AAGGTTCCAG 120 

AAGGGCGGGG AGGGGGCGCC GCGCGCTGAC CCTCCCTGGG CACCGCTGGG GACGATGGOG 1B0 

CTGCTCGCCT TGCTGCTGGT CGTGGCCCTA CCGCGGGTGT GGACAGACGC CAACCTGACT 240 

GCGAGACAAC GAGATCCAGA GGACTCCCAG: CGAACGGACG AGGGTGACAA TAGAGTGTGG 300 

TGTCATGTTT GTGAGAGAGA AAACACTTTC GAGTGCCAGA ACCCAAGGAG GTGCAAATGG 360 

ACAGAGCCAT 1 ACTGCGTTAT AGCGGCCGTG AAAATATTTC CACGTTTTTT CATGGTTGCG 420 

AAGCAGTGCT CCGCTGGTTG TGCAGCGATG GAGAGACCCA AGCCAGAGGA GAAGCGGTTT 480 

CTCCTGGAAG AGCCCATGCC CTTCTTTTAC, CTCAAGTGTT GTAAAATTCG CTACTGCAAT 540 

TTAGAGGGGC CACCTATCAA CTCArCAGTG TTCAAAGAAT ATGCTGGGAG CATGGGTGAG 600 

AGCTGTGGTG GGCTGTGGCT GGCCATCCTC CTGCTGCTGG CCTCCATTGC AGCCGGCCTC 660 
AGCCTGTCTT GA 

Seq ID NO: 541 Protein sequence 
protein Accession Eos sequence 

1 11. 21 31 41 51 

Llqrprqap AGGRRAPRGG RGSPYRPDPG RGARRLRRFQ KGGEGAPRAD PPWAPLGTMA 60 

LLALLLWAL PRVWTDANLT ARQRDPEDSQ RTDEGDNRVW CHVCERENTP ECQNPRRCKW 120 

TEPYCVIAAV KIFPRFFMVA KQCSAGCAAM ERPKPEEKRF LLEEPMPPFY LKCCKIRYCN 180 
LEGPPINSSV FKEYAGSMGE SCGGLWIAII. LLLASIAAGL SLS 

Seq ID NO: 542 DNA sequence 

Nucleic Acid Accession ft:: XM_035292,2 

Coding sequence: 53.. 1576 

1 11 21 31 41 51 • 

GCTCGCTGGG CCGCGGCTCC CGGGTGTCCC AGGCCCGGCC GGTGCGCAGA GCATGGCGGG 60 

TGCGGGCCCG AAGCGGCGCG CGCTAGCGGC GCCGGCGGCC GAGGAGAAGG AAGAGGCGCG 120 

GGAGAAGATG CTGGCCGCCA AGAGCGCGGA CGGCTCGGCG CCGGCAGGCG AGGGCGAGGG 1B0 

CGTGACCCTG CAGCGGAACA TCACGCTGCT CAACGGCGTG GCCATCATCG TGGGGACCAT 240 

TATCGGCTCG GGCATCTTCG TGACGCCCAC GGGCGTGCTC AAGGAGGCAG GCTCGCCGGG 300 
GCTGGCGCTG GTGGTGTGGG CCGCGTGCGG CGTCTTCTCC ATCGTGGGCG CGCTCTGCTA- 360 

CGCGGAGCTC GGCACCACCA- TCTCCAAATC GGGCGGCGAC TACGCCTACA TGCTGGAGGT 420 

CTACGGCTCG CTGCCCGCCT TCCTCAAGCT CTGGATCGAG CTGCTCATCA TCCGGCCTTC 480 

ATCGCAGTAC ATCGTGGCCC TGGTCTTCGC CACCTACCTG CTCAAGCCGC TCTTCCCCAC 540 

CTGCCCGGTG CCCGAGGAGG CAGCCAAGCT CGTGGCCTGC CTCTGCGTGC TGCTGCTCAC 600 

GGCCGTGAAC TGCTACAGCG TGAAGGCCGC CACCCGGGTC CAGGATGCCT TTGCCGCCGC 660 

CAAGCTCCTG GCCCTGGCCC ' TGATCATCCT GCTGGGCTTC GTCCAGATCG GAAAGGGTGA 720 

TGTGTCCAAT CTAGATCCCA- ACTTCTCATT TGAAGGCACC AAACTGGATG TGGGGAACAT - 780 

TGTGCTGGCA TTATACAGCG GCCTCTTTGC CTATGGAGGA- TGGAATTACT TGAATTTCGT 840 

CACAGAGGAA ATGATCAACC CCTACAGAAA CCTGCCCCTG GCCATCATCA TCTCCCTGCC 900 

CATCGTGACG- CTGGTGTACG TGCTGACCAA CCTGGCCTAC TTCACCACCC TGTCCACCGA 960 

GCAGATGCTG TCGTCCGAGG CCGTGGCOGT- GGACTTCGGG AACTATCACC TGGGCGTCAT 1020 

GTCCTGGATC ATCCCCGTCT TCGTGGGCCT GTCCTGCTTC GGCTCCGTCA ATGGGTCCCT 1080 

GTTCACATCC TCCAGGCTCT TCTTCGTGGG GTCCCGGGAA GGCCACCTOC CCTCCATCCT 1140 

CTCCATGATC CACCCACAGC TCCTCACCCC CGTGCCGTCC CTCGTGTTCA CGTGTGTGAT 1200 

GACGCTGCTC TACGCCTTCT CCAAGGACAT CTTCTCCGTC ATCAACTTCT TCAGCTTCTT 1260 

CAACTGGCTC TGCGTGGCCC TGGCCATCAT CGGCATGATC TGGCTGCGCC ACAGAAAGCC 1320 

TGAGCTTGAG CGGCCCATCA AGGTGAACCT GGCCCTGCCT GTGTTCTTCA TCCTGGCCTG 1380 

CCTCTTCCTG ATCGCCGTCT CCTTCTGGAA GACACCCGTG GAGTGTGGCA TCGGCTTCAC 1440 

CATCATCCTC AGCGGGCTGC* CCGTCTACTT CTTCGGGGTC TGGTGGAAAA ACAAGCCCAA 1500 

GTGGCTCCTC CAGGGCATCT TCTCCACGAC CGTCCTGTGT CAGAAGCTCA TGCAGGTGGT 1560 
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5 



45 



60 



80 



WO 02/086443 . 

CCCCCAGGAG ACATAGCCAG GAGGCCGAGT GGCTGCCGGA GGAGCATGC 

Seq* ID NO: 543 Protein sequence 
Protein Accession #: XP_03S292.2 



Seq ID NO: 545 Protein sequence 
Protein Accession #: NP_005259.1 



Seq ID NO: 546 DNA sequence 

Nucleic Acid Accession #: NM_002391.1 

Coding sequence: 26.-457 



Seq ID NO: 547 Protein sequence 
Protein Accession #: NP_.002382.1 



1 11 21 r r i 1 

MAGAGPKRRA LAAPAAEEKE EAREKMLAAK SADGSAPAGE GEGVTLQRNI TLLNGVAIIV 60 

GTIIGSGIFV TPTGVLKEAG SPGLALWWA ACGVFSIVGA LCYAELGTTI SKSGGDYAYM 120 

10 LEVYGSLPAP LKLWIELLII RPSSQYIVAL VFATYLLKPL FPTCPVPEEA AKLVACLCVL 180 

' LLTAVNCYSV KAATRVQDAF AAAKLLALAL IILLGPVQIG KGDVSNLDPN FSPEGTKLDV 240 

GNIVLALYSG LFAYGGWNYL NFVTEEMINP YRNLP1AIII SLPIVTLVYV- LTNLAYFTTL 300 

STEQMLSSEA VAVDFGNYHL GVMSWIIPVE VGLSCFGSVN GSLPTSSRLF FVGSREGHLP 360 

SILSMIHPOL LTPVPSLVFT CVMTLL YAFS : KDIFSVINFF SFFNWLCVAL AIIGMIWLRH 420 

15 RKPELERPIK VNLALPVFFI' LACLFLIAVS FWKTPVECGI GFTIILSGU> VYFFGVWWKN- 480 
KPKWLLQGIF STTVLCQKLM QWPQET 
i 

Seq ID NO: 544 DNA sequence 
Nucleic Acid Accession #: NM_005268.1 
20 Coding sequence: 168.. 989 

1 11 21 r r r 

taaaaagcaa AAGAATTCGC GGCCGCGTCG ACAOGGGCTT CCCCGAAAAC CTTCCCCGCT 60 

25 TCTGGATATG AAATTCAAGC TGCTTGCTGA GTCCTATTGC CGGCTGCTGG GAGCCAGGAG 120- 

AGCCCTGAGG AGTAGTCACT CAGTAGCAGC TGACGCGTGG GTCCACCATG AACTGGAGTA 180 

• • TCTTTGAGGG ACTCCTGAGT GGGGTCAACA AGTACTCCAC AGCCTTTGGG CGCATC^GC 240 

TGTCTCTGGT- CTTCATCTTC CGCGTGCTGG TGTACCTGGT GACGGCCGAG CGTGTGTGGA 300 

GTGATGACCA CAAGGACTTC GACTGCAATA CTCGCCAGCC CGGCTGCTCC AACOTCTGCT 360 

30 TTGATGAGTT- CTTCCCTGTG TCCCATGTGC GCCTCTGGGC CCTGCAGCTT ATCCTGGTGA 420 

CATGCCCCTC ACTGCTCGTG GTCATGCACG- TGGCCTACCG GGAGGTTCAG GAGAAGAGGC 480 

ACCGAGAAGC CCATGGGGAG AACAGTGGGC GCCTCTACCT GAACCCCGGC AAGAAGCGGG 540 

GTGGGCTCTG GTGGACATAT GTCTGCAGCC TAGTGTTCAA GGCGAGCGTG GACATCGCCT 600 

TTCTCTATGT GTTCCACTCA TTCTACCCCA AATATATCCT CCCTCCTGTG GTCAAGTGCC 660 

35 ACGCAGATCC ATGTCCCAAT ATAGTGGACT GCTTCATCTC CAAGCCCTCA GAGAAGAACA 720 

TTTTCACCCT CTTCATGGTG GCCACAGCTG CCATCTGCAT. CCTGCTCAAC CTCGTGGAGC 780 

TCATCTACCT GGTGAGCAAG AGATGCCACG AGTGCCTGGC AGCAAGGAAA GCTCAAGCCA 840 

TGTGCACAGG- TCATCACCCC CACGGTACCA- CCTCTTCCTG CAAACAAGAC GACCTCCTTT 900 

CGGGTGACCT CATCTTTCTG GGCTCAGACA GTCATCCTCC TCTCTTACCA GACCGCCCCC 960 

40 GAGACCATGT GAAGAAAACC ATCTTGTGAG; GGGCTGCCTG GACTGGTCTG GCAGGTTGGG 1020 

CCTGGATGGG. GAGGCTCTAG CATCTCTCAT AGGTGCAACC TGAGAGTGGG GGAGCTAAGC 1080 

CATGAGGTAG. GGGCAGGCAA GAGAGAGGAT TCAGACGCTC TGGGAGCCAG TTCCTAGTCC 1140 

TCAACTCCAG CCACCTGCCC CAGCTCGACG GCACTGGGCC AGTTCCCCCT CTGCTCTGCA 1200 
GCTCGGTTTC CTTTTCTAGA ATGGAAATAG TGAGGGCCAA TGC 



1 11 21 31 41 51 

50 MNWSIFEGLL SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERVWSDDHKD FDOJTRQPGC 60 

SNVCFDEFFP VSHVRLWALQ LILVTCPSLL WMHVAYREV QEKRHREAHG ENSGRLYLNP 120 

GKKRGGLWWT YVCSLVPKAS VDIAFLYVFH SFYPKYILPP WKCHADPCP NI^FISKP 180 

SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 240 

55 DDLLSGDLIP LGSDSHPPLL PDRPRDHVKK TIL 



1 .11 21 31 41 51 

CGGGCGAAGC AGCGCGGGCA GCGAGATGCA GCACCGAGGC TTCCTCCTCC TCACCCTCCT 60 

CGCCCTGCTG GCGCTCACCT CCGCGGTCGC CAAAAAGAAA GATAAGGTGA AGAAGGGCGG 120 

65 CCCGGGGAGC GAGTGCGCTG AGTGGGCCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 180 

CGGCGTGGGT TTCCGOGAGG GCACCTGCGG GGCCCAGACC CAGOGCATCC GGTGCAGGGT 240 

GCCCTGCAAC TGGAAGAAGG AGTTTGGAGC CGACTGCAAG TACAAGTTTG AGAACTGGGG 300 

TGCGTGTGAT GGGGGCACAG GCACCAAAGT CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA 360 

CAATGCTCAG TGCCAGGAGA CCATCCGCGT CACCAAGCCC TGCACCCCCA AGACCAAAGC 420 

70 AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTAGACG CCAAGCCTGG ATGCCAAGGA 480 

GCCCCTGGTG TCACATGGGG CCTGGCCACG CCCTCCCTCT CCCAGGCCCG AGATGTGACC 540 

CACCAGTGCC TTCTGTCTGC TCGTTAGCTT TAATCAATCA TGCCCTGCCT TGTCCCTCTC 600 

ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG ACAAGGGATT CTGGGAAGCT 660 
TGAGCCTCCC CCAAAGCAAT GTGAGTCCCA GAGCCCGCTT TTGTTCTTCC CCACAATTCC 

75 ATTACTAAGA AACACATCAA ATAAACTGAC TTTTTCCCCC CAATAAAAGC TCTTCTTTTT 
TAATAT 



720 
780 



' MQHRGFLLLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT .60 
CGAQTQRIRC RVPCNWKKEF GADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQETI 120 
85 RVTKPCTPKT KAKAKAKKGK GKD 

Seq ID NO; 548 DNA sequence 



393 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

Nucleic Acid Accession 8: NM_006783.1 
Coding sequence: l. .786 



PCT/US02/12476 



ATGGATTGGG 
GGGAAGGTGT 
CAGGAAGTGT 
AAAAATGTGT 
CTGATCTTCG 
GAAACCACTC 
ATTAAAAAGC 
TTTTTCCGAA 
TACCACCTGC 
TTTATTTCTA 
ATTTGCATGC 
AGATCAAAGA 
CAGAATGAAA 
AGCTAA 



11 
1 " 

GGACGCTGCA 
GGATCACAGT 
GGGGTGACGA 
GCTATGACCA 
TCTCCACCCC 
GCAAGTTCAG 
ACAAGGTTCG 
TCATCTTTGA 
CCTGGGTGTT 
GGCCAACAGA 
TGCTTAACGT 
GAGCACAGAC 
TGAATGAGCT 



21 
I 

CACTTTCATC 
CATCTTTATT 
GCAAGAGGAC 
CTTTTTCCCG 
AGCGCTGCTG 
GCGAGGAGAG 
GATAGAGGGG 
AGCAGCCTTT 
GAAATGTGGG 
GAAGACCGTG 
GGCAGAGTTG 
GCAAAAAAAT. 
GATTTCAGAT 



31 
I 

GGGGGTGTCA 
TTCCGAGTCA 
TTCGTCTGCA 
GTGTCCCACA 
GTGGCCATGC 
AAGAGGAATG 
TCGCTGTGGT 
ATGTATGTGT 
ATTGACCCCT. 
TTTACCATTT 
TGCTACCTGC 
CACCCCAATC 
AGTGGTCAAA 



41 
I 

■ ACAAACACTC 
TGATCCTAGT 
ACACACTGCA- 
TCCGGCTGTG 
ATGTGGCCTA- 
ATTTCAAAGA 
GGACGTACAC 
TTTACTTCCT- 
GCCCCAACCT 
TTATGATTTC 
TGCTGAAAGT- 
ATGCCCTAAA 
ATGCAATCAC 



51 

I . 

CACCAGCATC 

GGTGGCTGCC 

ACCGGGATGC 

GGCCCTCCAG 

CTACAGGCAC 

CATAGAGGAC 

CAGCAGCATC 

TTACAATGGG 

TGTTGACTGC : 

TGCGTCTGTG 

GTGTTTTAGG. 

GGAGAGTAAG 

AGGTTTCCCA 



Seq ID NO: 549 Protein sequence 
protein Accession #: NP_006774.1 



ll 



21 



41 



51 



MDWGTLHTFI 
KNVCYDHFFP 
IKKHKVRIEG 
FISRPTEKTV 
QNEMNELISD 



31 

I I I ! I 

GGVNKHSTSI GXVWITVIFI FRVMILWAA QEVWGDEQED FVCNTLQPGC 
VSHIRLWALQ LIFVSTPALL VAMHVAYYRH ETTRKFRRGE KRNDFKDIED 
SLWWTYTSSI- FFRIIFEAAF MYVFYFLYNG YHlaPWVLKCG IDPCPNLVDC 
FTIFMISASV ICMLLNVAEI/ CYLLLKVCFR RSKRAQTQKN HPNHALKESK 
SGQNAITGFP 



Seq ID NO: 550 DNA sequence 

Nucleic Acid Accession #: NM_002S71.1 

Coding sequence r 99 . . 587 



CATCCCTCTG 
TCACCCTGGG 
AGGACCTGGA 
ACATCTCCCT 
CCACCCCCGA 
AGAAGAAGGT 
TGGCGAACGA 
AGGACACCAC 
AGGACGATGA 
GGTACTTGCT 
CCAGGAAGAC 
TTTCAAAGAA 
TCCTGCTGCA 
GCAGAGGTTA 



11 
I 

GCTCCAGAGC 
CGTGGCCCTG 
GCTCCCAAAG 
CATGGCGACA 
GGACAACCTG 
CCTTGGAGAG 
GGCCACGCTG 
CACCCCCATC 
GATCATGCAG 
GGACTTGAAA 
CAGACTCCCA 
TAACCACAGC 
CACCTGCACC 
TTAATAAACC 



21 

I 

TCAGAGCCAC 
GTCTGTGGTG: 
TTGGCAGGGA 
CTGAAGGCCC 
GAG AT CGTTC 
AAGACTGGGA 
CTCGATACTG 
CAGAGCATGA 
GGATTCATCA 
CAGATGGAAG 
CCCTTCCACA 
TCAGAAGACG 
ATTGCCATGG 
CTTGGAGCAT 



31 
( 

CCACAGCCGC. 
TCCCGGCCAT" 
CCTGGCACTC 
CTCTGAGGGT 
TGCACAGATGi 
ATCCAAAGAA 
ACTACGACAA 
TGTGCCAGTA 
GGGCTTTCAG 
AGCCGTGCCG 
CCTCCAGAGC 
ATGACGTGGT 
GGAGGCTGCT 
G 



41 
I 

AGCCATGCTG 
GGACATCCCC 
CATGGCCATG 
CCACATCACC 
GGAGAACAAC 
GTTCAAGATC 
TTTCCTGTTT 
CCTGGCCAGA 
GCCCCTGCCC 
TTTCTAGCTC 
AGTGGGACTT 
CATCTGTGTC 
CCCTGGGGGC 



51 

I - 

TGCCTCCTGC 
CAGACCAAGC 
GCGACCAACA- 
TCACTGTTGC 
AGCTGTGTTG 
AACTATACGG 
CTCTGCCTAC 
GTCCTGGTGG 
AGGCACCTAT 
ACCTCCGCCT; 
CCTCCTGCCC 
GCCATCCCCT 
AGAGTCTCTG 



Seq ID NO: 551 Protein sequence 
Protein Accession ft: NP_002562.1 



1 11 21 31 41 51 

I I I I I I 

MDIPQTKQDL ELPKLAGTWH SKAMATNNIS LMATLKAPliR VHITSLLPTP. EDNLEIVLHR 
WENNSCVEKK VLGEKTGNPK KFKINYTVAN EATLLDTDYD NFLFLCLQDT TTPIQSMMCQ 
YLARVLVEDD EIMQGFIRAF RPLPRHLWYL LDLKQMEEPC RF 

Seq ID NO: 552 DNA sequence 

Nucleic Acid Accession #: NM_00 6500.1 

Coding sequence: 27.. 1967 



1 • 
I 

ACTTGCGTCT 
TCGCCGCCTG 
CGCCTGAGCT 
AGTCCCAAGG 
TCATCTTCCG 
TCAGCCTCCA 
GCATCTTCTT 
TCTACAAAGC 
GTAAGGAGCC 
TCATCTGGTA 
CGTCCCAGAC 
TGGTTAAAGA 
GGAACCACAT 
TGTGGCTGGA 
GTTTGGCTGA 
GGGAGGCAGA 
AGGAACACAG 
TGAGTGAACC 
CCCCTGAGAG 
ACCTCGAGTT 



11 
I 

CGCCCTCCGG 
CTGCTGCTGT 
GGTGGAGGTG 
CAACCTCAGC 
TGTGCGCCAG 
GGACAGAGGG 
GTGCCAGGGC 
TCCGGAGGAG 
TGAGGAGGTC 
CAAGAATGGC 
TGTGGAGTCG 
AGACAAAGAT 
GAAGGAGTCC 
AGTGGAGCCC 
TGGCAACCCT 
GGAAGAGACA 
TGGGCGCTAT 
ACAGGAACTA 
ACAGGAAGGC 
CCAGTGGCTG 



21 
I 

CCAAGCATGG 
CCTCGCGTCG 
GAAGTGGGCA 
CATGTCGACT 
GGCCAGGGCC 
GCTACTCTGG 
AAGCGCCCTC 
CCAAACATCC 
GCTACCTGTG. 
CGGCCTCTGA 
AGTGGTTTGT 
GCCCAGTTTT 
AGGGAAGTCA 
GTGGGAATGC 
CCACCACACT 
ACCAACGACA 
GAATGTCAGG 
CTGGTGAACT 
AGCAGCCTCA 
AGAGAAGAGA 



31 
I 

GGCTTCCCAG 

CGGGTGTGCC 

GCACAGCCCT 

GGTTTTCTGT 

AGAGCGAACC 

CCCTGACTCA 

GGTCCCAGGA 

AGGTCAACCC 

TAGGGAGGAA 

AGGAGGAGAA 

ACACCTTGCA. 

ACTGTGAGCT-; 

CCGTCCCTGT- 

TGAAGGAAGG 

TCAGCATCAG 

ACGGGGTCCT 

CCTGGAACTT 

ATGTGTCTGA 

CCCTGACCTG 

CAGACCAGGT 



41 



I 

GCTGGTCTGC 
CGGAGAGGCT 
TCTGAAGTGC 
CCACAAGGAG 
TGGGGAGTAC 
AGTCACCCCC 
GTACCGCATC 
CCTGGGCATC 
CGGGTACCCC 
GAACCGGGTC 
GAGTATTCTG. 
CAACTACCGG 
TTTCTACCCG' 
GGACCGCGTG 
CAAGCAGAAC 
GGTGCTGGAG 
GGACACCATG 
CGTCCGAGTG 
TGAGGCAGAG 
GCTGGAAAGG 



51 

I 

GCCTTCTTGC 

GAGCAGCCTG 

GGCCTCTCCC 

AAGCGGACGC 

GAGCAGCGGC 

CAAGACGAGC 

CAGCTCCGCG 

CCTGTGAACA 

ATTCCTCAAG ' 

CACATTCAGT 

AAGGCACAGC 

CTGCCCAGTG ' 

ACAGAAAAAG- 

GAAATCAGGT 

CCCAGCACCA 

CCTGCCCGGA 

ATATCGCTGC 

AGTCCCGCAG 

AGTAGCCAGG 

GGGCCTGTGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 ' 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



394 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75. 

80 

85 



WO 02/086443 

„ TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGOGGCTA 
CCAGCATACC OGGCCTGAAC CGCACACAGC TGGTCAAGCT 
GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA 
GTGAAGOGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA 
AACAAGACCA AGATCCACAG OGAGTCCTGA GCACCCTGAA 
TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT 
TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA 
TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG 
TGCCGGAGCC GGAGAGCCGG. GGCGTGGTCA TCGTGGCTGT 
TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA 
GCTCAGGGAA GCAGGAGATC ACGCTGCCCC OGTCTCGTAA 
TTAAGTCAGA- TAAGCTCCCA* GAAGAGATGG- GCCTCCTGCA 
GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG 
CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT 
CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC 
GGCCACTGGG: TTAGGACCTG AGGACCTCAC TTGGCCCTGC 
GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC 
CCGAGCGGGT. 'AGGAGAGTTT CTTGCAGAAC. GTGTTTTTTC 
AAATACCTGG. CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC 
CAAAGGCTGG. CTTCCACCAT CCAGGTGCAC CACTGAAGTG 
GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG 
AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG. 
ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGT CATT 
GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT 
TCACAAAGTC AGGACGAGAC CATCCTGGCT. AACACGGTGA 
TACAAAAAAA AATTAGCTAG - GCGTAGTGGT TGGCACCTAT 
CTGAAGCAGG AGAATGGTAT: GAATCCAGGA . GGTGGAGCTT 
CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT. 
ACGCGTACCT. GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG 
TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC 
GGGGAGCAGA' CAAAGATGAG GTCTACACTG TCCTTCATGG 
TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC 
AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG 
CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT 
TTGTTTCCTT . TATATATGTA TGTATATATA TATATGAAAA 
AAAGCTTAAT- TGTCCCAGAA AATCATACAT, TGCTTTTTTA 
AACCTGGGGG. CCTGTGAAAC TACAACCAAA AGGCACACAA 
AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA 
CTACCCTACT. TTTCAGCAGC. AAAACGTCCC GTATGACGCA 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA 

Seq ID NO: 553 Protein sequence 
Protein Accession 8: NP_006491.1 



TCGCTGCGTG 
GGCCATTTTT 
TATGGTGTTG 
CGTCAACGGC 
TGTCCTCGTG 
GGGCAAAAAC 
CTCCAACACA 
CACCTCCACA 
GATTGTGTGC 
GGGCAAGCTG 
GACCGAACTT 
GGGCAGCAGC 
GCATTAGCCC ' 
CACTCTTCTC 
CTGCACACCC 
AAGCCGCTTT 
AAGCAAGGAG. 
TTTACACACA 
TCTGAGCTGG 
AGGACACACC 
AGAGCACCCC 
CCTCTTCAAA- 
CCTTAAAAGA- 
GGGAGGCOGA 
AACCCTGTCT 
AGTCCCAGCT 
GCAGTGAGCC 
CGAGGAAAAA 
TTCAGGTGAA 
ACGTAAAACT 
GGATTAAAGC 
CCTAGAAGGG 
CTTCGGGTGT 
CAGGTGTGTA 
TATATATATA- 
TTCTACATGG 
AACCGTTTCC 
AGCTCTACCA 
GCACGAAGGG 
CTT 



GCGTCTGTGC 
GGCCCCCCTT 
AATCTGTCTT 
ACGGCAAGTG 
ACCCCGGAGC 
ACCAGCATCC 
ACCACTGGCC 
GAGAGAAAGC 
ATCCTGGTCC 
CCGTGCAGGC 
GTAGTTGAAG 
GGTGACAAGA . 
CGAATCACTT. 
TCAGCCAAAG 
CCTTTCAGAG ' 
TCAGGGACCA- 
CCCCAGTCTC ; 
TTATGGCTGTi 
TTTCCTGCCC 
GGAGCCAGGC 
AGCGGCATCC 
GTCTCCTGTG 
TACGTGCCGG 
GGCGGGCGGA- 
CTACTAAAAA 
ACTCGGAAGG 
GAGACCGTGC 
AAAAGAAAAG 
TTAGCCTCAA 
GAAAGGCAGC 
TATGGTTATA 
CCCAAATGAG. 
GTGTGTCTGT 
AATTTGCAAA- 
TATGAAAAAT 
GTACCACAGG 
AGTTGGCAGC 
GAGCAGACAG 
CCTGGCAGGC 



1260 

1320. 

1380 

1440 

1500 

1560 

1620 ' 

1680 

1740. 

1800 

1860 

1920. 

1980 

2040 

2100 

2160 

2220 

2260 

2340 

2400 

2460 

2520; 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 



PCT/US02/12476 



GLPRLVCAFT, 
WFSVHKEKRT 
RSQEYRIQLR 
KEEKNRVHIQ. 
TVPVFYPTEK 
NGVLVLSPAR- 
TLTCEAESSQ 
LVKLAIFGPP 
STLNVLVTPE 
RANSTSTERK 
PSRKTELWS 



11 
I 

LAACCCCPRV 
LIFRVRQGQG 
VYKAPEEPNI 1 
SSQTVESSGL. 
VWLEVEPVGM. 
KEHSGRYECQ. 
DLEFQWLREE 
WMAPKERKVW 
LLETGVECTA: 
LPEPESRGW 
VKSDKLPEEM 



21 . 
I 

AGVPGEAEQP 
QSSPGEYEQR 
QVNPLGIPVN 
YTLQSILKAQ 
LKEGDRVEIR 
AWNLDTMISL 
TDQVLERGPV 
VKENMVLNLS 
SNDLGKNTSI 
IVAVIVCILV 
GLLQGSSGDK 



31 
I 

APBLVEVEVG 
IiSLQDRGATL 
SKSPEEVATC 
LVKEDKDAQF 
CLADGNPPPH 
LSEPQELLVN 
LQLHDLKREA 
CEASGHPRPT 
LFIiELVNLTT 
LAVLGAVLYF 
RAPGDQGEKY 



41 
I 

STALLKCGLS 
ALTQVTPQDE 
VGRNGYPIPQ 
YCELNYRLPS 
FSISKQNPST 
YVSDVRVSPA 
GGGVRCVASV 
ISWNVKGTAS 
LTPDSNTTTG 
LYXKGKLPCR 
IDLRH 



51 
I 

QSQGNLSHVD 
RIPLCQGKRP 
VIWYKNGRPL 
GUHMKESREV 
REAEEETTND 
APERQBGS5L 
FSIPGLNRTQ. 
EQDQDPQRVL 
IiSTSTASPHT 
RSGKQEITLP 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



Seq ID NO: 554 DNA. sequence 

Nucleic. Acid Accession 8 : NM_003183 . 3 

Coding sequence: 165.-2639 



TCGAGCCTGG 
GCTAGGCCGG 
CGAAGGCTGC 

. TATTCCTGAC 
TCGGCCCCCA 
CTTTATCTAA 
TAGAAACACT 
GTACTGAACG' 
AGTACACTGC 
GGGTTCTAGC 
AATATAACAT 
TTTATAAATC 
ATTTAAAAGT 
AAGAGCTTGT 
AATTATTGGT 
CAACTACAAA 
CATGGGATAA 
AGTCTCCACA 
A7GAAGAAAA 
CTGAGGAAGC 

. GAACTCTTGG 
CAAAGGCTTA 
GCACAAAGAA 



11 
I 

CGGTAGAATC 
GCGGATCCCG 
CCAGAGAGGT 
CAGCGTGGTT 
CCAGAGACTC 
TATCCAGCAG 
ACTAACTTTT 
TTTTTCACAA 
AAAATGGCAG 
CCACATAAGA 
AGAGCCACTT 
TGAAGATATC 
GGATAATGAA 
TCATCGAGTG 
GGTAGCAGAT 
TTACTTAATA 
TGCAGGTTTT- 
AGAGGTAAAA 
GGATGCTTGG 
ATCTAAAGTT 
ATTAGCTTAT 
TTATAGCCCA 
. TTATGGTAAA 



21 
I 

TTCCCAGTAG 
TCCTCCCCCG 
GGAGTCGGTA 
CCTTTOGTGC 
GAGAAGCTTG 
CATTCGGTAA 
TCAGCTTTGA 
AATTTCAAGG 
GACTTCTTCA 
GATGATGATG 
TGGAG ATTTG 
AAGAATGTTT 
GAGTTGCTCC 
AAAAGAAGAG 
CATCGCTTCT- 
GAGCTAATTG 
AAAGGCTATG 
CCTGGTGAAA 
GATGTGAAGA 
TGCTTGGCAC 
GTTGGCTCTC 
GTTGGGAAGA 
ACCATCCTTA 



31 

I 

GCGGCGGGGG 
ATGTGAGCAG 
GCGGGGCCGG 
TGGCGCCGCG 
ATTCTTTGCT 
GAAAAAGAGA 
AAAGGCATTT 
TCGTGGTGGT 
CTGGACACGT 
TTATAATCAG 
TTAATGATAC 
CACGTTTGCA 
CAAAAGGGTT 
CTGACCCAGA 
ACAGATACAT 
ACAGAGTTGA 
GAATACAGAT 
AGCACTACAA 
TGTTGCTAGA 
ACCTTTTCAC 
CCAGAGCAAA 
AAAATATCTA 
CAAAGGAAGC 



41 

I 

AGGGAAAAGA 
TTTTCCGAAA 
GAACATGAGG 
ACCTCCGGAT- 
CTCAGACTAC 
TCTACAGACT 
TAAATTATAC 
GGATGGTAAA 
GGTTGGTGAG 
AATCAACACA 
CAAAGACAAA 
GTCTCCAAAA. 
AGTAGACAGA 
TCCCATGAAG 
GGGCAGAGGG 
TGACATCTAT 
AGAGCAGATT 
CATGGCAAAA 
GCAATTTAGC 
ATACCAAGAT 
CAGCCATGGA 
TTTGAATAGT 
TGACCTGGTT 



51 
I 

GGATTGAGGG 
CCCCGTCAGG 
CAGTCTCTCC 
GACCCGGGCT 
GATATTCTCT 
TCAACACATG 
CTGACATCAA 
AACGAAAGCG 
CCTGACTCTA 
GATGGGGCCG 
AGAATGTTAG 
GTGTGTGGTT 
GAACCACCTG 
AACACGTGTA 
GAAGAGAGTA 
CGGAACACTT 
CGCATTCTCA 
AGTTACCCAA 
TTTGATATAG 
TTTGATATGG 
GGTGTTTGTC 
GGTTTGACGA 
ACAACTCATG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 

1200 : 

1260 
1320 
1380 . 
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AATTGGGACA TAATTTTGGA GCAGAACATG ATCCGGATGG. TCTAGCAGAA TGTGCCCCGA 1440 

ATGAGGACCA GGGAGGGAAA ■ TATGTCATGT ATCCCATAGC TGTGAGTGGC GATCACGAGA 1500 

ACAATAAGAT GTTTTCAAAC TGCAGTAAAC AATCAATCTA TAAGACCATT GAAAGTAAGG 15S0 

CCCAGGAGTG TTTTCAAGAA ■ CGCAGCAATA AAGTTTGTGG- GAACTCGAGG GTGGATGAAG 1620 

GAGAAGAGTG TGATCCTGGC ATCATGTATC TGAACAAGGA CACCTGCTGC AACAGCGACT 1680 

GCACGTTGAA GGAAGGTGTC CAGTGCAGTG ACAGGAACAG TCCTTGCTGT. AAAAACTGTC 1740 

AGTTTGAGAC TGCCCAGAAG AAGTGCCAGG . AGGCGATTAA' TGCTACTTGC AAAGGCGTGT ' 1800 

CCTACTGCAC AGGTAATAGC AGTGAGTGCC CGCCTCCAGG AAATGCTGAA AATGACACTG 1860 

TTTGCTTGGA "tCTTGGCAAG TGTAAGGATG GGAAATGCAT CCCTTTCTGC GAGAGGGAAC 1920 

AGCAGCTGGA GTCCTGTGCA TGTAATGAAA CTGACAACTC CTGCAAGGTG TGCTGCAGGG 1980 

ACCTTTCTGG CCGCTGTGTG CCCTATGTCG ATGCTGAACA AAAGAACTTA TTTTTGAGGA 2040 

AAGGAAAGCC CTGTACAGTA GGATTTTGTG ACATGAATGG CAAATGTGAG AAACGAGTAC 2100 

AGGATGTAAT- TGAACGATTT TGGGATTTCA TTGACCAGCT GAGCATCAAT ACTTTTGGAA • 2160 

agtttttagc-'agacaacatc' GTTGGGTCTG. TCCTGGTTTT CTCCTTGATA TTTTGGATTC 2220 

CTTTCAGCAT TCTTGTCCAT' TGTGTGGATA AGAAATTGGA TAAACAGTAT GAATCTCTGT 22 BO 

CTCTGTTTCA- CCCCAGTAAC ' GTCGAAATGC TGAGCAGCAT GGATTCTGCA TCGGTTCGCA 2340 

TTATCAAACC CTTTCCTGCG CCCCAGACTC GAGGCCGCCT GCAGCCTGCC CCTGTGATCC 2400 

CTTCGGCGCC AGCAGCTCCA AAACTGGACC- ACCAGAGAAT GGACACCATC CAGGAAGACC. 2460 

CCAGCACAGA CTCCCATATG GACGAGGATG. GGTTTGAGAA GGACCCCTTC CCAAATAGCA 2520 

GCACAGCTGC CAAGTCATTT GAGGATCTCA CGGACCATCC. GGTCGCCAGA AGTGAAAAGG 2580 

CTGCCTCCTT TAAACTGCAG CGTCAGAATC GTGTTAACAG CAAAGAAACA . GAGTGCTAAT 2640 

TTAGTTCTCA GCTCTTCTGA CTTAAGTGTG CAAAATATTT TTATAGATTT GACCTACAAA 2700 
TCAATCACAG- CTTGTATTTT GTGAAGACTG GGAAGTGACT TAGCAGATGC TGGTCATGTG -2760 

TTTGAACTTC CTGCAGGTAA ACAGTTCTTG TGTGGTTTGG CCCTTCTCCT TTTGAAAAGG - 2820 

TAAGGTGAAA GTGAATCTAC TTATTTTGAG GCTTTCAGGT TTTAGTTTTT AAAATATCTT 2880 

TTGACCTGTG GTGCAAAAGC AGAAAATACA GCTGGATTGG GTTATGAATA TTTACGTTTT; 2 94 0 

TGTAAATTAA- TCTTTTATAT TGATAACAGC ACTGACTAGG GAAATGATCA GTTTTTTTTT ■ 3000 

ATACACTGTA ATGAACCGCT ' GAATATGAAG CATTTGGCAT: TTATTTGTGA GAAAAGTGGA" 3060 

ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA' AAACAAAGGA GATAAATTTA 3120 

GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT TTTTATGTAG, 3180 

CAGGGAAAAT- ATATATCTAA' ATTTAGAAAT CATTTGGGTT AATATGGCTC TTCATAATTC 3240 

TAAGACTAAT- GCTCAGAACC TAACCACTAC CTTACAGTGA- GGGCTATACA TGGTAGCCAG 3300 

TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT. TGCTGGGCAG TTTTTCTGTA 3360 

TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA' TACATGTCTT AGAAAATTCA 3420 

CTATTGGCTG GGAGTGGTGG CTCATGCCTG- TAATCCCAGC ACTTGGAGAG GCTGAGGTTG 3480 
CGCCACTACA CTCCAGCCTG GGTGACAGAG: TGAGATCTGC CTC 

Seq ID NOs 555 Protein sequence , . „ ... 

Protein Accession # : NP_003174.2 

] r r r r r 

MRQSLLFLTS WPFVLAPRP PDDPGFGPHQ RLEKLDSLLS DYDILSLSNI QQHSVRKRDL 
QTSTHVETLL TPSALKRHFK LYLTSSTERP SQNFKWWD GKNESEYTAK WQDFFTGHW 

GEPDSRVLAH IRDDDVIIRI' NTDGAEYNIE PLWRFVNDTK DKRMLVYKSE DIKNVSRLQS 180 

PKVCGYLKVD NEELLPKGLV DREPPEELVH RVKRRADPDP MKNTCKLLW ADHRFYRYMG 240 

RGEESTTTNY- LIELIDRVDD IYRNTSWDNA ' GFKGYGIQIE QIRILKSPQE VKPGEKHYNM 300 

AKSYPNEEKD AWDVKMLLEQ PSFDIAEEAS KVCLAHLFTY QDFDMGTLGL AYVGSPRANS 360 

HGGVCPKAYY SPVGKKNIYL NSGLTSTKNY GKTILTKEAD LVTTHELGHN FGAEHDPDGL 420 

AECAPNEDQG GKYVMYPIAY SGDHENNKMF, SNCSKQSIYK TIESKAQECF QERSNKVCGN 480 

SRVDEGEECD PGIMYLNNDT CCNSDCTLKE GVQCSDRNSP CCKNCQFETA QKKCQEAINA 540 

TCKGVSYCTG 1 NSSECPPPGN AENDTVCLDL GKCKDGKCIP FCEREQQIiES CACNETDNSC 600 

KVOCRDhSGR CVPYVDAEQK NLFLRKGKPC TVGFCDMNGK CEKRVQDVIE RFWDFIDOLS 660 

INTFGKFLAD MIVGSVLVFS- LIFWIPFSIL VHCVDKKLDK QYESLSLFHP SNVEMLSSMD 720 

SASVRIIKPF PAPQTPGRLQ PAPVIPSAPA APKLDHQRMD TIQEDPSTDS HMDEDGFEKD 780 
PFPNSSTAAK SFEDLTDHPV ARSEKAASFK LQRQNRVNSK ETEC 

Seq ID NO: 556 DNA sequence 

Nucleic Acid Accession #: NM_021832.1 

Coding sequence: 164.. 2248 

1 11 21 31 41 51 

TCGAGGCTGG CGGTAGAATC TTCCCAGTAG GCGGCGCGGG AGGAAAAGAG GATTGAGGGG 60 

CTAGGCCGGG CGGATCCCGT CCTCCCCCGA TGTGAGCAGT TTTCCGAAAC CCCGTCAGGC 120 

GAAGGCTGCC GAGAGAGGTG GAGTCGGTAG CGGGGCCGGG AACATGAGGC AGTCTCTCCT 180 

ATTCCTGACC AGCGTGGTTC CTTTCGTGCT GGCGCCGCGA CCTCCGGATG ACCCGGGCTT 240 

CGGCCCCCAC CAGAGACTCG AGAAGCTTGA TTCTTTGCTC TCAGACTACG AT ATT CT CTC 300 

TTTATCTAAT ATCCAGCAGC ATTCGGTAAG AAAAAGAGAT CTACAGACTT CAACACATGT • 360 

AGAAACACTA CTAACTTTTT CAGCTTTGAA AAGGCATTTT AAATTATACC TGACATCAAG 420 

TACTGAACGT TTTTCACAAA ATTTCAAGGT CGTGGTGGTG GATGGTAAAA ACGAAAGCGA 480 

GTACACTGTA AAATGGCAGG ACTTCTTCAC TGGACACGTG GTTGGTGAGC CTGACTCTAG 540 

GGTTCTAGCC CACATAAGAG ATGATGATGT TATAATCAGA ATCAACACAG ATGGGGCCGA . 600 

ATATAACATA GAGCCACTTT GGAGATTTGT TAATGATACC AAAGACAAAA GAATGTTAGT 660 

TTATAAATCT GAAGATATCA AGAATGTTTC ACGTTTGCAG TCTCCAAAAG TGTGTGGTTA 720 

TTTAAAAGTG GATAATGAAG AGTTGCTCCC AAAAGGGTTA GTAGACAGAG AACCACCTGA - 780 

AGAGCTTGTT CATCGAGTGA AAAGAAGAGC TGACCCAGAT CCCATGAAGA ACACGTGTAA - 840 

ATTATTGGTG GTAGCAGATC ATCGCTTCTA CAGATACATG GGCAGAGGGG AAGAGAGTAC 900 

AACTACAAAT TACTTAATAG AGCTAATTGA CAGAGTTGAT GACATCTATC GGAACACTTC 960 

ATGGGATAAT GCAGGTTTTA AAGGCTATGG AATACAGATA GAGCAGATTC GCATTCTCAA 1020 

GTCTCCACAA GAGGTAAAAC CTGGTGAAAA GCACTACAAC ATGGCAAAAA GTTACCCAAA 1080 

TGAAGAAAAG GATGCTTGGG ATGTGAAGAT GTTGCTAGAG CAATTTAGCT TTGATATAGC 1140 

TGAGGAAGCA TCTAAAGTTT GCTTGGCACA CCTTTTCACA TACCAAGATT TTGATATGGG 1200 

AACTCTTGGA- TTAGCTTATG TTGGCTCTCC CAGAGCAAAC AGCCATGGAG GTGTTTGTCC 1260 

AAAGGCTTAT TATAGCCCAG TTGGGAAGAA AAATATCTAT TTGAATAGTG GTTTGACGAG 1320 
CACAAAGAAT TATGGTAAAA CCATCCTTAC AAAGGAAGCT GACCTGGTTA CAACTCATGA '1380 

ATTGGGACAT AATTTTGGAG CAGAACATGA TCCGGATGGT CTAGCAGAAT GTGCCCCGAA 1440 



60 
120 
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TGAGGACCAG GGAGGGAAAT ATGTCATGTA TCCCATAGCT GTGAGTGGCG ATCACGAGAA 1500 

CAATAAGATG TTTTCAAACT GCAGTAAACA ATCAATCTAT AAGACCATTG AAAGTAAGGC 1560 

CCAGGAGTGT TTTCAAGAAC GCAGCAATAA AGTTTGTGGG AACTCGAGGG TGGATGAAGG 1620 

AGAAGAGTGT GATCCTGGCA . TCATGTATCT GAACAACGAC ACCTGCTGCA ACAGOGACTG 1680 

CACGTTGAAG GAAGGTGTCC AGTGCAGTGA CAGGAACAGT CCTTGCTGTA AAAACTGTCA .1740 

GTTTGAGACT GCCCAGAAGA - AGTGCCAGGA GGCGATTAAT GCTACTTGCA AAGGCGTGTC 1800 

CTACTGCACA GGTAATAGCA GTGAGTGCCC GCCTCCAGGA AATGCTGAAG ATGACACTGT 1860 

TTGCTTGGAT CTTGGCAAGT. GTAAGGATGG- GAAATGCATC CCTTTCTGCG AGAGGGAACA 1920 

GCAGCTGGAG- TCCTGTGCAT GTAATGAAAC. TGACAACTCC TGCAAGGTGT GC7GCAGGGA 1980 

CCTTTCCGGC CGCTGTGTGC , CCTATGTCGA TGCTGAACAA AAGAACTTAT TTTTGAGGAA. 2040 

AGGAAAGCCC TGTACAGTAG - GATTTTGTGA CATGAATGGC AAATGTGAGA AACGAGTACA- 2100 

GGATGTAATT GAACGATTTT GGGATTTCAT. TGACCAGCTG. AGCATCAATA CTTTTGGAAA 2160 

GTTTTTAGCA GACAACATCG TTGGGTCTGT CCTGGTTTTC- TCCTTGATAT TTTGGATTCC 2220 

TTTCAGCATT CTTGTCCATT GTGTGTAACG TCGAAATGCT GAGCAGCATG GATTCTGCAT 2280 

GGGTTCGCAT TATCAAACCC TTTCCTGCGC CCCAGACTCC AGGCCGCCTG CAGCCTGCCC 2340 

CTGTGATCCC TTCGGCGCCA GCAGCTCCAA AACTGGACCA CCAGAGAATG GACACCATCC 2400 

AGGAAGACCC CAGCACAGAC TCACATATGG . ACGAGGATGG GTTTGAGAAG GACCCCTTCC 2460 

CAAATAGCAG CACAGCTGCC AAGTCATTTG AGGATCTCAC GGACCATCCG GTCACCAGAA 2520 

GTGAAAAGGC TGCCTCCTTT AAACTGCAGC GTCAGAATCG TGTTGACAGC AAAGAAACAG 2580 

AGTGCTAATT TAGTTCTCAG CTCTTCTGAC- TTAAGTGTGC AAAATATTTT TATAGATTTG 2640 

ACCTACAATC AATCACAGCT TATATTTTGT GAAGACTGGG: AAGTGACTTA GCAGATGCTG: 2700 

GTCATGTGTT TGAACTTCCT GCAGGTAAAC AGTTCTTGTG TGGTTTGGCC CTTCTCCTTT 2760 

TGAAAAGGTA r AGGTGAAGGT .GAATCTAGCT TATTTTGAGG CTTTCAGGTT TTAGTTTTTA. 2820 

AAATATCTTT TGACCTGTGG TGCAAAAGCA GAAAATACAG; CTGGATTGGG TTATGAGTAT; 2880 

TTACGTTTTT GTAAATTAAT • CTTTTATATT GATAACAGGC ACTGACTAGG GAAATGATCA 2940 

GTTTTTTTTT ATACACTGTA ATGAACCGCT GAATATGAAG' CATTTGGCAT TTATTTGTGA" 3000 

GAAAAGTGGA ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA" AAACAAAGGA 3060 

GATAAATTTA GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT, 3120 

TTTTATGTAG CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC 3180 

TTCATAATTC TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA 3240 

TGGTAGCCAG TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG.. 3300 

TTTTTCTGTA TTTTATAAGT ATCTTCATGT- ATCCCTGTTA CTGATAGGGA TACATGTCTT- 3360 

AGAAAATTCA CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG 3420 
3421 GCTGAGGTTG CGCCACTACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC CTC 

Seq ID NO: 557 Protein sequence 
Protein Accession ft: NP_06B604.1 

ilRQSLLFLTS WFFVLAPRP PDDPGFGPHQ. RLEKLDSLLS DYDILSLSNI WKSVRKRI* 60 

QTSTHVETLL TFSALKRHFK LYLTSSTERF SQNFKWWD GKNESEYTVK WQDFFTGHW 120 

GEPDSRVLAH IRDDDVIIRI- NTDGAEYNIE PLWRFVNDTK DKRMLVYXSE DIKNVSRLQS- 180 

P^VCGYLKVD NEELLPKGLV DREPPEELVH RVKRRADPDP KKNTCKLLW ADHRFYRYMG 240 

RGEESTTTNY LIELIDRVDD IYRNTSWDNA GFKGYGIQIE QIRILKSPQE VKPGEKHYNM. 300 

AKSYPNEEKD AWDVKMLLEQ. FSFDIAEEAS KVCLAHLFTY QDFDMGTLGL AYVGSPRANS 3 60 

HGGVCPKAYY SPVGKKNIYL NSGLTSTKNY- GKTILTKEAD LVTTHELGHN FGAmDPDGL 420 

AECAPNEDQG GKYVMYP I AV ■ SGDHENNKMF SNCSKQSIYK T IESKAQECF WSNKVOG* 480 

SRVDEGEECD PGIMYLNNDT. CCNSDCTLKE GVQCSDRNSP CCKNCQFETA QKKCQEAINA- • 540 

TCKGVSYCTG NSSECPPPGN.AEDDTVCLDL GKCKDGKCIP FCEREQQLES CACNETDNSC • 600 

KVCCRDLSGR CVPYVDAEQK NLFLRKGKPC TVGFCDMNGK CEKRVQDVIE RFWDFIDQLS 660 
INTFGKFLAD NIVGSVLVFS LIFWIPPSIL VHCV 

Seq ID NO: 558 DNA sequence 

Nucleic Acid Accession fh NM_004994.1 

Coding sequence: 20.. 2143 

1 11 21 31 41 51 

AGACACCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 60 

GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 120 

CCTGAGAACC AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 180 

CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGGCCTGCGC TGCTGCTTCT 240 

CCAGAAGCAA CTGTCCCTGC CCGAGACCGG TGAGCTGGAT AGCGCCACGC TGAAGGCCAT 300 

GCGAACCCCA CGGTGCGGGG. TCCCAGACCT GGGCAGATTC CAAACCTTTG AGGGCGACCT 360 

CAAGTGGCAC CACCACAACA' TCACCTATTG GATCCAAAAC TACTCGGAAG ACTTCCCGCG 420 

GGCGGTGATT GACGACGCCT- TTGCCCGCGC CTTCGCACTG TGGAGCGCGG TGACGCCGCT 480 

CACCTTCACT CGCGTGTACA GCCGGGACGC AGACATCGTC ATCCAGTTTG GTGTCGCGGA 540 

GCACGGAGAC GGGTATCCCT TCGACGGGAA GGACGGGCTC CTGGCACACG CCTTTCCTCC 600 

-TGGCCCCGGC ATTCAGGGAG ACGCCCATTT- CGACGATGAC GAGTTGTGGT COTGGGCAA 660 

GGGCGTCGTG GTTCCAACTC GGTTTGGAAA CGCAGATGGC GCGGCCTGCC ACTTCCCCTT 720 

CATCTTCGAG GGCCGCTCCT ACTCTGCCTG CACCACCGAC GGTCGCTCCG ACGGCTTGCC 780 

SgG^CAGT ACCACGGCCA ACTACGACAC CGACGACCGG TTTGGCTTCT GCCCCAGCGA 840 

gag^ctac acccgggacg gcaatgctga tgggaaaccc tgccagtttc ca^catctt 900 

CCAAGGCCAA TCCTACTCCG CCTGCACCAC GGACGGTCGC TCCGACGGCT ACCGCTGGTG 960 

CG^CCA^ GCCAACTACG ACCGGGACAA GCTCTTCGGC TTCTGCCCGA CCGGAGCTGA 1020 

CTCGACGGTG ATGGGGGGCA ACTCGGCGGG GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080 

GGGTAAGGAG TACTCGACCT GTACCAGCGA GGGCCGCGGA GATGGGCGCC TCTGGTGCGC 1140 

TACCA^CTCG aXcTTTGACA- GCGACAAGAA- GTGGGGCTTC TGCCCGGACC AAGGATACAG 1200 

TTTGTTCCTC GTGGCGGCGC ATGAGTTCGG CCACGCGCTG GGCTTAGATC ATTCCTCAGT 1260 

GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG GGGCCCCCCT TGCATAAGGA 1320 

CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT GAACCTGAGC CACGGCCTCC 1380 

AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC ^CCCCACCG GACCCCCCAC 1440 

TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT CCGCOTCAG C^GCCCCAC 1500 

AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG CCTTTGAGTC CGGTGGACGA 1560 

TGC^TGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620 

CAAgS£ ^SgGC GATTCTCTGA GGGCAGGGGG AGCCGGCCGC AGGGCCCCTT 1680 
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CCTTATCGCC GACAAGTGGC CCGOGCT6CC CCGCAAGCTG GACTCGGTCT TTGAGGAGCC - 1740 

GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG TGGGTGTACA CAGGCGCGTC 1800 

- GGTGCTGGGC CCGA6G06TC TGGACAAGCT GGGCCTGGGA GCCGACGTGG CCCAGGTGAC 'i860 

CGGGGCCCTC CGGAGTGGCA GGGGGAAGAT GCTGCTGTTC AGCGGGCGGC GCCTCTGGAG , 1920. 

GTTCGACGTG AAGGCGCAGA TGGTGGATCC CCGGAGCGCC AGCGAGGTGG ACCGGATGTT 1980 

CCCCGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC CGAGAGAAAG CCTATTTCTG 2040 

CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG TTGAACCAGG TGGACCAAGT 2100 

GGGCTACGTG. ACCTATGACA TCCTGCAGTG CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160 

GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA AGGAGCCAGT TTGCCGGATA 2220 

CAAACTGGTA TTCTGTTCTG GAGGAAAGGG AGGAGTGGAG GTGGGCTGGG CCCT CTCTTC 2280 
TCACCTTTGT TTTTTGTTGG AGTGTTTCTA ATAAACTTGG ATTCTCTAAC CTTT 

Seq ID NO: 559 Protein sequence 
Protein Accession 8: NP_004985.1 

1 11 21 31 41 51 

I • I I I I I 

MSLWQPLVLV LLVLGCCFAA PRQRQSTLVL FPGDLRTNLT DRQIAEEYLY RYGYTRVAEM 60 

RGESKSLGPA LLLLQKQLSL PETGELDSAT LKAMRTPRCG VPDLGRFQTF EGDLKWHHHN. 120 
ITYWIQNYSE DLPRAVIDDA FARAFALWSA VTPLTFTRVY" SRDADIVIQF GVAEHGDGYP ■ 180 
FDGKDGLLAH APPPGPGIQG DAHFDDDELW SLGKGVWPT RFGKADGAAC HFPFIFEGRS . 240 

YSACTTDGRS DGLPWCSTTA NYDTDDRFGF CPSERLYTRD GNADGKPCQF- PFIFQGQSYS" 300 

ACTTDGRSDG YRWCATTANY DRDKLFGFCP TRADSTVMGG NSAGELCVFP FTFLGKEYST 3 60 

CTSEGRGDGR LWCATTSNFD SDKKWGFCPD QGYSLFLVAA HEFGHALGLD HSSVPEALMY 420 

PMYRFTEGPP LHKDDVNGIR HLYGPRPEPE PRPPTTTTPQ PTAPPTVCPT GPPTVHPSER "480 

PTAGPTGPPS AGPTGPPTAG PSTATTVPLS PVDDACNVNI FDAIAEIGNQ LYLFKDGKYW 540 

RFSEGRGSRP QGPFLIADKW PALPRKLDSV FEEPLSKKLF FFSGRQVWVY TGASVLGPRR 600 

LDKLGLGADV AQVTGALRSG RGKMLLFSGR RLWRFDVKAQ MVDPRSASEV DRMFPGVPLD '660 
THDVFQYREK AYFCQDRFYW RVSSRSBUJQ VDQVGYVTYD ILQCPED 

Seq ID NO: 560 DNa' sequence. 

Nucleic Acid Accession ft: NM_000213.1 

Coding sequence : 127 .. 5385 

1 11 21 31 41 51 

I I I I I I 

CGCCCGCGCG CTGCAGCCCC ATCTCCTAGC GGCAGCCCAG GCGCGGAGGG AGCGAGTCCG 60 

CCCCGAGGTA GGTCCAGGAC GGGCGCACAG" CAGCAGCCGA GGCTGGCCGG- GAGAGGGAGG 120 

AAGAGGATGG CAGGGCCACG CCCCAGCCCA TGGGCCAGGC TGCTCCTGGC AGCCTTGATC 1B0 

AGCGTCAGCC TCTCTGGGAC CTTGGCAAAC CGCTGCAAGA AGGCCCCAGT GAAGAGCTGC 240 

ACGGAGTGTG TCCGTGTGGA TAAGGACTGC GCCTACTGCA CAGACGAGAT GTTCAGGGAC 300 

CGGCGCTGCA ACACCCAGGC GGAGCTGCTG GCCGCGGGCT GCCAGCGGGA GAGCATCGTG 360 

GTCATGGAGA GCAGCTTCCA AATCACAGAG GAGACCCAGA TTGACACCAC CCTGCGGCGC 420 

AGCCAGATGT CCCCCCAAGG CCTGCGGGTC CGTCTGOGGC CCGGTGAGGA GCGGCATTTT 480 

GAGCTGGAGG TGTTTGAGCC ACTGGAGAGC CCCGTGGACC TGTACATCCT CATGGACTTC 540 

TCCAACTCCA TGTCCGATGA TCTGGACAAC CTCAAGAAGA TGGGGCAGAA CCTGGCTCGG 600 

GTCCTGAGCC AGCTCACCAG CGACTACACT ATTGGATTTG GCAAGTTTGT GGACAAAGTC 660 

AGCGTCCCGC AGACGGACAT GAGGCCTGAG AAGCTGAAGG AGCCCTGGCC CAACAGTGAC 720 

CCCCCCTTCT CCTTCAAGAA CGTCATCAGC CTGACAGAAG ATGTGGATGA GTTCCGGAAT 780 

AAACTGCAGG GAGAGCGGAT CTCAGGCAAC CTGGATGCTC CTGAGGGCGG CTTCGATGCC 840 

ATCCTGCAGA CAGCTGTGTG. CACGAGGGAC" ATTGGCTGGC GCCCGGACAG CACCCACCTG 900 

CTGGTCTTCT CCACCGAGTC AGCCTTCCAC TATGAGGCTG ATGGCGCCAA CGTGCTGGCT 960 

GGCATCATGA GCCGCAACGA TGAACGGTGC. CACCTGGACA CCACGGGCAC CTACACCCAG 1020 

TACAGGACAC AGGACTACCC GTCGGTGCCC ACCCTGGTGC GCCTGCTCGC CAAGCACAAC 1080 

ATCATCCCCA TCTTTGCTGT CACCAACTAC TCCTATAGCT ACTACGAGAA GCTTCACACC 1140 

TATTTCCCTG TCTCCTCACT GGGGGTGCTG CAGGAGGACT CGTCCAACAT CGTGGAGCTG 1200 

CTGGAGGAGG CCTTCAATCG GATCCGCTCC AACCTGGACA TCCGGGCCCT AGACAG CCCC 1260 

CGAGGCCTTC GGACAGAGGT CACCTCCAAG ATGTTCCAGA AGACGAGGAC TGGGTCCTTT 1320 

CACATCCGGC GGGGGGAAGT GGGTATATAC CAGGTGCAGC TGCGGGCCCT TGAGCACGTG 1380 

GATGGGACGC ACGTGTGCCA GCTGCCGGAG GACCAGAAGG GCAACATCCA TCTGAAACCT 1440 

TCCTTCTCCG ACGGCCTCAA GATGGACGCG GGCATCATCT GTGATGTGTG CACCTGCGAG 1500 

CTGCAAAAAG AGGTGCGGTC AGCTCGCTGC AGCTTCAACG GAGACTTCGT GTGCGGACAG 1560 

TGTGTGTGCA GCGAGGGCTG GAGTGGCCAG ACCTGCAACT GCTCCACCGG CTCTCTGAGT 1620 

GACATTCAGC CCTGCCTGCG GGAGGGCGAG GACAAGCCGT GCTCCGGCCG TGGGGAGTGC 1680 

CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCCGCTACG AGGGTCAGTT CTGCGAGTAT 1740 

GACAACTTCC AGTGTCCCCG CACTTCCGGG TTCCTCTGCA ATGACCGAGG ACGCTGCTCC 1800 

ATGGGCCAGT GTGTGTGTGA GCCTGGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC 1860 . 

AATGCCACCT GCATCGACAG CAATGGGGGC ATCTGTAATG GACGTGGCCA CTGTGAGTGT 192 0 

GGCCGCTGCC ACTGCCACCA GCAGTCGCTC. TACACGGACA CCATCTGCGA GATCAACTAC 1980 

TCGGCGATCC ACCCGGGCCT CTGCGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 2040 

GGCACCGGCG AGAAGAAGGG ■ GCGCACGTGT GAGGAATGCA ACTTCAAGGT CAAGATGGTG 2100 

GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGCGCTGCT CCTTCCGGGA CGAGGATGAC 2160 

GACTGCACCT ACAGCTACAC CATGGAAGGT GACGGCGCCC CTGGGCCCAA CAGCACTGTC 2220 

CTGGTGCACA AGAAGAAGGA CTGCCCTCCG GGCTCCTTCT GGTGGCTCAT CCCCCTGCTC 2280 

CTCCTCCTCC TGCCGCTCCT GGCCCTGCTA CTGCTGCTAT GCTGGAAGTA CTGTGCCTGC 2340 

TGCAAGGCCT ' GCCTGGCACT TCTCCCGTGC TGCAACCGAG GTCACATGGT GGGCTTTAAG 2400 

GAAGACCACT. ACATGCTGCG GGAGAACCTG ATGGCCTCTG ACCACTTGGA CACGCCCATG 2460 

CTGCGCAGCG GGAACCTCAA GGGCCGTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 2520 

CAGCGGCCTG GCTTTGCCAC TCATGCCGCC AGCATCAACC CCACAGAGCT GGTGCCCTAC 2580. 

GGGCTGTCCT TGCGCCTGGC CCGCCTTTGC ACCGAGAACC TGCTGAAGCC TGACACTCGG 2640 

GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA ACGAGGTCTA CAGGCAGATC 2700 

TCGGGTGTAC ACAAGCTCCA GCAGACCAAG TTCCGGCAGC AGCCCAATGC CGGGAAAAAG 2760 

CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCGAA GCCGGCCCTG 2 82 0 . 

CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCTTCC ACGACCTCAA GGTGGCCCCC 2880 

GGCTACTACA CCCTCACTGC AGACCAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 2940 

GTGGAGCTGG TGGACGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGACGAGAAG 3000- 

CAGCTGCTGG TGGAGGCCAT CGACGTGCCC GCAGGCACTG CCACCCTCGG CCGCCGCCTG 3060 



398 
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GTAAACATCA CCATCATCAA GGAGCAAGCC AGAGACGTGG TGTCCTTTGA GCAGCCTGAG 3120 
TTCTCGGTCA GCCGCGGGGA CCAGGTGGCC CGCATCCCTG TCATCCGGCG TGTCCTGGAC 3180' 
GGCGGGAAGT CCCAGGTCTC CTACCGCACA .CAGGATGGCA CCGCGCAGGG CAACCGGGAC 3240 
TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 3300 
GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 3360 
TTCCACGTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGCC CCACTCCACC 3420 
ACCATCATCA TCAGGGACCC AGATGAACTG GACCGGAGCT TCACGAGTCA GATGTTGTCA 3480 
TCACAGCCAC CCCCTCAOGG . CGACCTGGGC GCCCCGCAGA ACCCCAATGC TAAGGCCGCT 3540 
GGGTCCAGGA AGATCCATTT CAACTGGCTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 3600 
GTAAAGTACT GGATTCAGGG- TGACTCCGAA TCCGAAGCCC. ACXTTGCTCGA CAGCAAGGTG 3660 
CCCTCAGTGG AGCTCACCAA CCTGTACCCG; TATTGCGACT ATGAGATGAA GGTGTGCGCC 3720 
TACGGGGCTC AGGGCGAGGG ACCCTACAGC TCCCTGGTGT CCTGCCGCAC CCACCAGGAA 3780 
GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATGTCGTCT CCTCCACGGT GACCCAGCTG 3840 
AGCTGGGCTG AGCCGGCTGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC .3900 
CTGGTCAACG ATGACAACCG ACCTATTGGG. CCCATGAAGA AAGTGCTGGT TGACAACCCT '3960 
AAGAACCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGCCCTACCG CTACACGGTG 4020 
AAGGCGCGCA ACGGGGCCGG CTGGGGGCCT GAGCGGGAGG CCATCATCAA CCTGGCCACC 4080 
CAGCCCAAGA GGCCCATGTC CATCCCCATC. ATCCCTGACA- TCCCTATCGT GGACGCCCAG 414 0 
AGCGGGGAGG ACTACGACAG CTTCCTTATG- TACAGCGATG ACGTTCTACG CTCTCCATCG- -4200 
GGCAGCCAGA GGCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC - 4260 
TTTGCCTTCC CGGGCAGCAC CAACTCCCTG- CACAGGATGA CCACGACCAG TGCTGCTGCC 4320 
TATGGCACCC. ACCTGAGCCC ACACGTGCCC CACCGCGTGC. TAAGCACATC CTCCACCCTC. 4380. 
ACACGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTCGACCAC ACTGCCGAGG 4440 
GACTACTCCA CCCTCACCTC CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC .4500- 
GACACGCCCA CCCGCCTGGT- GTTCTCTGCC CTGGGGCCCA CATCTCTCAG AGTGAGCTGG 4560 
CAGGAGCCGC GGTGCGAGCG. GCCGCTGCAG. GGCTACAGTG : TGGAGTACCA GCTGCTGAAC 4620 
GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AGACCTCGGT GGTGGTGGAA ■ 4680 
GACCTCCTGC CCAACCACTC CTACGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGG 4740 . 
GGCCGAGAGC GTGAGGGTGT, CATCACCATT- GAATCCCAGG TGCACCCGCA GAGCCCACTG 4800 
TGTCCCCTGC. CAGGCTCCGC CTTCACTTTG; AGCACTCCCA GTGCCCCAGG CCCGCTGGTG 4860 
TTCACTGCCC TGAGCCCAGA CTOGCTGCAG CTGAGCTGGG AGCGGCCACG GAGGCCCAAT 4 920 
GGGGATATCG TCGGCTACCT" GGTGACCTGT GAGATGG CCC AAGGAGGAGG GCCAGCCACC 4980 
GCATTCCGGG TGGATGGAGA' CAGCCCCGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 5040 
AACGTGCCCT' ACAAGTTCAAT GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 5100 
GAGGGCATCA TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCGT - 5160 
GCCGGGCTCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA" GCATCACCAC CACCCACACC 522 0 
AGCGCCACCG AGCCCTTCCT- AGTGGATGGG CCGACCCTGG 1 GGGCCCAGCA" CCTGGAGGCA 5280 
GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA- GCCGGACACT GACCACCAGC- 5340 
GGAACCCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA. .5400 
CCCCCGCCAT GTCCCACTAG GCGTCCTCCC GACTCCTCTC CCGGAGCCTC CTCAGCTACT 5460 
OCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 5520. 
TCCTGGGAGG CATGAAGGGG GCAAGGTCCG TCCTCTGTGG GCCCAAACCT ATTTGTAACC 5580 
AAAGAGCTGG GAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 5640 
TACTG , 

Seq ID NO: 561 Protein sequence 
Protein Accession ft: NP_000204.l 

1 11''. '. 21 31 41 51 

j I • I I" = I I 

MAGPRPSPWA RLLLAALISV SLSGTLANRC KKAPVKSCTB CVRVDKDCAY CTDEMFRDRR 60 

CNTQABLLAA GCQRESIWM ESSFQITEET QIDTTLRRSQ MSPQGUWRL RPGEERHFEL 120 

EVFEPLESPV- DLYILMDFSN- SMSDDLDNLK- KMGQNLARVL SQLTSDYTIG FGKFVDKVSV- iao' 

PQTDMRPEKL KEPWPNSDPP FSFKNVISLT EDVDEFRNKL QGERISGNLD APEGGFDAIL 240 

QTAVCTRDIG WRPDSTHLLV - FSTESAFHYE ADGANVLAGI MSRNDERCHL DTTGTYTQYR 300. 

TQDYPSVPTL VRLLAKHNTI PIFAVTNYSY- SYYEKLHTYP- PVSSLGVLQE DSSNXVELLE 360 

EAFNRIRSNL DIRALDSPRG LRTEVTSKMF QKTRTGSFHI- RRGEVGIYQV QLRALEHVDG 420 

THVCQLPEDQ KGNIHLKPSF SDGLKKDAGI ICDVCTCELQ KEVRSARCSP KGDFVCGQCV 480 

CSEGWSGQTC NCSTGSLSDI ' QPCLREGEDK PCSGRGECQC GHCVCYGEGR YEGQFCEYDN 540 

FQCPRTSGFL CNDRGRCSMG QCVCEPGWTG PSCDCPLSNA TCIDSNGGIC NGRGHCECGR 600 

CHCHQQSLYT DTICEINYSA IHPGLCEDLR- SCVQCQAWGT GEKKGRTCEE CNFKVKMVDE 660 

LKRAEEVWR CSFRDEDDDC TYSYTMEGDG APGPNSTVLV HKKKDCPPGS FWWLIPLLLL " 720 

LLPLLALLLL LCWKYCACCK ACLALLPCCN RGHMVGFKED HYMLRENLMA SDHLDTPMLR 780 

SGNLKGRDW RWKVTNNMQR PGFATHAASI NPTELVPYGL SLRLARLCTE NLLKPDTREC 840 

AQLRQEVEEN LNEVYRQISG VHKLQQTKFR QQPNAGKKQD HTIVDTVLMA PRSAKPALLK 900 

LTEKQVEQRA PHDLKVAPGY YTLTADQDAR GMVEFQEGVE LVDVRVPLFI RPEDDDEKQL 960 

LVEAIDVPAG TATLGRRLVN ITIIKEQARD WSFEQPEFS. VSRGDQVARI PVIRRVLDGG 1020 

KSQVSYRTQD GTAQGNRDYI PVEGELLFQP GEAWKELQVK LLELQEVDSL LRGRQVRRFH 1080 

VQLSNPKFGA'HLGQPHSTTI IIRDPDELDR SFTSQMLSSQ PPPHGDLGAP QNPNAKAAGS 1140 

RKIHFNWLPp; SGKPMGYRVK YWIQGDSESE AHLLDSKVPS VELTNLYPYC DYEMKVCAYG 1200 

AQGEGPYSSt, VSCRTHQEVP SEPGRLAFNV; VSSTVTQLSW* ABPAETNGEI TAYEVCYGLV 1260 

NDDNRP IGPM . KKVLVDUPKN RMLLIENLRe' SQPYRYTVKA' RNGAGWGPER EAIINLATQP 1320 

FCRPMSIPIIP-DIPIVDAQSG EDYDSFLMYS DDVLRSPSGS QRPSVSDDTE HLVNGRMDFA 1380 

FPGSTNSLHR. MTTTSAAAYG THLSPHVPHR VLSTSSTLTR DYNSLTRSEH SHSTTIiPRDY 1440 

STLTSVSSHD SRLTAGVPDT PTRLVFSALG PTSLRVSWQE PRCERPLQGY SVEYQLLNGG 1500 

ELHRLNIPNP AQTSVWEDL LPNHSYVFRV RAQSQEGWGR EREGVITIES QVHPQSPLCP 1560 

LPGSAFTLST PSAPGPLVFT ALSPDSLQLS WERPRRPNGD IVGYLVTCEM AQGGGPATAF 1620 

RVDGDSPESR LTVPGLSENV PYKFKVQART TEGFGPEREG IITIESQDGG PFPQLGSRAG 1680 

LFQHPLQSEY SSITTTHTSA TEPFLVDGPT LGAQHIiEAGG SLTRHVTQEF VSRTLTTSQT 1740 
LSTHMDQQPF QT^ 

Seq ID NO: 562 DNA sequence 

Nucleic Acid Accession #: NM_013332.1 

Coding sequence « 1..63 

1 ' 11 21 31 41 51 

r ; : [ r. i • i i i 
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GCACGAGGGC GCTTTTGTCT CCGGTGAGTT TTGTGGCGGG AAGCTTCTGC GCTGGTGCTT 
AGTAACCGAC TTTCCTCOGG ACTCCTGCAC GACCTGCTCC TACAGCCGGC GATCCACTCC 
CGGCTGTTCC CCCGGAGGGT CCAGAGGCCT TTCAGAAGGA GAAGGCAGCT CTGTTTCTCT 
GCAGAGGAGT AGGGTCCTTT CAGCCATGAA GCATGTGTTG AACCTCTACC TGTTAGGTGT 
GGTACTGACC CTACTCTCCA TCTTCGTTAG AGTGATGGAG TCCCTAGAAG GCTTACTAGA 
GAGCCCATCG CCTGGGACCT CCTGGACCAC CAGAAGCCAA CTAGCCAACA CAGAGCCCAC 
CAAGGGCCTT CCAGACCATC CATCCAGAAG CATGTGATAA GACCTCCTTC CATACTGGCC 
ATATTTTGGA- ACACTGACCT AGACATGTCC AGATGGGAGT CCCATTCCTA GCAGACAAGC 
TGAGCACCGT TGTAACCAGA GAACTATTAC TAGGCCTTGA AGAACCTGTC TAACTGGATG. 
CTCATTGCCT GGGCAAGGCC TGTTTAGGCC GGTTGCGGTG GCTCATGCCT- GTAATCCTAG 
CACTTTGGGA GGCTGAGGTG GGTGGATCAC CTGAGGTCAG GAGTTCGAGA CCAGCCTCGC 
CAACATGGCG AAACCCCATC TCTACTAAAA ■ ATACAAAAGT TAGCTGGGTG TGGTGGCAGA 
GGCCTGTAAT CCCAGTTCCT TGGGAGGCTG AGGCGGGAGA ATTGCTTGAA CCCGGGGACG- 
GAGGTTGCAG TGAACCGAGA TCGCACTGCT- GTACCCAGCC' TGGGCCACAG TGCAAGACTC 
CATCTCAAAA AAAAAAAGAA AAGAAAAAGC CTGTTTAATG" CACAGGTGTG AGTGGATTGC 
TTATGGCTAT GAGATAGGTT GATCTCGCCC TTACCCCGGG GTCTGGTGTA TGCTGTGCTT 
TCCTCAGCAG TATGGCTCTG ACATCTCTTA- GATGTCCCAA CTTCAGCTGT TGGGAGATGG 
TGATATTTTC AACCCTACTT CCTAAACATC, TGTCTGGGGT ' TCCTTTAGTC TTGAATGTCT • 
TATGCTCAAT TATTTGGTGT TGAGCCTCTC TTCCACAAGA 1 GCTCCTCCAT GTTTGGATAG 
CAGTTGAAGA GGTTGTGTGG GTGGGCTGTT GGGAGTGAGG ATGGAGTGTT CAGTGCCCAT 
TTCTCATTTT ACATTTTAAA GTCGTTCCTC CAACATAGTG TGTATTGGTC TGAAGGGGGT 
GGTCGGATGC CAAAGCCTGC TCAAGTTATG. GACATTGTGG- CCACCATGTG GCTTAAATGA 
TTTTTTCTAA CTAAT AAAGT • GGAATAT AT A TTTCAAAAAA AAAAAAAAAA AA 

Seq ID NO: 563 Protein sequence 
Protein Accession ft: NP_037464.1 

1 11 . 21. 31 41 51 . . 

HKHVLNLYLL GVVLTLLSIP VRVMESLEGL LESPSPGTSW TTRSQLANTE PTKGLPDHPS 
RSM 

Seq ID NO: 564 DNA sequence 

Nucleic Acid Accession. ft; NM_023915.1 

Coding sequence i 250 . . 1326 



PCTAJS02/12476 



GGCACGAGGG 
TCAAAGCTTA- 
GTGAATGGAC 
CCCACGCCTC 
AACTGAAGAA 
CAAGAGAGTC 
AATGAATTTG 
TTGCTGAATG 
TTCTATCTCA 
ATAGTCCATG 
TCAGTTTTGT 
GATCGCTATC 
ACGAAGGTTT 
.ATCCTGACAA 
CCTTTGGGGG 
GTGCTGGTGA 
AGGCAATTCA 
GTGGCTGTGT. 
AGTCACTTAG. 
ATTACACTTT 
TGTAGGTCAT 
ATCAGATCAC ■ 
GTGTAGGCCT 
TTCATTATCC 



11 

I " 

TTTCGTTTTC 
TTCTTAATTA 
AGCCAGCCAC 
AATCGTCCCC 
TGGGGTTCAA 
ACAATTCAGG 
ACACAATTGT 
GTTTAGCAGT 
AAAACATAGT 
ATGCAGGATT 
TTTATGCAAA 
TGAAGGTGGT 
TATCTGTTTG 
ATGGTCAGCC 
TCAAATGGCA 
TTCTGATCGG 
TAAGTCAGTC 
TTTTTACCTG 
ACAGGCTTTT 
TCTTGTCTGC 
TTTCAAGAAG 
TGCAAAGTGT 
TTTATTGTTT 
TTAAAAAAAA 



21 
I 

ATGCTTTACC 
GAGACAAGAA 
CACAATGAAA 
AAGTGTTTCC 
CTTGACGCTT- 
CAACAGGAGC 
CTTGCCGGTG 
GTGGATCTTC 
GGTTGCAGAC 
TGGACCTTGG 
CATGTATACT 
CAAGCCATTT 
TGTTTGGGTG 
AACAGAGGAC 
TACGGCAGTC 
ATGTTACATA 
AAGCCGAAAG 
CTTTCTACCA 
AGATGAATCT 
GTGTAATGTT 
GCTGTTCAAA 
GAGAAGATCG 
GTTGGAATCG 
AA - 



31 
I 

AGAAAATCCA 
ACCTGTTTCA 
GAAATCAAAC 
TGACACGCAT 
GCAAAATTAC" 
GACGGGCCAG 
CTTTATCTCA 
TTCCACATTA 
CTCATAATGA 
TACTTCAAGT 
TCCATCGTGT 
GGGGACTCTC 
ATCATGGCTG 
AATATCCATG 
ACCTATGTGA 
GCCATATCCA: 
CGAAAACATA 
TATCACTTGT 
GCACAAAAAA 
TGCCTGGATC" 
AAATCAAATA 
GAAGTTCGCA 
ATATGTACAA 



41 
I 

CTTCCCTGCC 
ACTTGAAGAC. 
CAGGAATAAC. 
CTTTGCTTAC 
CAAATAACGA 
GAAAGAACAC 
TTATATTTGT 
GGAATAAAAC 
CGCTGACATT 
TTATTCTCTG 
TCCTTGGGCT 
GGATGTACAG 
TTTTGTCTTT 
ACTGCTCAAA 
ACAGCTGCTT 
GGTACATCCA 
ACCAGAGCAT 
GCAGAATTCC 
TCCTATATTA 
CAATAATTTA 
TCAGAACCAG 
TATATTATGA 
AGTGTAAATA 



Seq ID NO: 565 Protein sequence 
Protein Accession ft: NP_076404 



1 
I 

MGFNLTLAKL 
GLAVWIFFHI 
FYANMYTSIV 
NGQPTEDNIH 
ISQSSRKRICH 
FLSACNVCLD. 



■ 1-1 . 

1 "■ : 

PNNELHGQES. 
RNKTSFIFYIi 
FLGI*ISIDRV 
DCSKLKSPLG 
NQSIRVVVAV 
PIIYFFMCRS 



21 

I - ' . 
HNSGNRSDGP 
KNIWADLIM 
•LKWKPFGDS 
VKWHTAVTYV 
FFTCFLPYHI* 
FSRRLFKKSN 



31 
I . 

GKNTTLHNEF 
TLTFPPRIVH 
RMYSITFTKV 
NSCLFVAVLV 
CRIPFTFSHL 
IRTRSESIRS 



41 
I 

DTIVLPVLYL 
DAGFGPWYFK 
LSVCVWVIMA 
ILIGCYIAIS 
DRLLDESAQK 
LQSVRRSEVR 



51 
I 

GACCTTAGTT 

ACCGTATGAG 

CTATGCTGAA 

AGTGCATCAC 

GCTGCACGGC 

CACCCTTCAC 

GGCAAGCATC 

CAGCTTCATA- 

TCCATTTCGA 

CAGATACACT 

GATAAGCATT 

CATAACCTTC 

GCCAAACATC- 

ACTTAAAAGT- 

GTTTGTGGCC 

CAAATCCAGC 

CAGGGTTGTT 

TTTTACTTTT 

CTGCAAAGAA 

CTTTTTCATG 

GAGTGAAAGC 

TTACACTGAT 

AATGTTTCTT 



51 
I 

IIFVASILLN 
FII^CRYTSVL 
VLSLPNIILT 
RYIHKSSRQF 
ILYYCKEITL 
IYYDYTDV 



Seq ID NO: 566 DNA sequence 

Nucleic Acid Accession- ft : NM_O0S365 . 1 

Coding sequence: 1..948 



ATGTCTCTCG 
GAGGACTTGG 
TCCTCTGACA 
CCTCAGGGAG 
GAGGGCTCCA 
GAGTTCATGT 



11 
I 

AGCAGAGGAG 
GCCTGATGGG 
GCAAGGAGGA 
GCGCTTCCTC 
GCAGTCAAGA 
TCCAAGAAGC 



21 
I 

TCCGCACTGC 
TGCACAGGAA 
GGAGGTGTCT 
CTCCATTTCC 
AGAGGAAGAG 
ACTGAAATTG 



31 

1 • 

AAGCCTGATG 
CCCACAGGCG 
GCTGCTGGGT 
GTCTACTACA 
CCAAGCTCCT 
AAGGTGGCTG 



41 
I 

AAGACCTTGA 
AGGAGGAGGA 
CATCAAGTCC 
CTTTATGGAG 
CGGTCGACCC 
AGTTGGTTCA 



51 



I 

AGCCCAAGGA 
GACTACCTCC 
TCCCCAGAGT 
CCAATTCGAT 
AGCTCAGCTG 
TTTCCTGCTC 



60 
120 
ISO 
240 
300 
360 
420 
-480 
540 
600 
660 
720 
780 
'840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



60 
120 
180 
240 
300 
360 
420 
480 
.540 
600 
660 
' 720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



> 60 
120 
180 
240 
300 



60 
120 
180 
- 240 
300 
360 



400 
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CACAAATATC GAGTCAAGGA GCCGGTCACA AAGGCAGAAA 
AATTACAAGC GCTACTTTCC- TGTGATCTTC GGCAAAGCCT 
TTTGGCACTG ATGTGAAGGA GGTGGACCCC GCCGGCCACT 
CTTGGCCTCT CGTGCGATAG CATGCTGGGT GATGGTCATA 
CTGATCATTG TCCTGGGTGT GATCCTAACC AAAGACAACT 
TGGGAAGCGT TGAGTGTGAT GGGGGTGTAT GTTGGGAAGG 
CCCAGGAAGC TGCTCACCCA AGATTGGGTG CAGGAAAACT 
CCCGGCAGTG ATCCTGCGCA CTACGAGTTC CTGTGGGGTT 
AGCTATGAGA" AGGTCAT AAA ■ TTATTTGGTC ATGCTCAATG 
CCATCCCTTT ATGAAGAGGT TTTGGGAGAG GAGCAAGAGG 



PCT/US02/12476 . 



TGCTGGAGAG 
CCGAGTTCAT 
CCTACATCCT 
GCATGCCCAA 
GCGCCCCTGA 
AGCACATGTT 
ACCTGGAGTA 
CCAAGGCCCA 
CAAGAGAGCC 
GAGTCTGA 



CGTCATCAAA 
GCAGGTGATC 
TGTCACTGCT 
GGCCGCCCTC 
AGAGGTTATC 
CTACGGGGAG- 
CCGGCAGGTG 
CGCTGAAACC 
CATCTGCTAC 



Seq ID NO; 567 Protein sequence 
Protein Accession S: NP_0 053 56.1 



1 

I. 

MSLEQRSPHC 
PQGGASSSIS 
KKYRVKEPVT 
LGLSCDSMLG 
PRKLLTQDWV 
PSLYEEVLGE 



11 



21 
I 



KPDEDLEAQG EDLGLMGAQE 
VYYTIiWSQPD EGSSSQEEEE 
KAEMLESVIK NYKRYFFVIF. 
DGHSMPKAAL LIIVLGVILT 
•QENYLEYRQV- PGSDPAHYEF 
EQEGV 



31 41 

I I 

PTGEEEETTS SSDSKEEEVS 
PSSSVDPAQL EFMFQEALKL 
GKASEFMQVI • FGTDVKEVDP 
KDNCAPEEVI WEALSVMGVY 
LWGSKAHAET SYEKVINYLV 



51 • 
I 

AAGSSSPPOS 
KVAELVHFLL 
AGHSYILVTA 
VGXEHMFYGE 
MLNAREPICY 



Seq ID NO: 568 DNA sequence 
Nucleic Acid Accession If: NM_014400 
Coding sequence: 86.. 1126 



1 
I 

GGTTACTCAT ' 
GACGCCAAGG 
GATCTGGACT 
GTGCTACAGC 
GAAGTGCGCG 
CGGACAATTC 
CGGCCTGGAT 
CTGCAACGCC 
ATACCCGCCC 
GGGTACATCG 
CTTCGACGGC 
CTGTGTCCAG 
TGGCTCCTGT. 
CCCTCGAATC 
CACATCTGTC 
GCCAGCGCCA 
GGAGCCCAGG 
TCCTGCAAAA 
ATTGGCAGCC 
AAATTTCCCT 
CCCACCACTG 
CTTCTGCTGC 
GGGTGTTCTA 
TCCTCTTGTG 
AGGATGCTAA 
GGTGGGACAA 
ATCGGTTCCC 
CTTATGTCTG 
TTGTATAGTG 



11 . 
I 

CCTGGGCTCA 
GAGCAGGACG 
GCAGGCTGGC 
TGCGTGCAGA ; 
CCGGGCGTGG 
TCGCTGGCAG 
CTTCACGGGC 
AAGCTCAACC 
AACGGCGTGG 
CCGCCGGTCG 
AACGTCACCT" 
GATGAATTCT. 
TGCCAGGGGT 
CX^CCCCTTG 
ACCACTTCTA 
ACCAGTCAGA 
TTGACTGGAG- 
GGGGGGCCCC 
CTTCTGTTGG 
CTCACCTACT 
GACTGGGCTG 
GCTGGTTTGC 
GCTTTTTGAG 
ATGTTAGGAC 
GCTTCCTACT 
TGGCTCCCCA 
CATATGTCTT 
TGTGTGATCA 
AAAAAAAA 



21 
I 

GGTAAGAGGG 
GAGCCATGGA 
TGCTGCTGCT 
AAGCAGATGA 
ACGTCTGCAC 
TGCSGGGTTG 
TTCTGGCGTT- 
TCACCTCGCG 
AGTGCTACAG. 
TGAGCTGCTA 
TGACGGCAGC 
GCACTCGGGA 
CCCGCTGTAA 
TCCGGCTGCC 
CCTCGGCCCC. 
CTCCGAGACA 
GCGCCGCTGG 
AGCAGCCCCA 
CCGTGGCTGC 
TCTCTGGCCC 
GCCCAGCCCC 
GGCTTTGGGA 
GACAGCTCCT 
AGAGTGAGAG 
CACTTTCTCC 
CTCTAAGCAC 
CCTTACTAGA 
GTTTCTGGCA 



31 
I 

CCCGAGCTCG 
CCCCGCCAGG. 
GCTGCTTCGC 
CGGATGCTCC 
CGAGGCCGTG 
CGGTTCGGGA 
CATCCAGCTG: 
GGCGCTCGAC 
CTGTGTGGGC 
CAACGCCAGC 
TAATGTGACT 
TGGAGTAACA 
CTCTGACCTC 
CCCTCCAGAG 
AGTGAGACCC 
GGGAGTAGAA 
CCACCAGGAC 
TAATAAAGGC 
TGGTGTCCTA 
TGGGTACCCC 
TGTTTTTCCA 
AATAAAATAC 
GTATCCTTCT 
AAGTCAGCTG 
TAGCCAGCCT 
TGCCTCCCCT. 
CTGTGAGCTC 
CATAAATGCC 



41 
I 

GAGGCGGCAC 
AAAGCAGGTG 
GGAGGAGCGC 
CCGAACAAGA 
GGGGCGGTGG. 
CTCCCCGGCA 
CAGCAATGCG 
CCGGCAGGTA 
CTGAGCCGGG 
GATCATGTCT 
GTGTCCTTGC 
GGCCCAGGGT 
CGCAACAAGA 
CCCACGACTG 
ACATCCACCA 
CACGAGGCCT 
CGCAGCAATT 
TGTGTGGCTC 
CTGTGAGCTT 
TCTTCTCATC 
ACATTCCCCA 
CGTTGTATAT 
CATCCTTGTC 
TCACGGGGAA 
GGACTTTGGA 
ACTCCCCGCA 
CTCGAGGGCA 
TCAATAAAGA 



51 
I 

ACCCAGGGGG 
CCCAGGCCAT 
AGGCCCTGGA 
TGAAGACAGT 
AGACCATCCA 
AGAATGACCG 
CTCAGGATCG- 
ATGAGAGTGC 
AGGCGTGCCA- 
ACAAGGGCTG 
CTGTCCGGGG 
TCACGCTCAG 
CCTACTTCTC 
TGGCCTCAAC 
CCAAACCCAT 
CCCGGGATGA 
CAGGGCAGTA 
CCACAGCTGG 
CTCCACCTGG 
ACTTCCTGTT 
GTATCCCCAG 
ATTCTGGCAG. 
TCTCCGCTTG 
GGTGAGAGAG 
GCGTGGGGTG 
TCTTTGGGGA 
GGGACCGTGC 
TTTAATTACT 



Seq ID NO: 569 Protein sequence 
Protein Accession- # : NP_055215 



I 

MDPARKAGAQ 
CTEAVGAVET 
SRALDPAGNE 
AANVTVSLPV 
LPPPEPTTVA 
AGHQDRSNSG 



11 

I 

AMIWTAGWLL 
IHGQFSLAVX 
SAYPPNGVEC 
RGCVQDEPCT 
STTSVTTSTS 
QYPAKGGPQQ 



21 
I 

LLLLRGGAQA 
GCGSGLPGKN 
YSCVGLSREA 
RDGVTGPGFT 
APVRPTSTTK 
PHNKGCVAPT 



31 

I 

LECYSCVQKA 
DRGLDLHGLL 
CQGTSPPWS 
LSGSCCQGSR 
PMPAPTSQTP 
AGLAALLLAV 



41 51 

I 1 

DDGCSPNKMK TVKCAPGVDV 
AFIQLQQCAQ DRCNAKLNLT 
CYNASDHVYK GCFDGNVTLT 
CNSDLRNKTY FSPRIPPLVR 
RQGVEHEASR DEEPRLTGGA 
AAGVLL 



Seq ID NO: 570 DNA- sequence 

Nucleic Acid Accession fh NM_005329.1 

Coding sequence : 1 . . 1662 



ATGCCGGTGC 
GTGCTGGGTG 
CACTACCTGT 
CTTTTTGCCT 
TCCCCGCGGC 
TTGCGCAAGT' 
GTGGTGGATG 
GGCGGCACCG 
GGTGAGACGG 
AGCACCTTCT 



11 
I 

AGCTGACGAC 
GCATCCTGGC 
CCTTCGGCCT 
TCCTGGAGCA 
GGGGCTCGGT 
GCCTGCGCTC 
GCAACCGCCA 
AGCAGGCCGG 
AGGCCAGCCT 
CGTGCATCAT 



21 
I 

AGCCCTGCGT 
AGCCTATGTG 
GTACGGCGCC 
CCGGCGCATG 
GGCACTGTGC 
GGCCCAGCGC 
GGAGGACGCC 
CTTCTTTGTG 
GCAGGAGGGC 
GCAGAAGTGG 



31 
I 

GTGGTGGGCA 
ACGGGCTACC 
ATCCTGGGCC 
CGACGTGCCG 
ATTGCCGCGT 
ATCTCCTTCC 
TACATGCTGG 
TGGCGCAGCA 
ATGGACCGTG 
GGAGGCAAGC 



41 

I 

CCAGCCTGTT 
AGTTCATCCA 
TGCACCTGCT 
GCCAGGCCCT 
ACCAGGAGGA 
CTGACCTCAA 
ACATCTTCCA 
ACTTCCATGA 
TGCGGGATGT 
GCGAGGTCAT 



51 
I 

TGCCCTGGCA 
CACGGAAAAG 
CATTCAGAGC 
GAAGCTGCCC 
CCCTGACTAC 
GGTGGTCATG 
CGAGGTGCTG 
GGCAGGCGAG 
GGTGCGGGCC 
GTACACGGCC 



420 
.480 
540 
600 
660 
720 
780 
840 
900 • 



• 60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
•900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
4 80 
540 
600 
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TTCAAGGCCC TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCGACTCTGA CACTGTGCTG 660 

GATCCAGCCT GCACCATCGA GATGCTTCGA GTCCTGGAGG AGGATCCCCA AGTAGGGGGA ■ 720 
GTCGGGGGAG ATGTCCAGAT CCTCAAOAAG. TACGACTCAT GGATTTCCTT CCTGAGCAGC.' 780 

GTGCGGTACt' GGATGGCCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT TGGCTGTGTG - 840 
CAGTGTATTA GTGGGCCCTT- GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG . 900 

GACTGGTACC ATCAGAAGTT - CCTAGGCAGC AAGTGCAGCT TCGGGGATGA CCGGCACCTC 960 

ACCAACCGAG ■ TCCTGAGCCT TGGCTACCGA : ACTAAGTATA CCGCGCGCTC CAAGTGCCTC .1020 

ACAGAGACCC CCACTAAGTA CCTCCGGTGG CTCAACCAGC AAACCCGCTG " GAGCAAGTCT 1080 

TACTTCCGGG AGTGGCTCTA CAACTCTCTG. TGGTTCCATA- AGCACCACCT CTGGATGACC 1140 

TACGAGTCAG TGGTCACGGG TTTCTTCCCC TTCTTCCTCA TTGCCACGGT TATACAGCTT 1200 

TTCTACCGGG GCCGCATCTG GAACATTCTC CTCTTCCTGC TGACGGTGCA GCTGGTGGGC 1260 

ATTATCAAGG CCACCTACGC. CTGCTTCCTT CGGGGCAATG CAGAGATGAT CTTCATGTCC 1320 

CTCTACTCCC TCCTCTATAT GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTGCTACC 1380 

ATCAACAAAT CTGGCTGGGG CACCTCTGGC CGAAAAAGCA TTGTGGTGAA" CTTCATTGGC "1440 

CTCATTCCTG TGTCCATCTG GGTGGCAGTT CTCCTGGGAG GGCTGGCCTA: CACAGCTTAT 1500 

TGCCAGGACC TGTTCAGTGA GACAGAGCTA GCCTTCCTTG TCTCTGGGGC TATACTGTAT 1560 

GGCTGCTACT GGGTGGCCCT CCTCATGCTA TATCTGGCCA TCATCGCCCG GCGATGTGGG 1620 
AAGAAGCCGG AGCAGTACAG . CTTGGCTTTT GCTGAGGTGT GA 

Seq ID NO: 511 Protein sequence ' 
Protein Accession #: NP_005320.1 

1 11 21 31 41 51 

| I I I I I 

MPVQLTTALR WGTSLFALA VLGGILAAYV TGYQFIHTEK HYLSFGLYGA I LGLHLL I QS 60 

LFAFLEHRRM: RRAGQALKLP SPRRGSVALC IAAYQEDPDY - LRKCLRSAQR - ISFPDUCWM- 120 

WDGNRQEDA YMLDIFHEVL .GGTEQAGFFV WRSNFHEAGE GBTEASLQEG MDRVRDWRA 180 

STFSCIMQKM GGKREVMYTA FKALGDSVDY- IQVCDSDTVL DPACTIEMLR VIiEEDPQVGG ' 240 

VGGDVQILNK YDSWISFLSSr' VRYWMAFNVE RACQSYFGCV. QCISGPLGMY, RNSULQQFLE 300 

DWYHQKFLGS KCSFGDDRHL TNRVLSLGYR TKYTARSKCL TETPTKYLRW. LNQQTRWSKS 360 

YFREWLYNSIi WFHKHHLWMT, YESWTGFFP FFLIATVIQI»: FYRGRIWNIL LFLLTVQLVG- 420 

IIKATYACFI* RGNAEMIFMS. LYSLLYMSSL liPAKIFAIAT INKSGWGTSG RKTIWNFIG 480 

LIPVSIWVAV LLGGLAYTAY CQDLFSETEL AFLVSGAILY GCYWVALLML YLAIIARRCG 540 
KKPEQYSLAF -AEV 

Seq ID NO:; 572 DNA sequence 

Nucleic Acid Accession Eos sequence 

Coding sequence: 148-7095 

l" 11 21 31 41 SI 

I ' | | I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG" GTCCTATACA GGAGCACTGA ATCAAAAAAA" TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 3 60 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT ' 490 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT- AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT- TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA" AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT - . 780 

TTAGATCCAT TCATACTGTT, GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 " 

AATGGCTCAT- TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT- TTTTAAAGAT. 900 

ACAGTTAGCA TCTCTGAAAG- CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG: TCATG CTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATT CATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAG ■ TTGATCTTTT- CCCTGAATTA" ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT- GACTT CTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAG A ATCCTTAAAT- ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CT'TATACCAG AATCTGCTAG AAATGCTTCC -2040 

GAAGATTCAA" CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT.. GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA'- CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT . TCCTCTAGTC 2460 

ACCCCTTTGT TGCTTGACAA" TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 

TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580 

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 
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AGCCTTGCTC AGTATTCTGA 'TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 

TTTGGTAGTG • AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 . 

AGCAGTGATG CCATGATGCA TGCACGTTCT" TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2 940 

GATAATGAGG GCTCCCAACA ■ CATCTTCAGT GTTTCTTACA GTTCTGCAAT. ACCTGTGCAT 3000 

GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 . 

TCTGTGTTTG GTGATGATAA - TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG - 3300. 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420 

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTT7 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG. TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT- CTGGTGACAC TTCGCTTAAA" CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 

TCCTCTGACC CTGCTTCTAG - TGAAATGTTA* TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 

ACCTCAGCTT- CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GACACCTTGC TTAAAACTGT- TCTTCGAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA- 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 

GTTTTGTTAA- AAAGTGAAAG. TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA - 4140 

CATTCGGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT. ATTTGCTGGT 4200 

ATTCCAACAG. TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG: 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG. . CTGAGTCATA' GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG r GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 

AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

' CAGGAAAAGG TAATGAATGA TTCAGACACC CACG AAAACA GTCTTATGGA TCAGAATAAT 4560 

CCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 

TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 

CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGGATCA 4800 

GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4860 

GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 

GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 

TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220 

ATTTCAGATG ATGTCGGAGC " AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280- 

CATGCAAGTA GTGGGTTT AC • TG AAGAATTT GAGACACTGA- AAGAGTTTTA CCAGGAAGTG 5340 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 5400 

CACAAGAATC G ATA CAT AAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC . 5520 

AACAGACCAA AAGCTTATAT • TGCTGCCCAA : GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 

TGGAGAATGA TATGGGAACA TAATGTGGAA'-GTTATTGTCA TGATAACAAA CCTCGTGGAG' 5640 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 5700 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAAGA" CAAAAATAAA AAAGGGCTCC- CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 5820 

ACACAGTATC ACTACACX5CA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880. 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC. ACATATATTG TGCTAGACAG TATGTTGCAG. 6000 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AGAAATT AT T TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120. 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG " GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA- AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 

CAGTCAAATA • TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360 

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA GGCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480 

GACCATAATG CCCAACTGGT GGTTATGATT. • CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG ■ AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCCGCAT TAGTAAAACT. TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 

• GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GATCCTCAGC ' CTTGTGAGCA CAAGGCAGGA AGAGAATCCA- 7020. 

TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA- 7080. 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG' GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 7200 

GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 7260 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320 

- TATTTCTAAG AATGGAATTG TGGTATTTTT^ TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380 

TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 

• GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560 

- AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 

TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7680 

GTTfAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 

TTGTGTT ACC ' TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800 

GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA ,7860 

AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA AAAA 
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Seq ID NO; 573 Protein' sequence: 

Protein Accessioa g:..Eos sequence - 

5 1 - 11 21 31 41 51 

1 | - I I I.I 

MRILKRFIAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 
QSPINIDEDL' TQVNVNLK^KL.. KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 
FKASKITFHW. GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

10 ILFEVGTEEN " LDFKAI IDGV; ESVSRFGKQA ALDPFILLNL LPNSTDXYYI YNGSLTSPPC 240 
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY. KFSRQVFSSY 300 
' TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY- C3QLDGEDQTK-- 360 
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY, SDQLIVDMPT DNPELDLFPE • . 420 
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA- TNQIRKKEPQ ISTTTHYNRI- GTKYNEAKTN - 480 

15 RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT- VEGTSASLND ' 540 
GSKTVLRSPH MNLSGTAESL NTVSITEYEE- ESLLTSFKLD. TGAEDSSGSS PATSAIPFIS 600 
ENISQGYIFSi SENPETITYD VLIPESARNA- SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI; 660. 
TAQPDVGSGR- ESFLQTNYTE IRVDESEKTT. KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 
TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780 

20 LNTTPAASSS DSALHATPVF PSVDVSFESr LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ- 840 
ILPQVTSATE SDKVPLHASL PVAGGDLLLE - PSLAQYSDVL. STTHAASETL EFGSESGVLY 900 
KTLMFSQVEP PSSDAMMHAR: SSGPBPSYAL SDNEGSQHIF: TVSYSSAIPV: HDSVGVTYQG: 960 
SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS- SDSEFLLPDT' DGLTALNISS 1020 
PVSVAEFTYT-" TSVFGDDNKA" LSKSEIIYGNi ETELQIPSFN- EMVYPSESTV. MPNMYDNVNX 1080 

25 LNASLQETSV-SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT- VSQASGDTSL." 1140' 
KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV-- LLQPSFQASD. VDTLLKTVLP 1200 
AVPSDPILVE TPKVDKISST MLHLIVSNSA' SSENMLHSTS VPVFDVSPTS HMHSASLQGL • 12 60 
TISYASEKYE PVLLKSESSH QWPSLYSND- BLFQTANLEL NQAHPPKGRH: VFATPVLSID 1320 
EPLNTLINKL IESDEILTST KSSVTGKVFA GIPTVASDTFi VSTDHSVPIG NGHVAITAVS 1380 

30 PHRDGSVTST KLLFPSKATS. ELSHSAKSDA- GLVGGGEDGD.TDDDGDDDDD DRGSDGLSIH 1440 
KCMSCSSYRE SQEKVMNDSD' THENSLMDQN; NPISYSLSEN: SEEDNRVTSV SSDSQTGMDR - 1500 
SPGKSPSANG LSQKHNDGKE ENDIQTGSAL. LPLSPESKAW AVLTSDEESG SGQGTSDSLNh 1560 
ENETSTDFSF ADTNEKDADG ' ILAAGDSEIT PGFPQSPTSS, VTSENSEVFH VSEAEASNSS; 1620. 
HESRIGLAEG. LESEKKAVIP LVIVSALTFI ' CLWLVGILI YWRKCFQTAH: FYLEDSTSPR" 1680 

35 " VISTPPTPIF PISDDVGAIP- IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI . 1740 
TADSSNHPDN. KHKNRYINIV AYDHSRVKLA QLAEKDGKLT- DYINANYVDG YNRPKAYIAA 1800 
QGPLKSTAED FWRMIWEHNV EVIVMITNLV- EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ- 1860 
VLAYYTVRNF TLRNTKIKKG SQKGRPSGRV VTQYHYTQWP; DMGVPEYSLP. VLTFVRKAAYr 192 0 : 
AKRHAVGPW VHCSAGVGRT GTYIVLDSML QQIQHEGTVN- IFGFLKHIRS QRNYLVQTEE- 1980 

40 QYVFIHDTLV EAILSKETEV LDSHIHAYVN'. ALLIPGPAGK ; TKLEKQFQLL SQSNIQQSDY' 2040 
SAALKQCNRE KNRTS S 1 1 PV ERSRVGISSL SGEGTDYINA SYIMGYYQSN EFIITQHPLL 2100 
HTIKDFWRMI- WDHNAQLWM IPDGQNMAED EFVYWPNKDE PINCESFKVT LMAEEHKCLS 2160 
NEEKLIIQDF ILEATQDDYV; LEVRHFQCPK WPNPDSPISK TFELISVIKE EAANRDGPMI 2220 
VHDEHGGVTA GTFCALTTLM HQLEKENSVP" VYQVAKMINL MRPGVFADIE QYQFLYKVIL 2280 

45 SLVSTRQEEN PSTSLDSNGA ALPDGNIAES LESLV 
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Seq ID NO: 574 DNA sequence 

Nucleic Acid Accession JH Eos sequence 

Coding sequence: 148-4518 



1 11 21 31 41 51 . . 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

55 CGGCGAGGGG CCGCAGACCG TCTGGAAATG" CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG- CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT- GGGATAAAAC ATCATTGGAA 420 

'60 AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT' 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA" GTTTTGAGGA AGCAGTCAAA -660 

• GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG- TTGGGACAGA AGAAAATTTG 720 

65 GATTTCAAAG " CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT. 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG". CCAAACTCAA- CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

70 TTCTCTAGAC AGGTGTTTTC CTCATACACT, GGAAAGGAAG' AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT. TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA' AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

75 TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTG AT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT- CAAGGAGGAG' 1440 

GAAGAGGGAA AAGACATTGA . AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

80 AAGGGTGATG TTCCCAATAC ATCTTTAAAT.- TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 - 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT. AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 192 0 

85 ' 1 GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 



404 



WO 02/086443 

GTCTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCC ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATT ATT CT A CCTTTGCCTA CTTCCCAACT GAGGTAACAC. CTCATGCTTT TACCCCATCC 2.340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AGCTGAGGGG 2460 

• TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520 

TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA. AATGCTTCCA GACTGCACAC 2580 

TTTTACTTAG AGGACAGTAC ATCCCCTAGA' GTTATATCCA- CACCTCCAAC ACCTATCTTT 2640 

CCAATTT CAG ATGATGTCGG AGCAATTCCA • ATAAAGCACT TTCCAAAGCA" TGTTGCAGAT 2700 

TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760 

- GTGCAGAGCTv GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820 

AAGCACAAGa' ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA- 2880 

CAGCTTGCTG AAAAGGATGG CAAACTGACT- GATTATATCA ATGCCAATTA TGTTGATGGC 2940 

TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT- 3000 

' TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTC 3060 

GAGAAAGGAA GGAGAAAATG TGATCAGTAC- TGGCCTGCCG ATGGGAGTGA GGAGTACGGG-- 3120 

AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT, 3180 

ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGACGTGTG; 3240 

GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 

GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC: 3360 

GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG^ 3420 

CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA^ 3480 

CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT- TCATTCATGA ™CACTCGTT 3540 

GAGGCCATAC TTAGTAAAGA AACTGAGGTG1 CTGGACAGTC ATATTCATGC CTATGTTAAT 3600 

GCACTCCTCA TTCCTGGACC AGCAGGCAAA 1 ACAAAGCTAG. AGAAACAATT CCAGCTCCTG 3660 

AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG ^CAGGGAA 3720 

AAGAATCGAA CTTCTTCTAT. CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 

AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840 

GAATTCATCA- TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900 

TGGGACCATA' ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3960 

• GAATTTGTTT ACTGG CCAAA- TAAAGATGAG CCTATAAATT GTG AGAGCTT- TAAGGTCACT 4020 
CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 

ATCTTAGAAG CTACACAGGA" tgattatgta cttgaagtga- ggcactttca GTGTCCTAAA 

TGGCCAAATC CAGATAGCCC CATTAGTAAA' ACTTTTG AAC TTATAAGTGT TATAAAAGAA- 

GAAGCTGCCA ATAGGGATGG: GCCTATGATT- GTTCATGATG AGCATGGAGG AGTGACGGCA 4260 

GGAACTTTCT. GTGCTCTGAC- AACCCTTATG CACCAACTAG AAAAAGAAAA. TTCCGTGGAT 4320 

GTTTACCAGG TAGCCAAGAT^GATCAATCTG ATGAGGCCAG GAGTCTTTGC. TGACATTGAG 4380 

CAGTATCAGT TT CTCT ACAA ' AGTGATCCTC AGCCTTGTGA 1 GCACAAGGCA GGAAGAGAAT 4440 

CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG S ATGGAAATAT- AGCTGAGAGC 4500 

TTAGAGTCTT TAGTTTAACA 1 CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT- CTAGTTCTGT- TATCTGTTGA TTTCCCATCA- 4620 

SgIcaGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680 

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATCGAA TTGTGGTATT- TTTTTCTGTA' TTGATTTTAA CAGAAAATTT- 4800 

CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA- 4980 

ACTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG ^CCAAATTT. 5040 

ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA CTTCTGTGTA 5100 

ATTGTTTAGT TTAATGACGT. AGTTCATT AG CTGGTCTTAC TCTACCAGTT ^GACATT 5160 

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220 
ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT ■ 5280 

TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 575 Protein sequence: 
Protein Accession 8: Eos sequence 

1 . 11 21 31 41 51 

MRIUOlFIiAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL EKTFIHNTGK TVEINLTNDY RVSGGVS2MV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YKGSLTSPPC , 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS' PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

■TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNAEASNS SHESRIGLAE GLESEKKAVI 780 

PLVIVSALTF ICLWLVGIL IYWRKCFQTA HFYLEDSTSP RVISTPPTPI FPISDDVGAI 840 

PIKHFPKHVA DLHASSGFTE EFETLKEFYQ EVQSCTVDLG ITADSSNHPD NKKKNRYINI 900 

VAYDHSRVKI* AQLAEKDGKL TDYIKANYVD GYNRPKAYIA AQGPLKSTAE DFWRMIWEHN 960 

VEVIVMITNL VEKGRRKCDQ YWPADGSEEY. GNFLVTQKSV QVLAYYTVRN FTLRNTKIKX 1020 

GSQKGRPSGR WTQYHYTQW PDKGVPEYSL PVLTFVRKAA YAKRHAVGPV WHCSAGVGR 1080 

TGTYIVLDSM LQQIQHEGTV NIFGFLKHIR SQRNYLVQTB EQYVFIHDTL VEAILSKETE 1140 

VLDSHIHAYV NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200 

VERSRVGISS LSGEGTDYIN ASYIMGYYQS KEF1 ITQHPL LHTIKDFWRM IWDHNAQLVV 1260 

MIPDGQNMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLI IQD FILEATQDDY 1320 

^EVrSfQCP KWPNPDSPIS KTFELISVIK EEAANRDGPM IVHDEHGGVT AGTFCALTTL 1380 

MHQLEKENSV DVYQVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG 1440 
AALPDGNIAE SLESLV 



4080 
4140 
4200 
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Seq ID NO: 576 DNA.sequence 

Nucleic Acid Accession 8: BOS sequence 

Coding sequence: 148-4494 

1 11 r j: 1 i 1 r 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 1B0 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT: GGGATAAAAC ATCATTGGAA - 420 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA -ATCTCACTAA- TGACTACCGT' 480 
GTCAGCGGAG" GAGTTTCAGA AATGGTGTTT AAAGCAAGCA • AGATAACTTT- TCACTGGGGA 540 
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG A AGGA CAAAA ATTTCCACTT 600 
GAGATGCAAA TCTACTGCTT TGATGCAGAC CGATTTTCAA- GTTTTGAGGA AGCAGTCAAA 660 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 
GATTTCAAAG* CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT .780 
TTAGATCCAT TCATACTGTT GAACCTTCTG 7 CCAAACTCAA* CTGACAAGTA TTACATTTAC 840 
AATGGCTCATV TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 
ACAGTTAGCA. TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT; TTCGAGAGCA ACAGTACAAG 1020 
TTCTCTAGAC AGGTGTTTTC . CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT' 1080 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT^ ATACCAGCCT- TCTTGTTACA' 1140 
TGGGAAAGAC" ctcgagtcgt TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG- 1260. 
GGTGCTATTC TCAATAATTT. GCTACCCAAT ATGAGTTATG- TTCTTCAGAT AGTAGCCATA 1320 
TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTG AT . • 1380 
AATCCTGAAC TTGATCTTTT- CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG -* 1440 
GAAGAGGGAA* AAGACATTGA- AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA- 1500 
AACCAAATCA GGAAAAAGGA- ACCCCAGATT- TCTACCACAA CACACTACAA TCGCATAGGG 1560 
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA- 1620- 
AAGGGTGATG- TTCCCAATAC ATCTTTAAAT TCCACTTCCC. AACCAGTCAC TAAATTAGCC 1680 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG " 1740 
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG I860 
AGTTTATTGA CCAGTTTCAA- GCTTGATACT GGAGCTGAAG ATTCTTCAGG. CTCCAGTCCC 1920 
GCAACTTCTG. CTATCCCATT- CATCTCTGAG- AACATATCCC AAGGGTATAT- ATTTTCCTCC" 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CT7ATACCAG AATCTGCTAG AAATGCTTCC 2040 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG* AATCTGAGAA GACAAGCAAG. 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG - TTACAGATCT GGAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT. GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT- ACTCGCAGAC AACCCAACCG 2400 
GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 
GAATCCGAGA' AGAAGGCAGT- TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT' 2560 
TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 
ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 
CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 
GGTATTACAG CAGACAGCTC. CAACCACCCA GACAACAAGC ACAAGAATCG ATACATAAAT: 2820 
ATCGTTGCCT- ATGATCATAG CAGGGTTAAG CTAGCACAGC TTGCTGAAAA GGATGGCAAA- 2880 
CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA ACAGACCAAA AGCTTATATT 2940 
GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT GGAGAATGAT ATGGGAACAT 3000 
AATGTGGAAG TTATTGTCAT. GATAACAAAC CTCGTGGAGA' AAGGAAGGAG AAAATGTGAT 3060 
CAGTACTGGC CTGCCGATGG; GAGTGAGGAG TACGGGAACT TTCTGGTCAC TCAGAAGAGT 3120 
GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC TAAGAAACAC AAAAATAAAA 3180 
AAGGGCTCCC AGAAAGGAAG- ACCCAGTGGA CGTGTGGTCA CACAGTATCA CTACACGCAG- 3240 
TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC TGACCTTTGT GAGAAAGGCA 3 00 
GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC ACTGCAGTGC TGGAGTTGGA ■ 3360 
AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC AGATTCAACA CGAAGGAACT 3420 
GTCAACATAT TTGGCTTCTT- AAAACACATC CGTTCACAAA GAAATTATTT GGTACAAACT 3480 
GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG CCATACTTAG TAAAGAAACT 3540 
GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA 3600 
GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT. 3660 
GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA ATCGAACTTC TTCTATCATC 3720 
CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG GAGAAGGCAC AGACTACATC 3780 
AATGCCTCCT. ATATCATGGG CTATTACCAG AGCAATGAAT TCATCATTAC CCAGCACCCT 3840 
CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG ACCATAATGC CCAACTGGTG 3900 
GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT TTGTTTACTG GCCAAATAAA' 3960 
GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA • TGGCTGAAGA ACACAAATGT 4020 
CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT TAGAAGCTAC ACAGGATGAT. 4080 
TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCT AAATGGC CAAATCCAGA TAGCCCCATT 4140 
AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG CTGCCAATAG GGATGGGCCT 4200 
ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA CTTTCTGTGC TCTGACAACC 4260 
CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT ACCAGGTAGC fAAGATGATC 4320 
AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT ATCAGTTTCT CTACAAAGTG, 4380 
ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT CCACCTCTCT GGACAGTAAT. 4440 
GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG AGTCTTTAGT TTAACACAGA 4500. 
AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT TCCTAAAATT .^GCAGGAAA 4560 
ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG ACAGTAACTT TCATGACATA 4620 
GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC TTTTTGCAAG AOTGTAATT 4680 
TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT. 4740 
GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT TTATAGAGGT TAGGAATTCC 4800 
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AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG CTGTATTTGT AGCAATTATC 4860 

AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA" AATAAAACAC TCTTCCATAT 4920 

GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA GAAATAATCT GTTACTTATT 4980 

GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT TTATAATTGT AGATTTTTAT 5040 

ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG TTTAGTTTAA TGACGTAGTT 5100 

CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT TGTGTTACCX A AGTCA TTAA 5160 

CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG AAATACCTTC ATTTTGAAAG 5220 

AA GTTTTT AT GAGAATAACA CCTTACCAAA CATTGTTCAA ATGGTTTTTA TCCAAGGAAT 5280 

TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 5340 
AAA 

Seq ID NO: 577 Protein sequence: * 
Protein Accession 8: EOS sequence 

1 11 21 ■ 31 41 51 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPX 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASXITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNIi LPNSTDKYYI- YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LGNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA" DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

DIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI- GTKYNEAKTN' 480 

RSPTRGSEFS GKGDVPNTSI* NSTSQPVTKL ATEKDISLTS. QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESI* NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG- NVWFPSSTDI - 660 

TAQPDVGSGR- ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM. PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780 

LVIVSALTFI CLWLVGILi;: YWRKCFQTAH FYLEDSTSPR- VISTPPTPIF PISDDVGAIP 840 

' IKHFPXHVAD LHASSGFTEE FEEVQSCTVD LGITADSSNH PDNKHKNRYI" NIVAYDHSRV . 900 

KLAQLAEKDG KLTDYINANY- VDGYNRPKAY IAAQGPLKST AEDFWRMIWB HNVEVIVMIT 960 

NLVEKGRRKC DQYWPADGSE EYGNFLVTQK SVQVLAYYTV RNFTLRNTKI KKGSQKGRPS 1020 

GRWTQYHYT QWPDMGVPEY- SLPVLTFVRK AAYAKRHAVG PVWHCSAGV. GRTGTYIVLD 1080 

SHLQQIQHEG TVNIFGFLXH- IRSQRNYLVQ. TEEQYVFIHD TLVEAILSKE TEVLDSHIHA 1140 

YVNALfcIPGP : AGKTKLEKQF. QLLSQSNIQQ: SDYSAALKQC NREKNRTSS I I PVERSRVGI 1200 

SSLSGEGTDY INASYIMGYY' QSNEFIITQH PLLHTIKDFW RMIWDHNAQL WMIPDGQNM 1260 

AEDEFVYWPN KDEPINCESF KVTI^IAEEHK- CLSNEEKLII QDFILEATQD DYVLEVRHFQ 1320 

CPKWPNPDSP' ISKTFELISV- IKEEAANRDG PMIVHDEHGG VTAGTFCALT TLMHQLEKEN 1380 

SVDVYQVAiOvi^ INLMRPGVFA DIEQYQFIiYK VILSLVSTRQ EENPSTSLDS KGAALPDGNI 1440 
AESLESLV 



Seq ID NO: 578 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 501-4514 

1 11 21 31 41 51 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA. 240 

CTTGTTGAAG AGATTGGCTG. GTCCTATACA- GGAGCACTGA" ATCAAAAAAT TGGGGAAAGA 300 

AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 360 

AAGTAAATGT GAATCTTAAG AAACTTAAAT : TTCAGGGTTG GGATAAAACA " TCATTGGAAA- 420- 

ACACATTCAT TCATAACACT GGGAAAACAG TGGAAATTAA TCTCACTAAT- GACTACCGTG 480 

TCAGCGGAGG AGTTTCAGAA" ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 540 

AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGACAAAAA TTTCCACTTG 600 

AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG TTTTGAGGAA GCAGTCAAAG 660 

GAAAAGGGAA GTTAAGAGCT TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATTTGG 720 

ATTTCAAAGC GATTATTGAT GGAGTCGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCTT 780 

TAGATCCATT CATACTGTTG AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 840 

ATGGCTCATT GACATCTCCT- CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGATA 900 

CAGTTAGCAT CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 960 

• CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAGTACAAGT 1020 

TCTCTAGACA- GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA GCAGTTTGTA 1080 

GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 1140 

GGGAAAGACC TCGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC 1200 

AGTTGGATGG AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG 1260 

GTGCTATTCT CAATAATTTG- CTACCCAATA TGAGTTATGT, TCTTCAGATA GTAGCCATAT 1320 

GCACTAATGG CTTATATGGA AAATACAGCG ACCAACTGAT TGTCGACATG CCTACTGATA 1380 

ATCCTGAACT TGATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG 1440 

AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500 

ACCAAATCAG. GAAAAAGGAA' CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA .1560 

CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620 

AGGGTGATGT TCCCAATACA' TCTTTAAATT CCACTTCCCA ACCAGTCACT AAATTAGCCA 1680 

CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740 

AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800 

ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860 

GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTCCCG 1920 

CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCG 1980 

AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG 2040 

AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAATG 2100 

TGTGGTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GGCAGAGAGA 2160 

GCTTTCTCCA GACTAATTAC " ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 2220 

CCTTTTCTGC AGGCCCAGTG ATGTCACAGG GTCCCTCAGT TACAGATCTG GAAATGCCAC 2280 
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ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 2340 

CCAGACAACA GGATTTCGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA ACCCAACCGG ? 2400 

TATACAATGA GGCCAGTAAT AGTAGCCATG AGTCTCGTAT TGGTCTAGCT GAGGGGTTGG 2460 

AATCCGAGAA GAAGGCAGTT ATACCCCTTG TGATCGTGTC AGCCCTGACT TTTATCTGTC 2520 

TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAGACT GCACACTTTT 25B0 

ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2640 

TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 2700 

ATGCAAGTAG TGGGTTTACT- GAAGAATTTG" AGACACTGAA AGAGTTTTAC CAGGAAGTGC 2 7 GO 

AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820 

ACAAGAATCG ATACATAAAT ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 2880 

TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 2940 

ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000 

GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA . 3060 

AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT 3120 

TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 3180 

TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 3240 

CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300 

TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 3360 

ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 3420 

AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 3480 

GAAATTATTT- GGTACAAACT. GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540 

CCAT ACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 3600 

TCCTCATTCC " TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 3660 

AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720 

ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG 3780 

GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT. 3840 

TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900 

ACCATAATGC CCAACTGGTG" GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 3960 

TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020 

TGGCTGAAGA ACACAAATGT CTATCTAATG' AGGAAAAACT TATAATTCAG GACTTTATCT 4080 

TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA'. CTTTCAGTGT CCTAAATGGC 4140 

CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200 

CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA' TGGAGGAGTG ACGGCAGGAA 4260 

CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT 4320 

• ACCAGGTAGC CAAGATGATC AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT 4380 

ATCAGTTTCT CTACAAAGTG. ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 4440 

CCACCTCTCT GGACAGTAAT GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG 4500 

AGTCTTTAGT TTAACACAGA . AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT 4560 

TCCTAAAATT AGGCAGGAAA ATCAGTCTAG- TTCTGTTATC TGTTGATTTC CCATCACCTG 4620 

ACAGTAACTT TCATGACATA GGATTCTGCC GCCAAATTTA' TATCATTAAC AATGTGTGCC , 4680 

TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT 4740 

ATTTCTAAGA ATGGAATTGT GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 4800 

TTATAGAGGT- TAGGAATTCC AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 4860 

CTGTATTTGT AGCAATTATC AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 4920 

AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 4980 

GAAATAATCT GTTACTTATT GTAAAT ACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 5040 

TTATAATTGT AGATTTTTAT. ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 5100 

TTTAGTTTAA TGACGTAGTT CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT 5160 

TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 5220 

AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 52 BO 

ATGGTTTTTA TCCAAGGAAT TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA" AAA 

Seq ID NO: 579 Protein sequence: 
Protein Accession ft: EOS sequence 

1, . . 11 21 31 41 51 

MVFKASKITF HWGKCNMSSD GSEHSLEGQK FPLEMQIYCP DADRFSSPEE AVKGKGKLRA 60 

LSILFEVGTE ENLDFXAIID GVESVSRFGK QAALDPFILL NLLPNSTDKY YIYNGSLTSP — 
PCTDTVDWIV FKDTVSISES QLAVFCEVLT MQQSGYVMLM DYXQNNFREQ QYKFSRQVFS 
SYTGKEE1HE AVCSSEPENV QADPENYTSL LVTWERPRW YDTMIEKFAV LYQQLDGEDQ 
TKHEFLTDGY QDLGAILNNL LPNMSYVLQI VAICTNGLYG KYSDQLIVDM PTDNPELDLF 
PELIGTEEII KEEEEGKDIE EGAIVNPGRD SATNQIRKKE PQISTTTHYN RIGTKYNEAK 
TNRSPTRGSE FSGKGDVPNT SLNSTSQPVT- KLATEKDISIr TSQTVTSLPP HTVEGTSASI* 
NDGSKTVLRS- PHMNLSGTAE SUJTVSITEY EEESLLTSFK LDTGAEDSSG SSPATSAIPF 
ISENISQGYI FSSENPETIT YDVLIPESAR NASEDSTSSG SEESLKDPSM EGNVWFPSST 
DITAQPDVGS GRESFLQTNY-- TEIRVDESEK TTKSFSAGPV MSQGPSVTDL EMPHYSTFAY 
FPTEVTPHAF TPSSRQQDLV STVNWYSQT TQPVYNEASN SSHESRIGLA EGLESEKKAV 
IPLVIVSALT FICLWLVGI 1 LIYWRKCFQT AHFYLEDSTS PRVISTPPTP IFPISDDVGA 

IPIKHFPKHV. ADLHASSGFT EEFETLKEFY QEVQSCTVDL GITADSSNHP DNKHKNRYIN 780 

IVAYDHSRVK LAQLAEKDGK LTDYINANYV DGYNRPKAYI AAQGPLKSTA EDFWRMIHEH 840 

NVEVIVMITN LVEKGRRKCD QYWPADGSEE YGNFLVTQKS VQVLAYYTVR NFTLRNTKIK 900 

KGSQKGRPSG RWTQYHYTQ WPDMGVPBYS LPVLTFVRKA AYAKRHAVGP VWHCSAGVG 960 

RTGTYIVLDS MLQQIQHEGT VNIFGFLKHI RSQRNYLVQT .EEQYVFIHDT LVEAILSKET 1020 

EVLDSHIHAY VNALLIPGPA' GKTKLEKQFQ LLSQSNIQQS DYSAALKQCN REKNRTSSII 1080 

PVERSRVGIS SLSGEGTDYI NASYIMGYYQ SNEFIITQHP LLHTIKDFWR MIWDHNAQLV 1140 

VMIPDGQNMA EDEFVYWPNK DEPINCESFK VTLMAEEHKC LSNEEKLIIQ DFILEATQDD 1200 

YVLEVRHFQC PKWPNPDSPI SKTFELISVI KEEAANRDGP MIVHDEHGGV TAGTFCALTT 1260 

LMHQIiEKENS VDVYQVAKMI NLMRPGVFAD IEQYQFLYKV ILSLVSTRQE ENPSTSLDSN 1320 
GAALPDGNIA ESLESLV 

Seq ID NO: 580 DNA sequence 

Nucleic Acid Accession ft: EOS sequence 

Coding ' sequence :. 1 4 8 - 4 6 3 2 
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CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 
CAAAAAAAAC ATTTCCTTCG' CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA . 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA , 
CAAGTAAATG TGAATCTTAA' GAAACTTAAA TTTCAGGGTT. GGGATAAAAC ATCATTGGAA 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT , 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT- AAAGCAAGCA AGATAACTTT TCACTGGGGA 
AAATGCAATA. TGTCATCTGA TGGATCAGAG- CATAGTTTAG AAGGACAAAA ATTTCCACTT 
GAGATGCAAA- TCTACTGCTT TGATGCGGAC . CGATTTTCAA - GTTTTGAGGA AGCAGTCAAA 
GGAAAAGGGA AGTTAAGAGC. TTTATCCATT TTGTTTGAGG. TTGGGACAGA AGAAAATTTG 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA- CTGACAAGTA. TTACATTTAC . 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 
ACAGTTAGCA- TCTCTGAAAG: CCAGTTGGCT. GTTTTTTGTG AAGTTCTTAC AATGCAACAA 
TCTGGTTATG TCATGCTGAT- GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG : 
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT-: 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC- CCAGAGAATT. ATACCAGCCT TCTTGTTACA ■ 
TGGGAAAGAC CTCGAGTCGT. TTATGATACC ATGATTGAGA" AGTTTGCAGT TTTGTACCAG, 
CAGTTGGATG' GAGAGGACCA ' AACCAAGCAT GAATTTTTGA* CAGATGGCTA TCAAGACTTG. 
GGTGCTATTC TCAATAATTT. GCTACCCAAT ATGAGTTATG - TTCTTCAGAT AGTAGCCATA" 
• TGCACTAATG GCTTATATGG " AAAATACAGC" GACCAACTGA- TTGTCGACAT GCCTACTGAT 
AATCCTGAAC TTGAT CTTTT- CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 
GAAGAGGGAA- AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA. 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA - 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT^ TCCACTTCCC AACCAGTCAC • TAAATTAGCC 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA T AACAGAATA TGAGGAGGAG." 
AGTTTATTGA : CCAGTTTCAA GCTTGATACT GGAGCTGAAG- ATTCTTCAGG CTCCAGTCCC 
GCAACTTCTG CTATCCCATT- CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG' ATGTTGGATC AGGCAGAGAG 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG; TTACAGATCT GGAAATGCCA 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC " 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG : 
GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 
GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 
TACTT AGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 
ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA- 
CATGCAAGTA : GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 
CAGAGCTGTA CTGTTGACTT- AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 
CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 
CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 
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AACAGACCAA AAGCTTATAT TGCTGCCCAA 
TGGAGAATGA TATGGGAACA TAATGTGGAA 
AAAGGAAGGA GAAAATGTGA TCAGTACTGG 
nTCTGGTCA ctcagaagag; TGTGCAAGTG 
CTAAGAAACA CAAAAATAAA' AAAGGGCTCC 



GGCCCACTGA AATCCACAGC TGAAGATTTC 
GTTATTGTCA TGATAACAAA CCTCGTGGAG 
CCTGCCGATG GGAGTGAGGA GTACGGGAAC 
CTTGCCTATT ATACTGTGAG GAATTTTACT 
CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 



ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 
CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG. CAGTGGGGCC TGTTGTCGTC 
CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG" TGCTAGACAG TATGTTGCAG 
CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT. TAAAACACAT CCGTTCACAA 
AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 
GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 
CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA' AACAATTCCA GGGTCTCACT 
CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ' ACTGCAACCT TCCTCTCCCT 
GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 
TCAAATATAC AGCAGAGTGA- CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 
CGAACTTCTT CTATCATCCC . TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 
GAAGGCACAG ACTACATCAA TGCCTCCTAT . ATCATGGGCT ATTACCAGAG CAATGAATTC 
ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 
CATAATGCCC AACTGGTGGT- TATGATTCCT. GATGGCCAAA ACATGGCAGA AGATGAATTT 
GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 
GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 
GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 
AATCCAGATA GCCCCATTAG. TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 
GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 
TTCTGTGCTC TGACAACCCT- - TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 
CAGGf AGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 
CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 
ACCTCTCTGG AGAGTAAT GG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 
TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 
CTAAAATTAG GCAGGAAAAT ' CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 
AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 
TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 
TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 
ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 
GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 
TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 
AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 
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ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 5220 

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 5280 

TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT. AACTTTTGTG GAAj^TAGAA 5340 

ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA .TTGTTCAAAT 5400 

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 5460 
AAAAAAAAAA AAAAAAAAAA. A 

Seq-ID NO: 581 Protein sequences 
Protein" Accession »:E0S sequence 

1 11 21 • j 1 j 1 ] l 

MRILKRFLAC -IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 
OSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV: 
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA- DRFSSFEEAV KGXGKLRALS 
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA AU3PFILLNL- LPNSTDKYYI YNGSLTSPPC 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY-- LQNNFREQQY KFSRQVFSSY . 300 

TGKE E I HEAV- CSSEPENVQA DPENYTSLLV TWERPRWYD. TMIEKFAVLY QQLDGEDQTK ^ 
HEFLTDGYQD: lgailnnllp nmsyvlqiva ictnglygky SDQLIVDMPT DNPELDLFPB 
' LIGTBEIIKE EEEGKDIEEG AIVNFGRDSA- TNQIRKKEPQ'. ISTTTHYNRI- GTKYNEAKTN. 
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTSLPPHT VEGTSASLND 
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE' ESLKDPSMEG NVWFPSSTDr 
TAQPDVGSGR- BSFLQTNYTE IRVDESEKTT KSFSAGPVMS: QGPSVTDLEM PHYSTFAYFP 

TEVTPHAFTP- SSRQQDLVST VNWYSQTTQ PVYNEASNSS- HESRIGLAEG" IiESEKKAVIP 780 

LVIVSALTFI CLWLVGILI YWRKCPQTAH FYLEDSTSPR- VISTPPTPIF PISDDVGAIP' 840 

IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV- . 900 

AYDHSRVKLA" QLAEKDGKLT DYINANYVDG YNRPKAYIAA- QGPLKSTAED F^IWEHNV 960 

EVIVMITNLV, EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ VLAYYTVRNF TLRNTKIKKG, 1020 

SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRHAVGPW VHCSAGVGRT 1080 

GTYIVLDSML: QQIQHEGTVN.' IFGFLKHIRS QRNYLVQTEE- QYVFIHDTLV EAILSKETEV: 1140 

LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL P^PPTSA 1200 

SRVAGTILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE' RSRVGISSLS GEGTDYINAS 1260 

YIMGYYQSNE FIITQHPIiLH- TIKDFWRMIW- DHNAQLWMI PDGQNMAEDE FVYWPNKDEP 1320 

INCESFKVTL MAEEHKCIiSN EEKLIIQDFI" LEATQDDYVL EVRHFQCFKW PNPDSPISKT 1380 

FELISVIKEE" AANRDGPMIV: HDEHGGVTAG TFCALTTLMH ; QLEKENSVDV YQVAKMINLM 1440 
RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA : LPDGNIAESL ESLV 

Seq ID NO: 582 DNA sequence 

Nucleic Acid Accession #: NM_002851.1 

Coding sequence: 148.. 7092 

1 11 21 31 41 51 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA . 120 

CGGCGAGGGG • CCGCAGACCG TCTGGAAATG OGAATOCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT : ACTACAGACA ACAGAGAAAA- 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA- ATCAAAAAAA TTGGGGAAAG 3 0 0 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA- 360 

CAAGTAAATG. TGAATCTTAA ' GAAACTTAAA TTTCAGGGTT- GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT - 480 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA" 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT- 600 

GAGATGCAAA TCTACTGCTT* TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

- GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 
T TAGAT CCAT TCATACTGTT. GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTT AC 840 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT ■:. 900 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA. 960 
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 120.0 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 
GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 
TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 
AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 
GAAGAGGGAA- AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

■ ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

■ AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 
GAAGGTACTT- CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTGTCGG G GACTGCAGA ATCCTTAAAT ACAGTTTCT A TAACAGAATA TGAGGAGGAG 1860 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC ' 1920 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 
G^TTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

- GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC WGCAGAGAG "60 
AGCrrrCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA ^CAACCAAG -2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
ScAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 
GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 
ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2S20 
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TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 25 BO 

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG T AGTGAATTG 2640 

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 

AGCCTTGCTC AGTATTCTGA TGTGCTGTCC* ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 30O0 

GATTCTGTGG GTGTAACTTA- TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT- TTCTTCACCT GTTTCTGTAG CTGAATTTAC. ATATACAACA 3240. 

TCTGTGTTTG GTGATGATAA' TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC" AAATTCCTTC • TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGGGTCTT TACAAGAAAC CTCTGTTTCC 3420 

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480. 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT- AACTTTTCAG TTCAACCTAC ACATACTGTC - 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA ! - CCTGTGCTTA GTGCAAACTC AGAGCCAGCA, 3600" 

TCCTCTGACC CTGCTTCTAG' TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3 660 

ACCTCAGCTT CTTTTAGTAC- TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720. 

GACACCTTGC TTAAAACTGT TCTTCCAGCT; GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ■ ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3B4Q- 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT- 3 900 

ATGCACTCTG CTTCACTTCA , AGGTTTGACC ATTTCCTATG- CAAGTGAGAA ATATGAACCA 3960. 

GTTTTGTTAA AAAGTGAAAG. TTCCCACCAA GTGGTACCTT CTTTGTACAG. TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA. 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATTCCGATG 7 AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG- TTGCTTCTGA TACATTTGTA" TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC. CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG- CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG. GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGACAGA 4440 

GATAGTGATG' GCTTATCCAT TCATAAGTGT ATGTCATGCT • CATCCTATAG AGAATCACAG 4500 

GAAAAGGTAA- TGAATGATTC AGACACCCAC GAAAACAGTC. TTATGGATCA GAATAATCCA' 4560 

ATCTCATACT CACTATCTGA ' GAATTCTGAA GAAGATAATA GAGTCACAAG TGTATCCTCA 4620 

GACAGTCAAA CTGGTATGGA CAGAAGTCCT GGTAAATCAC " CATCAGCAAA TGGGCTATCC, 4680 

CAAAAGCACA. ATGATGGAAA AGAGGAAAAT GACATTCAGA. CTGGTAGTGC TCTCCTTCCT,; 4740 

CTCAGCCCTG AATCTAAAGC ATGGGCAGTT CTGACAAGTG ATGAAGAAAG TGGATCAGGG 4800 

CAAGGTACCT CAGATAGCCT ■ TAATGAGAAT GAGACTTCCA- CAGATTTCAG TTTTGCAGAC. 4 860 

ACTAATGAAA AAGATGCTGA TGGGATCCTG GCAGCAGGTG ACTCAGAAAT AACTCCTGGA 4920 

TTCCCACAGT CCCCAACATC ATCTGTTACT- AGCGAGAACT CAGAAGTGTT CCACGTTTCA 4980 

GAGGCAGAGG CCAGTAATAG TAGCCATGAG TCTCGTATTG GTCTAGCTGA GGGGTTGGAA 5040 

TCCGAGAAG A : AGGCAGTTAT ACCCCTTGTG ATCGTGTCAG CCCTGACTTT TATCTGTCTA 5100 

GTGGTTCTTG TGGGTATTCT CATCTACTGG AGGAAATGCT TCCAGACTGC ACACTTTTAC 5160 

TTAGAGGACA GTACATCCCC TAGAGTTATA TCCACACCTC CAACACCTAT CTTTCCAATT 5220 

TCAGATGATG- TCGGAGCAAT TCCAATAAAG CACTTTCCAA AGCATGTTGC AGATTTACAT 5280 

GGAAGTAGTG GGTTTACTGA AGAATTTGAG ACACTGAAAG AGTTTTACCA GGAAGTGCAG 5340 

AGCTGTACTG TTGACTTAGG TATTACAGCA GACAGCTCCA ACCACCCAGA CAACAAGCAC 5400 

AAGAATCGAT ACATAAATAT- CGTTGCCTAT GATCATAGCA GGGTTAAGCT AGCACAGCTT 5460 

GCTGAAAAGG ATGGCAAACT GACTGATTAT ATCAATGCCA ATTATGTTGA TGGCTACAAC. 5520 

AGACCAAAAG' CTTATATTGC" TGCCCAAGGC CCACTGAAAT CCACAGCTGA AGATTTCTGG 5580 

AGAATGATAT GGGAACATAA TGTGGAAGTT. ATTGTCATGA TAACAAACCT CGTGGAGAAA 5640 

GGAAGGAGAA AATGTGATCA GTACTGGCCT GCCGATGGGA GTGAGGAGTA CGGGAACTTT- 5700 

CTGGTCACTC AGAAGAGTGT - GCAAGTGCTT- GCCTATTATA CTGTGAGGAA TTTTACTCTA 5760 

AGAAACACAA AAATAAAAAA GGGCTCCCAG AAAGGAAGAC CCAGTGGACG TGTGGTCACA 5820 

CAGTATCACT ACACGCAGTG GCCTGACATG GGAGTACCAG AGTACTCCCT GCCAGTGCTG 5880 

ACCTTTGTGA GAAAGGCAGC CTATGCCAAG CGCCATGCAG TGGGGCCTGT TGTCGTCCAC 5940 

TGCAGTGCTG GAGTTGGAAG AACAGGCACA TATATTGTGC TAGACAGTAT GTTGCAGCAG 6000 

ATTCAACACG AAGGAACTGT- CAACATATTT GGCTTCTTAA AACACATCCG TTCACAAAGA' 6060 

AATTATTTGG TACAAACTGA GGAGCAATAT GTCTTCATTC ATGATACACT GGTTGAGGCC 6120 

ATACTTAGTA AAGAAACTGA GGTGCTGGAC AGTCATATTC ATGCCTATGT TAATGCACTC 6180 

CTCATTCCTG GACCAGCAGG CAAAACAAAG CTAGAGAAAC AATTCCAGCT. CCTGAGCCAG 6240 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 6300 

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 63 60 

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC. 6420 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 6480 

CATAATGCGC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 6540 
GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 
GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 
GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 6780 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 6840 

TTCTGTGCTC TGACAACCCT TATGCACCAA 'CTAGAAAAAG AAAATTCCGT GGATGTTTAC 6900 
CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT- TTGCTGACAT TGAGCAGTAT 
CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGAGCACAA GGCAGGAAGA GAATCCATCC 
ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 

TCTTTAGTTT -AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 7140 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 7200 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 7260 

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 7320 

• rrCTAAGAAT "GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 73 80 

ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT - TTAGTGTCAA ATTTTTAGCT 7440 

GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCrGTAAA 7500 

TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 7560 

IaTA^^T SotItTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 7620 
ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 
TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 
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TGTTACCTAA GTCATTAACT • TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 7800 

■ ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 7B60 

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA- 7920 
AAAAAAAAAA AAAAAAAAAA A ■ % 

. Seq ID NO: S83 Protein • sequence 
Protein Accession NP_002842.1 

i ii 2i 31 j 1 r * 

MRILKRFIAC IQLLCVCRLD. WANGYYRQQR KLVEEIGWSY TGALNQKNKG KKYPTCNSPK ' 60 

QSPINIDEDL TQVNVNLKXL KFQGWDKTSIj ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW- GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRAIiS- 180 

ILFEVGTEEN LDFKAI IDGV ESVSRFGXQA ALDPFILLNIi LPNSTDKYYI; YKGSLTSPPC 240 

. TDTVDMIVFK DTVSISESQL. AVFCEVLTKQ QSGYVMLMDY LQNNFREQQY- KFSRQVFSSY- 300 

TGKEEIHEAV CSSEPENVQA. DPENYTSLLV. TWERPRWYD" TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAIZjNNLLP- NMSYVIiQIVA ICTNGLYGKY; SDQLIVDMPT DNPELDLFPE " 420 
LIGTEEIIKE- EEEGKDIEEG ' AIVNPGRDSA TNQIRKKEPQ 7 ISTTTHYNRI GTKYNEAKTN -~ 480 

RSPTRGSEFS GXGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHTr VEGTSASLND -" 540 

GSKTVLRSPH- MNLSGTAESL NTVSITEYEE ESLLTSFKLD' TGAEDSSGSS PATSAIPFIS. 600 

EN I SQCYIFS SENPET I T YD VLIPESARNA SEDSTSSGSE ESLKDPSMEG. NVWFPSSTDI 660 

TAQPDVGSGR ESFLOTNYTE- I RVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP. 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNGETPLQ: PSYSSEVFPL VTPLLLDNQI- 780 

LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ: 840 

ILPQVTSATE' SDKVPLHASL. PVAGGDLLLE. PSfcAQYSDVL STTHAASETL EFGSESGVLY 900 

KTLMFSQVEP" PSSDAMMHAR SSGPEPSYAL. SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG- 960 

SliFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLIiPDT- DGLTALNISS- 1020 

PVSVAEFTYT TSVFGDDNXA LSKSEIIYGN": ETELQIPSFN' EMVYPSESTV MPNMYDNVNK '• 1080 

LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEI SQVPE' NNFSVQPTHT' VSQASGDTSL 1140 

KPVLSANSEP' ASSDPASSEM- LSPSTQLLFY ETSASFSTEV. LLQPSFQASD VDTLLKTVLP' 1200 

AVPSDPILVE TPKVDKISST MLHLIVSNSA- SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260 

TISYASEKYE PVLLKSESSH- QWPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320 

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS ; 1380 
PHRDGSVTST- KLLFPSKATS 1 ELSHSAKSDA" GLVGGGEDGD TDDDGDDDDD RDSDGLSIHK .1440 

CMSCSSYRES QEKVMNDSDT. HENSLMDQNN PISYSLSENS EEDNRVTSVS SDSQTGMDRS 1500 

PGKSPSANGIi SQKHNDGKEE NDIQTGSALL PLSPESKAWA VLTSDEESGS GQGTSDSLKE, 1560 

NETSTDFSFA DTNEKDADGI LAAGDSEITP GFPQSPTSSV TSENSEVFHV SEAEASNSSH 1620 

ESRIGLAEGL ESEKKAVIPL VIVSALTFIC. LWLVGILIY- WRKCFQTAHF YLEDSTSPRV 1680 

ISTPPTPIFP- ISDDVGAIPI KHFPKHVADL HASSGFTEEF- ETLKEFYQEV QSCTVDLGIT 1740 

ADSSNHPDNK HKNRYINIVA YDHSRVKLAQ. LAEKDGKLTD YINANYVDGY NRPKAYIAAQ 1800 

GPLKSTAEDF WRMIWEHNVE V1VMITNLVE KGRRKCDQYW PADGSEEYGN FLVTQKSVQV ■ IB 60 

LAYYTVRNFT LRNTKIKKGS QKGRPSGRW: TQYHYTQWPD MGVPEYSLPV LTFVRKAAYA 1920 

KRHAVGPVW- HCSAGVGRTG TYIVLDSMLQ- QIQHEGTVNI" FGFLKHIRSQ RNYLVQTEEQ 1980 

YVFIHDTLVE AILSKETEVL DSHIHAYVNA LLIPGPAGKT KL.EKQFQLLS QSNIQQSDYS 2040 

AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS YIMGYYQSNE FIITQHPLLH- 2100 

TIKDFWRMIW DHNAQIjWMI- PDGQNMAEDE FVYWPNKDEP INCESFKVTL MABEHKCLSN 2160 

EEKLIIQDFI LEATQDDYVL EVRHFQCPKW. PNPDSPISKT FELISVIKEB AANRDGPMIV 2220 

HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM RPGVFADIEQ YQFLYKVILS 2280 
LVSTRQEENP STSUDSNGAA LPDGNIAESL ESLV 

Seq ID NO: 584 DNA sequence 

Nucleic Acid' Accession fc: NM_005688.1 

Coding sequences 126.. 4439 

1 11 21 31 41 51 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 

AGGGGCGCAG GAATTCTGAT. GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120 

AGAAGATGAA- GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA- 180 

GTGTGAGGGA- GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG, CATCCCAAGG 360 

GAAAGTACCA TCATGGGTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC; 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 

CCCGTGTGGC CCACAAGAAG: GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA' ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT- CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 

ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGAGTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG' GTTCACATGA 1560 

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA" ATGAAAAAAG 1680 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1B00 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 
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ACAGCATCGA TCTGGAGATC • CAAGAGGGTA AACTGGTTGG 
GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT 
TTGCAATCAG TGGAACCTTC . GCTTATGTGG CCCAGCAGGC 
TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA 
ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG 
GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG 
TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT 
TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT 
TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA 

- GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA 
CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT 
AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC 
AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA 
GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC 
TCCTGGTTAT, TATGGCCCTT- TTCATGCTGA ATGTAGGCAG 
GGTTGAGTTA 4 CTGGATCAAG' CAAGGAAGCG GGAACACCAC 
CCTCGGTGAG: TGACAGCATG AAGGACAATC CTCATATGCA 
CCCTCTCCAT GGCAGTCATG' CTGATCCTGA AAGCCATTCG 
GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT 
CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT 
TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT 
• TGGTGTTCTT . CTGTGTGGGA" ATGATCGCAG GAGTCTTCCC 
GGCCCCTTGT. CATCCTCTTT TCAGTCCTGC ACATTGTCTC 
TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC 
AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA 
AGCTGCTGGA TGACAACCAA GCTCCTTTTT. TTTTGTTTAC 
CTGTGCGGCT. GGACCTCATC 1 * AGCATCGCCC TCATCACCAC- 
TTATGCACGG GCAGATTCCC. CCAGCCTATG CGGGTCTCGC 
TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA 
CGGTGGAGAG GATCAATCAC ■ TACATTAAGA- CTCTGTCCTT 
AGAACAAGGC TCCCTCCCCT" GACTGGCCCC AGGAGGGAGA 
AGATGAGGTA CCGAGAAAAC CTCCCTCTTG . TCCTAAAGAA 

' CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG 
CCCTCTTCCG tctggtggagC TTATCTGGAG. GCTGCATCAA 
GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT 
TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA' 
TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT 
TTGAATCTGA AGTGATGGAG ( AATGGGGATA ACTTCTCAGT 
GCATAGCTAG AGOCCTGCTC OGCCACTGTA AGATTCTGAT. 
CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT 

" GTACCATGCT GACCATTGCC . CATCGGCTGC ACACGGTTCT 
TGCTGGCCCA GGGACAGGTG\ GTGGAGTTTG ACACCCCATC 
GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA 
TCCTCCCTGT TGACGAAGTC , TCTTTTCTTT AGAGCATTGC 
CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG 
GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC 
ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA 
GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA 
TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA 
TATTAAAATA AGCACTGTGC TAATAACAGT, GCATATTCCT 
TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG 
CTCTAGCTGG TGGTTTCACG • GTGCCAGGTT TTCTGGGTGT 
ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC 
GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA 
CTGTCCTGGT GTCACTTACT GTTT CTGTCA" GGAGAGCAGC 
TTTCACTCCC TCCATCAAGA ATGGGGATCA' CAGAGACATT 
TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA 
TCCCACTGCC TCAGGTTCCT, ATGGCTGGCC ACTGCACAGA 
GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG 
ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT 
CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC 
CAGCTCTTGC TAATCAGTGT CTCACACTGG CXjTAGAAGTT 
ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC 
GGGGCTGGTA GCTCAGGTGG i GCGTGGTCAC TGCTGTCATC 
ATGTCGTGAC CAACTAGACa' TTCTGTCGCC TTAGCATGTT 
CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG 
AAAAAAAAAA AAAAAAAA 
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Seq ID NO:. 585 Protein sequence 
Protein Accession #: NP_005679.1 



MKDIDIGKEY 
IiDASMHSQLR 
VAHKKGELSM 
LSIVCLMITQ 
ALNYBTGVRL 
PWAILGMIY 
VLTYIKFIKM- 
miTLGPDLTA 
NKPASPHIKI 
VLAEQKGHLL 
KTSLISAILG 
CCLRPDLAIL 
UHIFNSAIRK 
FNNLLLGETP 
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IIPSPGYRSV 
ILDEEHPKGK 
EDVWSLSKHB 
LAGFSGPAFM 
RGAILTMAFK 
NVI ILGPTGF 
YAWVKAFSQS 
AQAFTWTVP 
EMKNATLAWD 
LDSDERPSPE 
QMTLLEGSIA 
PSSDLTEIGE 
HLKSKTVLFV 
PVEINSKKET 
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i 

RERTSTSGTH 
YHHGLSALKP 
SSDVNCRRLE 
VKHLLEYTQA- 
KILKIiKNIKE 
LGSAVFILFY 
VQKIREEERR 
NSMTFALKVT 
SSHSSIQNSP 
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I 



ISGTFAYVAQ 
RGANLSGGQR 
THQLQYLVDC 
SGSQKKSQDK 



IRTTSKHQHP 
RIiWQEELNEV 
TESNLQYSU, 
KSLGELINIC 
PAMMFASRLT 
ILEKAGYFQG 
PFSVKSLSEA 
KLTPKMKKDK 
HIiRLQHTLHS 
QAWILNATLR 
QRISLARALY 
DEVI FM KEG C 
GPKTGSVKXE 



AATCTGCGGC 
GACGCTTCTA 
CTGGATCCTC 
AAGATACAAC 
CAGCGACCTG 
GATCAGCCTT 
CAGTGCCTTA 
CAAGTCCAAG. 
AGTGATCTTC 
TTTAAATGGT 
TGAGATCAAT 
TAAAACAGGA 
GCTGGAAGAG* 
TGCTGGGGGC 
CACCGCCTTC 
TGTGACTCGA 
GTACTATGCC 
AGGAGTTGTC 
CCGAAGGATC 
CAACAGGTTT 
GTTCATCCAG 
GTGGTTCCTT 
CAGGGTCCTG 
CCACATCACG 
GTTTCTGCAC 
GTGTGCGATG 
CACGGGGCTG 
CATCTCTTAT 
GACAGAAGCT 
GGAAGCACCT 
GGTGACCTTT 
AGTATCCTTC 
GAAGTCCTCG 
GATTGATGGA 
CATTCCTCAA 
CCAGTACACT 
TGCTCAGCTA 
GGGGGAACGG- 
TTTAGATGAA 
CCGAGAAGCA 
AGGCTCCGAT 
GGTCCTTCTG 
GGTCGCTGTC 
CATTCCCTGC 
ATTTTATCTT 
ATATTTTGAT 
ATTGCACTCT 
GCTTTATACG 
AATGTAAGCT 
TTCTATCATT 
AAGAGTAGCA 
CCAAAGGAAG 
AGCCGCTCCA 
GTTCTCAGGG 
GGGGCGAAGC 
CCTCCGAGCC 
ATCAGTCTAT 
GCTCTCCAGC 
TGGCACTTTT 
TCGTGGGTCr 
TCAAAGTCTG 
TTTGTACTGT 
GCAAACCCCC 
AGTTGAATGG 
TGCTGAACAC 
TAAAAAAAAA 
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TRPLECCDAL 
VDNAGLFSCM 
GPDAASLRRV 
LVLGIiLLTEI 
SNDGQRMFEA 
AYFRRKCVAA 
ITVGVAPIW 
SVAVDRFKSL 
RASRGKKEKV 
IDLEIQEGKL 
DNILFGXEYD 
SDRSIYILDD 
ITERGTHEEL 
KAVKPEEGQL 



AGTGTGGGAA 
GAGGGCAGCA 
AATGCTACTC 
TCTGTGCTGA 
ACGGAGATTG 
GCCCGGGCCT 

gatgcccatg 
acagttctgt 
atgaaagagg' 
gactatgcta 

TCAAAAAAGG 
TCAGTAAAGA 
AAAGGGCAGG 
CCCTTGGCAT 
AGCACCTGGT 
GGGAACGAGA 
AGCATCTACG. 
TTTGTCAAGG ' 
CTTCGAAGCC 
TCCAAAGACA 
AACGTTATCC 
GTGGCAGTGG 
ATTCGGGAGC. 
TCCAGCATAC 
AGATACCAGG 
CGGTGGCTGG 
ATGATCGTTC 
GCTGTCCAGT 
CGATTCACCT 
GCCAGAATTA- 
GAGAACGCAG ' 
ACGATCAAAC 
CTGGGGATGG 
GTGAGAATCA . 
GAGCCGGTGC 
GAAGACCAGA 
CCTCTGAAAC 
CAGCTCTTGT- 
GCCACAGCTG 
TTTGCAGACT 
AGGATTATGG 
TCCAACGACA. 
AAGGGCTGAC 
CTGGGGCGGG 
TCGCACAGCA- 
TATTGTATTT 
AAAAGGTTCA' 
TGTAGCTATA 
GTTTATTTTA 
TTTGTACAGT 
TTTCATTCTT 
ACGTGTGGCA 
GGGGTGGCTG 
CTCCTGCCTT 
CCAGGCCCCT 
GGGGAGTTTC 
CCACAGAGAG 
TCCAAGACCT 
TCATTTGCCT 
GTTTTCCTTT 
CAACTTTAAG 
AAAGAGACCT 
TTTGTGCTGT 
TCAGCGTTGC 
CTTGTGGAAG 
AAAAAAAAAA 
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ETAARAEGLS 

TFSWbSSLAR 

VWIFCRTRLI 

VRSWSLALTW 

AAVGSLLAGG 

TDERVQKKNE 

VIASWTFSV . 

FLMEEVHMIK 

RQLQRTEHQA 

.VGICGSVGSG 

EERYNSVLNS 

PLSALDAHVG 

MNLNGDYATI 

VQLEEKGQGS 
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480 
540 
600 . 
660 
720 
780 
840 
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1200 



WO 02/086443 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 
VSDSMKDNPH KQYYASIYAL SMAVMLILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 
KPPDTTPTGR ILNRFSKDMD EVDVRLPFQA EKFIQNVILV FPCVGMIAGV FPWFLVAVGP 1020 
LVILFSVLHI VSRVLIREIiK RLDNITQSPF' LSHITSSIQG LATIHAYNKG QEFLHRYQEL 
LDDNQAPFFL FTCAMRWIAV RLDLISIALI • TTTGLMIVLM KGQIPPAYAG 1AISYAVQLT 
GLFQFTVRLA SETEARFTSV. ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM . 
RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 
IGLADLRSKI* SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CI AQLPLKL3 1320 
SEVMENGDNF SVGERQLLCI ARALfcRHCXI LILDEATAAM DTETDLLIQE TIREAFADCT 1380 
MLTIAHRLHT VLGSDRIMVL AQGQWEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG 

Seq ID NO: S86 DNA sequence 

Nucleic Acid Accession ft: NM_001327.1 

Coding sequence: 89.. 631 w 

1 11 21 |1 41 51 

AGCAGGGGGC GCTGTGTGTA CCGAGAATAC GAGAATACCT CGTGGGCCCT GACCTTCTCT 60 

CTGAGAGCCG GGCAGAGGCT CCGGAGCCAT GCAGGCCGAA GGCCGGGGCA CAGGGGGTTC 120 

GACGGGCGAT GCTGATGGCC CAGGAGGCCC TGGCATTCCT GATGGCCCAG GGGGCAATGC 180 

TGGCGGCCCA GGAGAGGCGG GTGCCACGGG CGGCAGAGGT CCCCGGGGCG CAGGGGCAGC 240 

AAGGGCCTCG GGGCCGGGAG GAGGCGCCCC GCGGGGTCCG CATGGCGGCG CGGCTTCAGG 300 

GCTGAATGGA TGCTGCAGAT GCGGGGCCAG-- GGGGCCGGAG AGCCGCCTGC TTGAGTTCTA 360 

CCTCGCCATG CCTTTCGCGA CACCCATGGA AGCAGAGCTG GCCCGCAGGA GCCTGGCCCA 420 

GGATGCCCCA CCGCTTCCCG TGCCAGGGGT GCTTCTGAAG GAGTTCACTG TGTCCGGCAA 480 

CATACTGACT ATCCGACTGA CTGCTGCAGA CCACCGCCAA CTGCAGCTCT CCATCAGCTC 540 

CTGTCTCCAG CAGCTTTCCC TGTTGATGTG GATCACGCAG TGCTTTCTGC CCGTGTTTTT 600 

GGCTCAGCCT CCCTCAGGGC AGAGGCGCTA* AGCCCAGCCT GGCGCCCCTT CCTAGGTCAT 660 

GCCTCCTCCC CTAGGGAATG GTCCCAGCAC GAGTGGCCAG TTCATTGTGG GGGCCTGATT 720 
GTTTGTCGCT GGAGGAGGAC GGCTTACATG TTTGTTTCTG TAGAAAATAA AACTGAGCTA 

Seq ID NO: 587 Protein sequence 
protein Accession ft: NPJJ01318.1 

1 11 21 ] X j 1 I 1 

MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 60 
PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAMPFATPM EAELARRSLA QDAPPLPVPG 120 
VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCFLPVF LAQPPSGQRR 

Seq ID NO: 588 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding Bequence: 52.-459 

1 11 " - 21 31 41 51 

CCTCGTGGGC CCTGACCTTC TCTCTGAGAG CCGGGCAGAG GCTCCGGAGC CATGCAGGCC 60 

GAAGGCCAGG GCACAGGGGG ^TTCGACGGGC GATGCTGATG GCCCAGGAGG CCCTGGCATT 120 

CCTGATGGCC CAGGGGGCAA TGCTGGCGGC CCAGGAGAGG CGGGTGCCAC GGGCGGCAGA 180 

GGTCCCCGGG GCGCAGGGGC AGCAAGGGCC TCGGGGCCGA GAGGAGGCGC CCCGCGGGGT 240 

CCGCATGGCG GTGCCGCTTC " TGCGCAGGAT GGAAGGTGCC CCTGCGGGGC CAGGAGGCCG 300 

GACAGCCGCC TGCTTCAGTT CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC 360 

ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 420 

GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 480 

TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG 540 

GCCTGATTGT TTGTCGCTGG ' AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG 600 
CTGAGCTA 

Seq ID NO: 589 Protein sequence 
Protein. Accession 8: Eos sequence 

1 11 f r r r 

MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 60 
PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 120 
FLPVFLAQAP SGQRR 

Seq ID NO: 590 DNA sequence 
Nucleic Acid Accession ft: NM_005562.1 
Coding sequence: 90.. 3671 

1 11 21 31 |1 51 

ACAGCGGAGC ' GCAGAGTGAG AACCACCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 60 

AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 120 

GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 180 

ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240 

' TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300 

GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360 

CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420 

' CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480 

ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 540 

' ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT 600 

CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660 

SctSc^C CAGCTGCCGt AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720 

TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC. 780 

AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840 



414 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG 
TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC 
GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG 
TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG 
TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC 
CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC 
TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG 
' AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC 
GCACCTGTAT TCCTTGTAAC TGTCAAGGGG'. GAGGGGCCTG 
GTTATTCAGG- GGATGAGAAT' CCTGACATTG AGTGTGCTGA 
ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG 
CAGTGATGCC GGAGACGGAG- GAGGTGGTGT GCAATAACTG. 
CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC- 
TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA 
GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA- 
ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC 
GAGCTTGCAA- CTGTAACCCC ATGGGCTCAG; AGCCTGTAGG 
GTGTTTGCAA GCCAGGATTT- GGTGGCCCCA ACTGTGAGCA 
CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT 
AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG. ATGGAGTAGT 
GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT 
AAGGTGCTAG- CAGATCCCTT GGTCTCCAGT- TGGCCAAGGT 
ACCAGAGCCG- CCTGGATGAC CTCAAGATGA CTGTGGAAAG' 
AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC 
CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC 
CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT. 
CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA 
CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG 
CTGTGGTGCA AGGGCTTGTG: GAAAAATTGG AGAAAACCAA 
CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC 
TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA 
CAAAGAGGAT CAAACAAAAA" GCGGATTCAC TCTCAACGCT 
AGTTCAAGCG. TACACAAAAG AATCTGGGAA ■■ ACTGGAAAGA 
AGAATGGAAA- AAGTGGGAGA GAGAAATCAG;- ATCAGCTGCT 
AAAGCAGAGC ACAAGAAGCA- CTGAGTATGG' GCAATGCCAC 
TCCTTAAAAA CCTCAGAGAG TTTGACCTGC; AGGTGGACAA 
AAGCCATGAA GAGACTCTCC TACATCAGCC" AGAAGGTTTC 1 . 
AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC 
CCGGGGAGGC CCTGGAAATC TCCAGTGAGA* TTGAACAGGA 
AAGCCAATGT GACAGCAGAT. GGAGCCTTGG CCATGGAAAA 
GTGAGATGAG GGAAGTGGAA - GGAGAGCTGG : AAAGGAAGGA 
TGGATGCAGT ACAGATGGTG ATTACAGAAG; CCCAGAAGGT 
CTGGGGTTAC AATCCAAGAC ACACTCAACA- CATTAGACGG 
AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA 
AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA 
AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA 
AGAACTTGGA GAACATTAGG GACAACCTGC. CCCCAGGCTG 
AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT 
GGCTCGGGAG CCATGTCATG- TGAGTGGGTG GGATGGGGAC 
TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA 
TGCACCAT AC ' TCCTTGCTTC CTGATGCTGG GCAATGAGGC 
ATGATCAAGG ATCTGGACCC CAAAGAATAG , ACTGGATGGA 
ATGTTTGCCT CATAATAGTC GTAAGTGGAG . TCCTGGAATT 
ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA 
ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT, 
ACTATTGCCT CATATTGTCC TCTGCAAGCT, TCTTGCTGAT 
ACCCAGGGTG. TGAACATGTT CTCCATTTTC AAGCTGGAAG 
AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG 
GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC 
ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG 
GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG 
ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT 
ATCTTATTTT- CTCAATCTCC TCTCTCTTTC CTCCACCCAT. 
CACACTTCAG CTGGGTCACA. TCCATCCCTC CATTCATCCT 
TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC 
GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG 
AGCATTTTTA AAAAATAAAT TTAAA CTTA C AAACTTTGTT 
GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA 
CATGGGGGCA CTTGAGTTTT. GGCAAGGCTG ACAGAGCTCT 
CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC 
TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC 
TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA 
CAATTGTTAG ATGCC 

Seq ID NO: 591 Protein sequence 
Protein Accession #: NP_005553'.l 
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MPALWLGCCL 
DGIHCEKCKN 
HMLTDAGCTQ 
PEGCTQCFCY 
SSAQRLDPVY 
MPLGKTLPCG 
DNVTLISARP 
GGACDPDTGD 
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.CFSIiLLPAAR 
GFYRHRERDR 
DQRLLDSKCD 
GHSASCRSSA 
; FVAPAKFliGN 
LTKTYTFRLN 
VSGAPAPWVE 
CYSGDENPDI 
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ATSRREVCDC 
CLPCNCNSKG 
CDPAGIAGPC 
EYSVHKITST 
OQVSYGQSLS 
EHPSNNWSPQ 
QCICPVGYKG 
ECADCPIGPY' 
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NGKSRQCIFD 
SLSARCDKSG 
DAGRCVCKPA 
FHQDVDGWKA 
FDYRVDRGGR 
LSYFEYRRLL 
QFCQDCASGY 



CTATGGGCAA 
CCATGATGTG 
CAAGACACTG 
CAATAATTGG 
AGCCCTCCGC 
CCTGATTTCA 
TCCTGTTGGG 
AGCGAGACTG 
TGATCCAGAC 
CTGCCCAATT 
TCATAACGGG 
CCCTCCCGGG 
CTTTGGTGAA" 
CCCCAGTGCC 
CACAGCCGGC 
CAACCCAGCA 
ATGTCGAAGT 
TGGAGCATTC 
GCAGCAGCTT 
ACCTGATACA 
GAGAGATGCC 
GAGGAGCCAA 
AGTTCGGGCT 
TCAGATGCAG 
CTCAGACCAC- 
AGCAGAAAGC. 
CTATTCCAAA 
AAGCGGTAGC 
GTCCCTGGCC 
TTATCAGCAC 
GTCCTTTCAG 
GGTAACCAGG 
AGAAGCACAG - 
TTCCCGTGCC, 
TTTTTATGAA 
CAGAAAAGCA 
AGATGCCAGT 
ACAGAGGGCA 
GATTGGGAGT 
GGGACTGGCC 
GCTGGAGTTT 
TGATACCAGA 
CCTCCTGCAT 
GCAGAAGCTT 
GCTGGAAGAG 
TGGGATTCTG 
CTACAATACC 
TCTTGGGATA 
ATTTGAACAT 
TGGCCAGGTG 
AGATAGCACT 
AAGACAAACT 
TGGACAAGTG 
AACTTTGACT 
CACACTGTGG 
CAGAGTTCCT 
AAGTGAGCAG 
CTGGTGCCTG 
CATGGCAACT 
GGAAAGTATT 
AGGTAAAATT 
GATGCGCATC 
AATAAGAGAA 
TCCATCCATC 
CTACTGTGTG 
ATTGTCTAGT 
TGTCACAAGT 
CATATGTTGC 
GGGTTGTGCA 
AACAGACTGT 
CACCTTGGCT 
AAGTGTTTTT 
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RELHRQTGNG 
RCSCKPGVTG 
VTGERCDRCR 
VQRNGSPAKL 
KPSAHDVILE 
RNLTALRIRA 
RRDSARLGPF 
PCPCHNGFSC 



AGCCTGTCCT 

ATTCTGGAAG 

CCTTGTGGGC 

AGCCCCCAGC 

ATCCGAGCTA * 

GCCCGCCCTG 

TACAAGGGGC ' 

GGGCCTTTTG 

ACAGGAGATT 

GGTTTCTACA • 

TTCAGCTGCT 

GTCACCGGTG 

CATGGCCCAG • 

TCTGGGAATT 

ATCTACTGCG 

GACAAGTGTC . 

GATGGCACCT 

AGCTGTCCAG. 

CAGAGAATGG 

GAGCTGGAAG. 

CAGATTTCAG. 

GAGAACAGCT- 

CTGGGAAGTC 

CTGAGCCTGG^ 

TACGTGGGGC 

CACGTTGAGT . 

CAAGCCCTCT 

CCGGACGGTG 

CAGCAGTTGA 

AGTCTCCGCC 

GTGGAAGAAG 

CATATGGATG- 

CAGCTCTTAC 

AATCTTGCTA - 

GTTGAGAGCA ^ 

GAAGCTGAAG 

GACAAGACCC 

AAGAATGGGG 

CTGAACTTGG ^ 

TCTCTGAAGA 

gacacgaata;. 
gccaagaacg ~ 

CTGATGGACC 
TCCCGAGCCA 
AGGGCACGTC 
GCTGATGTGA 
CAGGCTCTTG 
CAGATCTCAG 
GTTTAATGGG 
GTTGTCTTAT 
GGGTGTGAGA 
GCACAGGCAG 
CTGTTGGGAT 
TTGCCCAGGC 
CCAGTAAAAT 
CCTACTTACA 
TGTTGGAGTG- 
CCACCTTCAA 
TAGAGATTGC 
TACTTTTTCG 
CTCTAGATTT 
AATGTATTTT 
TGTTCCTACT 
TTTCCATCCA 
CCAGGGGCTG 
GAGGAAGACA 
GGTGTTTATT 
AAGACCCTCC 
CATTTCTTTG 
TGAGTTATGA 
GGGAAGACTA 
AAATAAAGAA 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
i680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
S160 



51 
I 

FRCLNCKDNT 
ARCDRCLPGF 
SGYYNLDGGN 
QWSQRHQDVF 
GAGLRITAPL 
TYGEYSTGYI 
GTCIPCNCQG 
SVMPETEEW 



60 
120 
180 
240 
300 
360 
420 
480 
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CNNCPPGyTG' ARCELCADGY * PGDPFGEHGP VRPCQPCQCN NNVDPSASGN CDRLTGRCLK 540 

CIHNTAGIYC DQCKAGYFGD PIAPNPADKC RACNCNPKGS EPVGCRSDGT CVCKPGFGGP 600 

MCEHGAFSCP ACYNQVXIQM DQFMQQLQRM EALISKAQGG. DGWPDTEIiE GRMQQAEQAL 660 

QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTR 720 

RLITQKQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE ATRLAESHVE SASNMEQLTR 780 

ETEDYSKQAL. SLVRKALHEG VGSGSGSPDG AWQGLVEKL EKTKSLAQQL TREATGAEIE 840 

ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS' LSTLVTRHMD EFKRTQKNLG 900 

NWKEEAQOLI; QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLREFDL 960 

QVDNRKAEAE EAMXRLSYIS- QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE . 1020 

IEQEIGSLNL EANVTADGAIi ' AMEKGLASLK SEMREVEGEh ERKELEFDTN HDAVQMVITE 1080 

AQKVDTRAKN AGVT IQDTLN. TLDGLLHLMD. QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 1140 
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL PPGCYNTQAL EQQ 

Seq ID' NO: 592 DNA sequence 

Nucleic Acid Accession 8: AF101051.1 

Coding sequence: 221-856 

1 . 11 . 21 31 41 .51 

GAGCAACCTC AGCTTCTAGT ATCCAGACTC CAGCGCCGCC CCGGGCGCGG ACCCCAACCC 60 

CGACCCAGAG CTTCTCCAGC GGCGGCGCAG CGAGCAGGGC TCCCCGCCTT AACTTCCTCC 120 

GCGGGGCCCA- GCCACCTTCG GGAGTCCGGG TTGCCCACCT. GCAAACTCTC" CGCCTTCTGC ' 180 

ACCTGCCACC CCTGAGCCAG CGCGGGCGCC CGAGCGAGTC ATGGCCAACG CGGGGCTGCA' 240 

GCTGTTGGGC TTCATTCTCG CCTTCCTGGG ATGGATCGGC GCCATCGTCA- GCACTGCCCT ■ 300 

GCCCCAGTGG AGGATTTACT CCTATGCCGG CGACAACATC GTGACCGCCC AGGCCATGTA 360 

OGAGGGGCTG TGGATGTCCT GCGTGTCGGA GAGCACCGGG CAGATCCAGT GCAAAGTCTT " 420 

TGACTCCTTG CTGAATCTGA GCAGCACATT GCAAGCAACC CGTGCCTTGA, TGGTGGTTGG 480 

CATCCTCCTG GGAGTGATAG CAATCTTTGT GGCCACCGTT GGCATGAAGT GTATGAAGTG 540 

CTTGGAAGAC GATGAGGTGC. AGAAGATGAG GATGGCTGTC ATTGGGGGTG CGATATTTCT 600 

TCTTGCAGGT CTGGCTATTT TAGTTGCCAC AGCATGGTAT GGCAATAGAA TCGTTCAAGA 660 

ATTCTATGAC CCTATGACCC CAGTCAATGC CAGGTACGAA TTTGGTCAGG CTCTCTTCAC 720 

TGGCTGGGCT GCTGCTTCTC TCTGCCTTCT GGGAGGTGCC CTACTTTGCT GTTCCTGTCC 780 

CCGAAAAACA ACCTCTTACC CAACACCAAG GCCCTATCCA ■ AAACCTGCAC CTTCCAGCGG' j 840 

GAAAGACTAC GTGTGACACA- GAGGCAAAAG GAGAAAATCA" TGTTGAAACA AACCGAAAAT 900 

GGACATTGAG ATACTATCAT. TAACATTAGG ACCTTAGAAT TTTGGGTATT GTAATCTGAA 960 

GTATGGTATT ACAAAACAAA CAAACAAACA AAAAACCCAT GTGTTAAAAT ACTCAGTGCT ' 1020 

AAACATGGCT TAATCTTATT TTATCTTCTT TCCTCAATAT AGGAGGGAAG ATTTTACCAT 1080 

TTGTATTACT GCTTCCCATT GAGTAATCAT ACTCAAATGG GGGAAGGGGT GCTCCTTAAA" 1140 

TATATATAGA- TATGTATATA" TACATGTTTT TCTATTAAAA ATAGACAGTA" AAATACTATT. , 1200 . 

CTCATTATGT TGATACTAGC ATACTTAAAA TATCTCTAAA. ATAGGTAAAT GTATTTAATT 1260 

CCATATTGAT GAAGATGTTT ATTGGTATAT. TTTCTTTTTC GTCCTTATAT ACATATGTAA 1320 

CAGTCAAATA TCATTTACTC TTCTTCATTA GCTTTGGGTG: CCTTTGCCAC AAGACCTAGC 1380 

CTAATTTACC AAGGATGAAT TCTTTCAATT CTTCATGCGT GCCCTTTTCA TATACTTATT 1440 

TTATTTTTTA CCATAATCTT ATAGCACTTG CATCGTTATT ; AAGCCCTTAT TTGTTTTGTG . 1500 

TTTCATTGGT CTCTATCTCC TGAATCTAAC ACATTTCATA- GCCTACATTT TAGTTTCTAA - 1560 

AGCCAAGAAG AATTT ATT AC AAATCAGAAC TTTGGAGGCA AATCTTTCTG CATGACCAAA 1620 

GTGATAAATT- CCTGTTGACC TTCCCACACA ATCCCTGTAC TCTGACCCAT AGCACTCTTG 16 B0 

TTTGCTTTGA ' AAATATTTGT " CCAATTGAGT AGCTGCATGC TGTTCCCCCA GGTGTTGTAA 1740 

CACAACTTTA TTGATTGAAT TTTTAAGCTA CTTATTCATA GTTTTATATC CCCCTAAACT 1800 

ACCTTTTTGT TCCCCATTCC TTAATTGTAT TGTTTTCCCA- AGTGTAATTA TCATGCGTTT 1860 

TATATCTTCC TAATAAGGTG- TGGTCTGTTT GTCTGAACAA AGTGCTAGAC TTTCTGGAGT 1920 

GATAATCTGG- TGACAAATAT TCTCTCTGTA GCTGTAAGCA- AGTCACTTAA TCTTTCTACC 1980 

TCTTTTTTCT. ATCTGCCAAA TTGAGATAAT GATACTTAAC CAGTTAGAAG AGGTAGTGTG . 2040 

AATATTAATT AGTTTATATT ■ ACTCTCATTC TTTGAACATG AACTATGCCT. ATGTAGTGTC 2100 

TTTATTTGCT CAGCTGGCTG AGACACTGAA GAAGTCACTG AACAAAACCT ACACACGTAC 2160 

CTTCATGTGA TTCACTGCCT TCCTCTCTCT ACCAGTCTAT TTCCACTGAA CAAAACCTAC 2220 

ACACATACCT TCATGTGGTT CAGTGCCTTC CTCTCTCTAC CAGTCTATTT CCACTGAACA 22 B0 

AAACCTACGC ACATACCTTC ATGTGGCTCA GTGCCTTCCT CTCTCTACCA GTCTATTTCC 2340 

ATTCTTTCAG CTGTGTCTGA CATGTTTGTG CTCTGTTCCA TTTTAACAAC TGCTCTTACT 2400 

TTTCCAGTCT GTACAGAATG CTATTTCACT TGAGCAAGAT GATGTATGGA AAGGGTGTTG 2460 

GCACTGGTGT CTGGAGACCT GGATTTGAGT CTTGGTGCTA TCAATCACCG TCTGTGTTTG 2520 

AGCAAGGCAT TTGGCTGCTG TAAGCTTATT GCTTCATCTG TAAGCGGTGG TTTGTAATTC 2580 

CTGATCTTCC CACCTCACAG TGATGTTGTG GGGATCCAGT GAGATAGAAT ACATGTAAGT 2640 

GTGGTTTTGT AATTTGAAAA GTGCTATACT AAGGGAAAGA ATTGAGGAAT TAACTGCATA 2700 

CGTTTTGGTG TTGCTTTTCA AATGTTTGAA" AATAAAAAAA TGTTAAGAAA TGGGTTTCTT 2760 

GCCTTAACCA GTCTCTCAAG TGATGAGACA GTGAAGTAAA ATTGAGTGCA CTAAACGAAT 2820 

AAGATTCTGA GGAAGTCTTA" TCTTCTGCAG TGAGTATGGC CCAATGCTTT CTGTGGCTAA 2880 

ACAGATGTAA TGGGAAGAAA TAAAAGCCTA CGTGTTGGTA AATCCAACAG CAAGGGAGAT 2940 

TTTTGAATCA TAATAACTCA TAAGGTGCTA TCTGTTCAGT GATGCCCTCA GAGCTCTTGC. 3000 

TGTTAGCTGG CAGCTGACGC TGCTAGGATA GTTAGTTTGG AAATGGTACT. TCATAATAAA 3060 

CTACACAAGG AAAGTCAGCC ACCGTGTCTT ATGAGGAATT- GGACCTAATA AATTTTAGTG 3120 

TGCCTTCCAA ACCTGAGAAT ATATGCTTTT GGAAGTTAAA ATTTAAATGG CTTTTGCCAC 3180 

ATACATAGAT CTTCATGATG TGTGAGTGTA ATTCCATGTG GATATCAGTT ACCAAACATT 3240 

ACAAAAAAAT TTTATGGCCC AAAATGACCA ACGAAATTGT TACAATAGAA TTTATCCAAT 3300 

TTTGATCTTT TTATATTCTT CTACCACACC TGGAAACAGA CCAATAGACA TTTTGGGGTT 3360 

TTATAATGGG AATTTGTATA AAGCATTACT CTTTTTCAAT AAATTGTTTT TTAATTTAAA 3420 
AAAAGGAAAA AAAAAAAAAA AAA 

Seq ID NO: ,593 Protein sequence 
Protein Accession JT: AAD16433.1 

-1 ■ 11 21 31 41 51- - 

I III I 1 

MANAGLQLLG FILAFLGWIG AIVSTALPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 60 

QIQCKVFDSL LNLSSTLQAT RALMWGILL GVIAIFVATV GMKCMKCLED DEVQKMRMAV 120 

■ IGGAIFLLAG LAlLVATAMY GNRIVQEFYD PMTPVNARYE FGQAIiFTGWA AASLCLLGGA 180 

* LLCCSCPRKT' TSYPTPRPYP KPAESSGKDY V 
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Seq ID NO: 594 DNA sequence 

Nucleic Acid Accession 8 i NMJ)06180.1 

Coding sequence: 352.. 2820' 

1 " .11 21 31 |1 51 

CCCCCATTCG CATCTAACAA GGAATCTGCG CCCCAGAGAG TCCCGGACGC CGCCGGTCGG 60 

TGCCCGGCGC - GCCGGGCCAT GCAGCGACGG CCGCCGCGGA GCTCCGAGCA GCGGTAGCGC 120 

CCCCCTGTAA AGCGGTTCGG TATGCCGGGA CCACTGTGAA CCCTGCCGCC TGCCGGAACA ■ 18.0 

CTCTTCGCTC CGGACCAGCT CAGCCTCTGA TAAGCTGGAC TCGGCACGCC CGCAACAAGC 240 

ACCGAGGAGT TAAGAGAGCC. GCAAGCGCAG GGAAGGCCTC CCCGCACGGG TGGGGGAAAG 300 

CGGCCGGTGC AGCGCGGGGA- CAGGCACTCG • GGCTGGCACT GGCTGCTAGG GATGTCGTCC 360 

TGGATAAGGT GGCATGGACC CGCCATGGCG CGGCTCTGGG GCTTCTGCTG GCTGGTTGTG 420 

GGCTTCTGGA GGGCCGCTTT* CGCCTGTCCC ACGTCCTGCA AATGCAGTGC CTCTCGGATC 480 

TGGTGCAGCG ACCCTTCTCC, TGGCATCGTG GCATTTCCGA GATTGGAGCC TAACAGTGTA 540 

- GATCCTGAGA- ACATCACCGA AATTTTCATC GCAAACCAGA AAAGGTTAGA AATCATCAAC 600 
GAAGATGATG TTGAAGCTTA" TGTGGGACTG AG AAAT CTGA' CAATTGTGGA TTCTGGATTA 660 

"AAATTTGTGG CTCATAAAGC ATTTCTGAAA AACAGCAACC TGCAGCACAT CAATTTTACC 720 

CGAAACAAAC TGACGAGTTT GTCTAGGAAA CATTTCCGTC ACCTTGACTT GTCTGAACTG 780 . 

ATCCTGGTGG GCAATCCATT TACATGCTCC TGTGACATTA TGTGGATCAA GACTCTCCAA 840 

GAGGCTAAAT- CCAGTCCAGA CACTCAGGAT TTGTACTGCC TGAATGAAAG CAGCAAGAAT 900 

ATTCCCCTGG CAAACCTGCA GATACCCAAT TGTGGTTTGC CATCTGCAAA ■ TCTGGCCGCA .960 

CCTAACCTCA CTGTGGAGGA AGGAAAGTCT ATCACATTAT CCTGTAGTGT GGCAGGTGAT 1020 

CCGGTTCCTA'; ATATGT ATTG : GGATGTTGGT AACCTGGTTT. CCAAACATAT GAATGAAACA* 10B0 

AGCCACACAC: AGGGCTCCTT AAGGATAACT- AACATTTCAT CCGATGACAG TGGGAAGCAG - 1140 

• ATCTCTTGTG TGGCGGAAAA TCTTGTAGGA GAAGATCAAG' ATTCTGTCAA* CCTCACTGTG 1200 
CATTTTGCAC CAACTATCAC ATTTCTCGAA TCTCCAACCT- CAGACCACCA CTGGTGCATT 1260 
CCATTGACTG TGAAAGGCAA- CCCCAAACCA GCGCTTCAGT. GGTTCTATAA- CGGGGCAATA 1320 
TTGAATGAGT- ' CCAAATACAT CTGTACTAAA ATACATGTTA CCAATCACAC GGAGTACCAC 1380 
GGCTGCCTCC AGCTGGATAA TCCCACTCAC ATGAACAATG . GGGACTACAC TCTAATAGCC - 1440 
AAGAATGAGT • ATGGGAAGGAV- TG AGAAACAG ATTTCTGCTC ACTTCATGGG CTGGCCTGGA . 1500 
ATTGACGAtG GTGCAAACCC AAATTATCCT GATGTAATTT ATGAAGATTA TGGAACTGCA- 1560 
GCGAATGACA TCGGGGACAC CACGAACAGA AGTAATGAAA TCCCTTCCAC. AGACGTCACT 1620 
GATAAAACCG GTCGGGAACA TCTCTCGGTC TATGCTGTGG TGGTGATTGC GTCTGTGGTG 1680 
GGATTTTGCC TTTTGGTAAT GCTGTTTCTG CTTAAGTTGG CAAGACACTC CAAGTTTGGC 1740 
ATGAAAGGCC CAGCCTCCGT TATCAGCAAT GATGATGACT CTGCCAGCCC ACTCCATCAC 1800 
ATCTCGAATG GGAGTAACAC rCCATCTTCT TCGGAAGGTG GCCCAGATGC TGTCATTATT 1860 
GGAATGACCA AGATCCCTGT CATTGAAAAT CCCCAGTACT TTGGCATCAC CAACAGTCAG 1920 
CTCAAGCCAG ACACATTTGT TCAGCACATC AAGCGACATA ACATTGTTCT GAAAAGGGAG 1980 
CTAGGCGAAG GAGGCTTTGG AAAAGTGTTC CTAGCTGAAT GCTATAACCT CTGTCCTGAG 2040 
CAGGACAAGA- TCTTGGTGGC AGTGAAGACC CTGAAGGATG CCAGTGACAA TGCACGCAAG : 2100 
GACTTCCACC GTGAGGCCGA GCTCCTGACC AACCTCCAGC ATGAGCACAT CGTCAAGTTC 2160 
TATGGCGTCT GCGTGGAGGG CGACCCCCTC ATCATGGTCT TTGAGTACAT GAAGCATGGG 2220 
GACCTCAACA AGTTCCTCAG GGCACACGGC CCTGATGCCG TGCTGATGGC TGAGGGCAAC 2280 
CCGCCCACGG AACTGACGCA GTCGCAGATG CTGCATATAG CCCAGCAGAT CGCCGCGGGC 2340 
ATGGTCTACC TGGCGTCCCA GCACTTCGTG CACCGCGATT TGGCCACCAG GAACTGCCTG 2400 
GTCGGGGAGA ACTTGCTGGT GAAAATCGGG GACTTTGGGA- TGTCCCGGGA CGTGTACAGC 2460 
ACTGAGTACT ACAGGGTCGG TGGCCACACA ATGCTGCCCA TTCGCTGGAT GCCTCCAGAG 2520 
AGCATCATGT ACAGGAAATT- CACGACGGAA AGCGACGTCT. GGAGCCTGGG GGTCGTGTTG 25B0 . 
TGGGAGATTT TCACCTATGG CAAACAGCCC TGGTACCAGC TGTCAAACAA TGAGGTGATA 2640 
GAGTGTATCA CTCAGGGCCG AGTCCTGCAG. CGACCCCGCA CGTGCCCCCA GGAGGTGTAT 2700 
GAGCTGATGC TGGGGTGCTG. GCAGCGAGAG CCCCACATGA GGAAGAACAT CAAGGGCATC 2760 
CATACCCTCC TTCAGAACTT GGCCAAGGCA TCTCCGGTCT- ACCTGGACAT TCTAGGCTAG- 2820 
GGCCCTTTTC CCCAGACCGA TCCTTCCCAA ■ CGTACTCCTC AGACGGGCTG AGAGGATGAA 2880 

* CATCTTTTAAr CTGCCGCTGG: AGGCCACCAA" GCTGCTCTCC . TTCACTCTGA CAGTATTAAC ■• 2940 
ATCAAAGACT CCGAGAAGCT CTCGAGGGAA GCAGTGTGTA CTTCTTCATC CAT AGACACA 3000 
GTATTGACTT CTTTTTGGCA TTATCTCTTT -CTCTCTTTCC ATCTCCCTTG GTTGTTCCTT- 3060 
TTTCTTTTTT TAAATTTTCT. TTTTCTTCTT TTTTTTCGTC TTCCCTGCTT CACGATTCTT 3120 

- ACCCTTTCTT" TTGAATCAAT CTGGCTTCTG CATTACTATT- AACTCTGCAT AGACAAAGGC, 3180 
CTTAACAAAC GTAATTTGTT, ATATCAGCAG ACACTCCAGT TTGCCCACCA CAACTAACAA 3240 
TGCCTTGTTG TATTCCTGCC TTTGATGTGG ATGAAAAAAA GGGAAAACAA ATATTTCACT 33 00 
TAAACTTTGT CACTTCTGCT GTACAGATAT CGAGAGTTTC TATGGATTCA CTTCTATTTA 3360 
TTTATTATTA TTACTGTTCT TATTGTTTTT GGATGGCTTA AGCCTGTGTA TAAAAAAGAA 3420 
AACTTGTGTT CAATCTGTGA - AGCCTTTATC TATGGGAGAT TAAAACCAGA GAGAAAGAAG 3480 
ATTTATTATG AACCGCAATA TGGGAGGAAC AAAGACAACC ACTGGGATCA GCTGGTGTCA 3540 
GTCCCTACTT AGGAAATACT CAGCAACTGT TAGCTGGGAA GAATGTATTC GGCACCTTCC 3600 
CCTGAGGACC TTTCTGAGGA GTAAAAAGAC TACTGGCCTC TGTGCCATGG ATGATTCTTT 3 660 
TCCCATCACC AGAAATGATA GCGTGCAGTA GAGAGCAAAG ATGGCTT 

Seq ID NO 1 : 595 Protein sequence 
Protein Accession # : - NP_006171 11 ■' 

. 1. , . 11 - 21 -. 31 j 1 | X ' * ' 

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFYAHKA FLKNSNLQHI 120 

NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN- LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM . 240 

NETSHTQGSL- RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAIliNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 

' DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS KFGMKGPASV ISNDDDSASP 480. 

LHHISNGSNT PSSSEGGPDA VIIGMTKIPV IENPQYFGIT N5QLKPDTFV QHIKRHNIVL 540 

KRELG2GAFG KVFLAECYNL CPEQDKILVA VKTLKDASDM ARKDFHREAE LLTNLQHEHI 600 

VKFYGVCVEG DPLIMVFEYM KHGDLNKFLR AHGPDAVLMA EGNPPTELTQ SQMLHIAQQI 660 

AAGMVYLASQ HFVHRDLATR NCLVGENLLV KIGDFGMSRD VYSTDYYRVG GHTMLPIRWM 720 
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PPESIKYRKF TTSSDVWSLG WLWEIFTYG XQPWYQLSNN EVIECITQGR VLQRPRTCPQ 780 
EVYELMLGCW QREPHMRKNI KGIHTLLQNL AKASPVYIDI LG . • . ' ; ..» . . - 

Seq ID NO: 596 DNA sequence 
Nucleic Acid Accession #: AF410899 
Coding sequence: 483.-2999 

1 11 . ' 21 - j 1 I 1 |* : 

' GGGAGCAGGA GCCTCGCTGG CTGCTTCGCT CGCGCTCTAC GCGCTCAGTC CCCGGCGGTA - 60 

"gcaggagcct GGACCCAGGC GCCGGCGGCG GGCGTGAGGC GCCGGAGCCC GGCCTCGAGG 120 

TGCATACCGG ACCCCCATTC GCATCTAACA AGGAATCTGC GCCCCAGAGA GTCCC6GACG, 180 

CCGCCGGTOG GTGCCCGGCG CGCCGGGCCA TGCAGCGACG: GCCGCCGCGG. AGCTCCGAGC 240 

AGCGGTAGCG CCCCCCTGTA AAGCGGTTCG CTATGCCGGG ACCACTGTGA ACCCTGCCGC. 300 

CTGCGGGAAC ACTCTTCGCT CCGGACCAGC TCAGCCTCTG ATAAGCTGGA CTCGGCACGC 360 

CCGCAACAAG. CACCGAGGAG TTAAGAGAGC CGCAAGCGCA GGGAAGGCCT CCCCGCACGG 420 

GTGGGGGAAA- GCGGCCGGTG- CAGCGCGGGG ACAGGCACTC GGGCTGGCAC TGGCTGCTAG- 480 

GGATGTCGTC. CTGGATAAGG- TGGCATGGAC CCGCCATGGC GCGGCTCTGG GGCTTCTGCT; 540 

• GGCTGGTTGT GGGCTTCTGG AGGGCCGCTT, TCGCCTGTCC CACGTCCTGC AAATGCAGTG 600 
CCTCTCGGAT CTGGTGCAGC GACCCTTCTC CTGGCATCGT GGCATTTCCG AGATTGGAGC - 660 
CTAACAGTGT AGATCCTGAG~ AACATCACCG AAATTTTCAT CGCAAACCAG AAAAGGrTAG 720 
AAATCATCAA CGAAGATGAT GTTGAAGCTT ATGTGGGACT GAGAAATCTG ACAATTGTGG 780 
ATTCTGGATT. AAAATTTGTG GCTCATAAAG CATTTCTGAA AAACAGCAAC CTGCAGCACA 840 
TCAATTTTAC CCGAAACAAA CTGACGAGTT, TGTCTAGGAA ACATTTCCGT- CACCTTGACT 900 
TOTCTGAACT GATCCTGGTG GGCAATCCAT- TTACATGCTC CTGTGACATT- ATGTGGATCA 960 
A^ACTCTCCA AGAGGCTAAA- TCCAGTCCAG • ACACTCAGGA TTTGTACTGC CTGAATGAAA- 1020- 
GCAGCAAGAA TATTCCCCTG GCAAACCTGC AGATACCCAA TTGTGGTTTG CCATCTGCAA 1080 
ATCTGGCCGC ACCTAACCTC. ACTGTGGAGG AAGGAAAGTC TATCACATTA TCCTGTAGTG 1140 
TGGCAGGTGA TCCGGTTCCT- AATATGTATT GGGATGTTGG TAACCTGGTT TCCAAACATA 1200: 
TGAATGAAAG AAGCCACACA OVGGGCTCCT TAAGGATAAC TAACATTTCA TCCGATGACA 1260 
GTGGGAAGCA GATCTCTTGT GTGGCGGAAA ATCTTGTAGG AGAAGATCAA- GATTCTGTCA 1320 
ACCTCACTGT GCATTTTGCA CCAACTATCA CATTTCTCGA ATCTCCAACC TCAGACCACC 1380- 
ACTGGTGCAT TCCATTCACT GTGAAAGGCA ACCCCAAACC AGCGCTTCAG TGGTTCTATA 1440 

• ACGGGGCAAT ATTGAATGAG TCCAAATACA" TCTGTACTAA AATACATGTT ACCAATCACA 1500 
Sx££. CGGCTGCCTC CAGCTGGATA- ATCCCACTCA CATGAACAAT. ^GACTACA 1560 
CTCTAATAGC CAAGAATGAG TATGGGAAGG ATGAGAAACA GATTTCTGCT CACTTCATGG - 1620 
GCTGGCCTGG AATTGACGAT GGTGCAAACC CAAATTATCC TGATGTAATT ™GAAGATT 1680 
ATGGAACTGC AGCGAATGAC ATCGGGGACA CCACGAACAG AAGTAATGAA ATCCCTTCCA 1740 
CAGACGTCAC TGATAAAACC GGTCGGGAAC ATCTCTCGGT CTATGCTGTG GTGGTGATTG 1800 

GGGATTTTGC CTTTTGGTAA TGCTGTTTCT. GCTTAAGTTG GCAAGACACT- 1860 

CCAAGTTTGG CATGAAAG AT TTCTCATGGT . TTGGATTTGG GAAAGTAAAA TCAAGACAAG 1920 

GTG^GGCCC AGCCTCCGTT ATCAGCAATG ATGATGACTC TGCCAGCCCA. CTCCATCACA 1980 

tSc^S SKSSt. CCATCTTCTT CGGAAGGTGG CCCAGATGCT GTCATTATTG 2040 

GAATGACCAA GATCCCTGTC ATTGAAAATC CCCAGTACTT- TGGCATCACC AACAGTCAGC 2100 

TCAAGCCAGA CACATTTGTT CAGCACATCA AGCGACATAA CATTGTTCTG AAAAGGGAGC 2160 

T^GCGAAGG AGCGTTTGGA AAAGTGTTCC TAGCTGAATG CTATAACCTC TGTCCTGAGC 2220 

IgSgIS CTTGGTGGCA GTGAAGACCC TGAAGGATGC CAGTGACAAT ^CGCAAGG 2280 

ACTTCCACCG TGAGGCCGAG CTCCTGACCA ACCTCCAGCA TGAGCACATC GTCAAGTTCT 2340 

AtScg?c?G CGTGGAGGGC ' GACCCCCTCA' TCATGGTCTT TGAGTACATG AAGCATGGGG 2400 

ACCTCAACAA GTTCCTCAGG GCACACGGCC CTGATGCCGT GCTGATGGCT GAGGGCAACC 2460 

CGCCCACGGA ACTGACGCAG TCGCAGATGC TGCATATAGC CCAGCAGATC GCCGCGGGCA 2520 

SSSSSS ScCCAG CACTTCGTGC ACCG CGATTT GGCCACCAGG AACTGCCTGG: 2580 

TCGGGGAGAA CTTGCTGGTG AAAATCGGGG ACTTTGGGAT GTCCCGGGAG GTGTACAGCA 2640 

CTGACTACTA CAGGGTCGGT, GGCCACACAA TGCTGCCCAT TCGCTGGATG CCTCCAGAGA 2700. 

GCATCATGTA CAGGAAATTC ACGAOGGAAA GCGACGTCTG GAGCCTGGGG GTCGTGTTGT 2760 

G^GA^S CACCTATGGC AAACAGCCCT- GGTACCAGCT GTCAAACAAT QAGGTGATAG 2820 

AGTGTATCAC TCAGGGCCGA GTCCTGCAGC GACCCCGCAC GTGCCCCCAG GAGGTGTATG 2880 

AGCTGATGCT GGGGTGCTGG CAGCGAGAGC CCCACATGAG GAAGAACATC AAGGGCATCC .2940. 

iSES. TCAGAACTTG GCCAAGGCAT; CTCCGGTCTA CCTGGACATT- CTAGGCTAGG . 3000 

GCCCTTTTCC CCAGACCGAT CCTTCCCAAC GTACTCCTCA GACGGGCTGA GAGGATGAAG 3060 

ATCTTTTAAC TGCCGCTGGA GGCCACCAAG CTGCTCTCCT TCACTCTGAC AGTATTAACA 3120 

TcSaGACTC CGAGAAGCTC TCGAGGGAAG CAGTGTGTAC TTCTTCATCC ATAGACACAG 3180 

TATTGACTTC TTTTTGGCAT TATCTCTTTC TCTCTTTCCA TCTCCCTTGG TTGTTCCTTT 3240 

TTCTTTTTTT AAATTTTCTT- TTTCTTCTTT TTTTTCGTCT TCCCTGCTTC ACGATTCTTA 3300 

rrCTTTCTTT TGAATCAATC TGGCTTCTGC ATTACTATTA ACTCTGCATA GACAAAGGCC 3360 

TTAACAAAOG TAATTTGTTA TATCAGCAGA CACTCCAGTT TGCCCACCAC AACTAACAAT. 3420 

GCCTTGTTGT ATTCCTGCCT TTGATGTGGA TGAAAAAAAG GGAAAACAAA TATTTCACTT 3480 

AAACTTTGTC ACTTCTGCTG TACAGATATC GAGAGTTTCT ATGGATTCAC TTCTATTTAT 3540 

TTATTATTAT TACTGTTCTT ATTGTTTTTG GATGGCTTAA GCCTGTGTAT AAAAAAGAAA 3600 

ACTTGTGTTC AATCTGTGAA GCCTTTATCT ATGGGAGATT AAAACCAGAG AGAAAGAAGA 3660 

TTTATTATGA ACCGCAATAT GGGAGGAACA AAGACAACCA CTGGGATCAG CTGGTGTCAG 3720 

TCCCTACTTA GGAAATACTC AGCAACTGTT AGCTGGGAAG AATGTATTCG GCACCTTCCC 3780 

SS TTCTCAGGAG TAAAAAGACT ACTGGCCTCT CTGCCATGGA ^ATTCTTTT 3840 

CCCATCACCA GAAATGATAG CGTGCAGTAG AGAGCAAAGA TGGCTTCCGT GAGACACAAG 3900 

ATGGCGCATA GTGTGCTCGG ACACAGTTTT GTCTTCGTAG GTTGTGATGA TAGCACTGGT 3960 

TTGTTTCTCA. AGCGCTATCC ACAGAACCTT TGTCAACTTC AGTTGAAAAG AGGTGGATTC 4020 
ATGTCCAGAG CTCATTTCGG GGTCAGGTGG GAAAGCC . 

Seq ID NO:' 597 Protein sequence 
protein Accession #.: AAL67965.1 

.1 ' 11 21 . 31 41 SI 

llSSWIRWHGP AMARLWGFCW- LWGFWRAAP ACPTSCXCSA SRIWCSDPSP GIVAFPRLEP 60 

' NSVDPENITE IFIANQKRLE- IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

^F?kN^TSL SRKHFRHLDL SELILVGNPF TCSCDIKWIK TLQEAKSSPD TQDIiYCLNES 180 

SSSS SsAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 
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NETSHTQGSIi " RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIP^TVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS KFGMKDFSWF GFGKVKSRQG 480 

VGPASVISND DDSASPLHHI SNGSNTPSSS EGGPDAVIIG MTKIPVIENP QYFGITNSQL 540 

KPDTFVQHIK RKNIVLKRBL GEGAFGKVFL AECYNLCPEQ DKILVAVKTL KDASDNARKD 600 

FEREAELLTN. LQHEHIVKFY GVCVEGDPLI MVFEYMKHGD LNKFLRAHGP DAVLMAEGNP" 660 

PTELTQSQML HIAQQIAAGM VYLASQHFVH RDLATRNCLV GENLLVKIGD FGMSRDVYST 720 

DYYRVGGHTM LPIRWMPPES IMYRKFTTES. DVWSLGWLW EIFTYGKQPW YQLSNNEVIE 780 
' CITQGRVLQR PRTCPQEVYE LMLGCWQREP HMRKNIKGIH TLLQNLAKAS PVYLDILG 

Seq ID NO: 598 DNA sequence 
Nucleic Acid Accession #: AB052906 - 
Coding sequence: 74.. 814 

' 1 • U 21 . . 31 41 I— ' 

AAAACCTTGA GGTGATTCAT CTTCCAGGCT CTCCTTCCAT CAAGTCTCTC CTCCCTAGCG 60 

. CTCTGGGTCC TTAATGGCAG CAGCCGCCGC TACCAAGATC CTTCTGTGCC TCCCGCTTCT 120 

• GCTCCTGCTG TCCGGCTGGT COOGGGCTGG- GCGAGCOGAC. CCTCACTCTC TTTGCTATGA ISO 

CATCACCGTC ATCCCTAAGT TCAGACCTGG ACCACGGTGG TGTGCGGTTC AAGGCCAGGT 240 

GGATGAAAAG ACTTTTCTTC ACT ATGACTG TGGCAACAAG ACAGTCACAC CTGTCAGTCC 300 

CCTGGGGAAG AAACTAAATG TCACAACGGC CTGGAAAGCA CAGAACCCAG TACTGAGAGA- 360 

GGTGGTGGAC ATACTTACAG ' AGCAACTGCG TGACATTCAG CTGGAGAATT- ACACACCCAA 42 0 

GGAACCCCTC ACCCTGCAGG CCAGGATGTC TTGTGAGCAG. AAAGCTGAAG GACACAGCAG 480 

TGGATCTTGG CAGTTCAGTT TCGATGGGCA 'GATCTTCCTC CTCTTTGACT CAGAGAAGAG'. 540 

AATGTGGACA ACGGTTCATC CTGGAGCCAG AAAGATGAAA GAAAAGTGGG AGAATGACAA 600 

GGTTGTGGCC ATGTCCTTCC ATTACTTCTC AATGGGAGAC TGTATAGGAT GGCTTGAGGA 660 

CTTCTTGATG GGCATGGACA GCACCCTGGA- GCCAAGTGCA GGAGCACCAC TCGCCATGTC 720 

CTCAGGCACA ACCCAACTCA GGGCCACAGC CACCACCCTC ATCCTTTGCT GCCTCCTCAT: 780 

CATCCTCCCC TGCTTCATCC TCCCTGGCAT CTGAGGAGAG TCCTTTAGAG TGACAGGTTA 840 

AAGCTGATAC CAAAAGGCTC" CTGTGAGCAC GGTCTTGATC AAACTCGCOC" TTCTGTCTGG 900 

' CCAGCTGCCC ACGACCTACG • GTGTATGTCC: AGTGGCCTCC AGCAGAT CAT GATGACATCA 960 

TGGACCCAAT AGCTCATTCA CTGCCTTGAT. TCCTTTTGCC AACAATTTTA CCAGCAGTTA 1020 

TACCTAACAT ATTATGCAAT TTTCTCTTGG TGCTACCTGA" TGGAATTCCT GCACTTAAAG 1080. 

TTCTGGCTGA CTAAACAAGA TATATCATTT TCTTTCTTCT. CTTTTTGTTT GGAAAATCAA 1140 

GTACTTCTTT GAATGATGAT CTCTTTCTTG CAAATGATAT' TGTCAGTAAA ATAATCACGT 1200 

• TAGACTTCAG ACCTCTGGGG ATTCTTTCCG TGTCCTGAAA GAGAATTTTT AAATTATTTA 1260 

ATAAGAAAAA - ATTTATATTA ATGATTGTTT. CCTTTAGTAA TTTATTGTTC TGTACTGATA 1320 
TTTAAATAAA GAGTTCTATT TCCCAAAAAA. AAAAAAAAAA AA 

Seq ID NO: 599 Protein sequence 
Protein Accession #: BAB61048.1 



11 



21 31 41 S-L 



MAAAAATKIL LCLPLLLLLS GWSRAGRADP HSLCYDITVI PKFRPGPRWC AVGGOVDEKT 60 

FLHYDCGNKT VTPVSPLGKK LNVTTAWKAQ NPVLREWDI- LTEQLRDIQIi ENYTPKEPLT 120 

LQARMSCEQK AEGHSSGSWQ FSFDGQIFLL FDSEKRMWTT VHPGARKMKE KWENDKWAM 180 

SFHYFSMGDC IGWLEDFIiMG MDS TLEPSAG APLAMSSGTT QLRATATTLI LCCLLIILPC 240 
PILPGI 

Seq ID NO: 600 DNA sequence 

Nucleic Acid Accession #: NM_0O1898.1 

Coding sequence: 57.. 482 

i ii |i » i 1 r 1 

GGCTCTCACC CTCCTCTCCT GCAGCTCCAG CTTTGTGCTC TGCCTCTGAG GAGACCATGG 60 

CCCAGTATCT GAGTACCCTG CTGCTCCTGC TGGCCACCCT AGCTGTGGCC CTGGCCTGGA 120 

GCCCCAAGGA GGAGGATAGG ATAATCCCGG GTGGCATCTA TAACGCAGAC CTCAATGATG 180 

AGTGGGTACA GCGTGCCCTT CACTTCGCCA TCAGCGAGTA TAACAAGGCC ACCAAAGATG 240 

ACTACTACAG ACGTCCGCTG • CGGGT ACTAA GAGCCAGGCA ACAGACCGTT GGGGGGGTGA 300 

ATTACTTCTT CGACGTAGAG GTGGGCCGCA' CCATATGTAC CAAGTCCCAG CCCAACTTGG. 360 

ACACCTGTGC CTTCCATGAA CAGCCAGAAC TGCAGAAGAA ACAGTTGTGC TCTTTCGAGA 420 

TCTACGAAGT TCCCTGGGAG AACAGAAGGT CCCTGGTGAA ATCCAGGTGT CAAGAATCCT 480 

AGGGATCTGT GCCAGGCCAT TCGCACCAGC CACCACCCAC TCCCACCCCC TGTAGTGCTC 540 

CCACCCCTGG ACTGGTGGCC CCCACCCTGC GGGAGGCCTC CCCATGTGCC TGCGCCAAGA 600 

GACAGACAGA GAAGGCTGCA GGAGTCCTTT GTTGCTCAGC AGGGCGCTCT GCCCTCCCIC 660 

CTTCCTTCTT GCTTCTAATA GCCCTGGTAC ATGGTACACA CCCCCCCACC TCCTGCAATT 720 
AAACAGTAGC ATCGCC 



Seq. ID NO: 601 Protein sequence 
Protein Accession 8: NPJJ01889.1 

!- ' . li ' ' 21 31 41 51 . 

MAOYLSTLLL LLATLAVALA WSPKEEDRII PGGIYNADLN DEWVQRALHF AISEYNKATK - 60 
DDYYRRPLRV LRARQQTVGG VNYFFDVEVG RTICTKSQPN LDTCAFHEQP BLQKXQLCSF .120 
EIYEVPWENR. RSLVKSRCQE S 

Seq ID NO: 602 DNA sequence 

Nucleic Acid Accession §: NM_003976.2 

Coding sequence: 299.961"- 
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CTCTGAGCTT CTCTGAGCCT TGTTTGCTCA TCTGGAAAAA GGGGATTAAA CCATTTACCT 60 

CATGGAGT7G TGAAAGAATA GCTGCAAAGC ACCTAACACA TAGTAAGGTT CCCAGTGCAG 120 

CTACTTCTGC TGGGTTGAGT CTAGCTGTGT AGGCCCCTTG TTCCTCACCT GGAGAAACTG 180 

GGGTGGCAGG CCGGTCCCCC ACAAAAGATA ACTCATCTCT TAATTTGCAA GCTGCCTCAA 240 

CAGGAGGGTG GGGGAACAGC TCAACAATGG CTGATGGGCG CTCCTGGTGT TGATAGAGAT 300 

GGAACTTGGA- CTTGGAGGCC TCTCCACGCT GTCCCACTGC CCCTGGCCTA GGCGGCAGCC 360 

TGCCCTGTGG CCCACCCTGG CCGCTCTGGC TCTGCTGAGC AGGGTCGCAG AGGCCTCCCT 420 

GGGOTCCGCG ' CCCCGCAGCC CTGCCCCCCG CGAAGGCCCC. CCGCCTGTCC TGGCGTCCCC 4 80 

' CGCCGGCCAC CTGCCGGGGG GACGCACGGC CCGCTGGTGC AGTGGAAGAG CCCGGCGGCC 540 

GCCGCCGCAG CCTTCTCGGC CCGCGCCCCC GCCGCCTGCA CCCCCATCTG CTCTTCCCCG 600 

CGGGGGCCGC GCGGCGCGGG CTGGGGGCCC GGGCAGCCGC GCTCGGGCAG CGGGGGCGCG 660 

GGGCTGCCGC CTGCGCTCGC AGCTGGTGCC GGTGCGCGCG CTCGGCCTGG, GCCACCGCTC 720 

CGACGAGCTG GTGOGTTTCC GCTTCTGCAG CGGCTCCTGC CGCCGCGCGC GCTCTCCACA '■ 780 

CGACCTCAGC CTGGCCAGCC TACTGGGCGC CGGGGCCCTG OGACCGCCCC CGGGCTCCCG 840 

GCCCGTCAGC CAGCCCTGCT GCCGACCCAC GCGCTACGAA GCGGTCTCCT TCATGGACGT 900 

CAACAGCACC TGGAGAACCG. TGGACCGCCT CTCCGCCACC GCCTGCGGCT GCCTGGGCTG 960 

AGGGCTCGCT\ CCAGGGCTTT GCAGACTGGA CCCTTACCGG TGGCTCTTCC TGCCTGGGAC - 1020 

CCTCCGGCAG AGTCCCACTA GCCAGCGGCC: TCAGCCAGGG ACGAAGGCCT CAAAGCTGAG 1080 

AGGCCCCTAC CGGTGGGTGA TGGATATCAT CCCCGAACAG GTGAAGGGAC AACTGACTAG 1140 

CAGCCCCAGA GCCCTCACCC TGCGGATCCC AGCCTAAAAG ACACCAGAGA CCTCAGCTAT 1200 

GGAGCCCTTC GGACCCACTT CTCAGAGACT CTGGCACTGG" CCAGGCCTCG. AACCTGGGAC 1260 

CCCTCCrCTG ATGAACACTA CAGTGGCTGA GGCATCAGCC CCCGCCCAGG* CCCTGTAGGG- 1320 

ACAGCATTTG AAGGACACAT- ATTGCAGTTG CTTGGTTGAA AGTGCCTGTG CTGGAACTGG 13 80 
CCTGTACTCA CTCATGGGAG" CTGGCCCC 

Seq ID NO: 603 Protein sequence 

Protein Accession *h NP_00 3 9 67.1 . . 

I 11 21 31 41 .5 1 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60 

PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120 

RGCRIiRSQLV: PVRALGLGHR - SDELVRFRFC SGSCRRARSP HDLSIASLLG AGALRPPPGS 180 
RPVSQPCCRP'TRYEAVSPMD VNSTWRTVDR LSATACGCLG 

Seq ID NO: 604 DNA sequence 
Nucleic Acid Accession #: NM_057091.1 
Coding sequence: 783.. 1445 

1 ii 21 31 41 51 

i | j | I I 

ACTGGCCGCT GAGAGAAGAA TCGGGTGGAG CAGAGAGCAG CTGCTGCAGG GCAGACAGCC 60 

GGACCCCCAA ATCTGCACGT ACCAGCAGTC AGCCGCCCCA CGCAGGGACC GGCTTACCCC 120 

TCGCTCCCCG CCCTCACTCA CTTTCTCCCG CCCTCGGCCC GGCCTCCCAG CTCTCTACTT 180 

CGCGTGtCTA CAAACTCAAC TCCCGGTTTC CGTGCCTCTC CACCGCTCGA GTTCTCTACT 240 

CTCCATATCC GAGGGGCCCC TCCCAGCATC TACCCCCCTC CCAACCTCGG GGGACCTAGC 300 

CAAGCTAGGG GGGACTGGAT CCGACGGGTG GAGCAGCCAG GTGAGCCCCG AAAGGTGGGG 360 

CGGGGCAGGG GCGCTCCCAG CCCCACCCCG. GGATCTGGTG ACGCTGGGGC TGGAATTTGA 420 

CACCGGACGG CTGCGGCGGC GGGCAGGAGG CTGCTGAGGG ATGGAGTTGG GCCCGGCCCC 480 

CAGACAAGGC" CCGGGGGCTC CGCCAGCAGC AGGTCCCTCG GGCCCCAGCC CTCGCTGCCA- 540 

CCCGGGCCTG. GAGCCCCACA CCCGAGGGTG CAGACTGGCT GCCAAGGCCA CACTTTTGGC 600 

TAAAAGAGGC ACTGCCAGGT GTACAGTCCT GGGCATGCGC TGTTTGAGCT TCGGGGGAGA 660 

GCCCAGCACT GGTCCCCGGA AAGGTGCCTA GAAGAACAAG GTGCAGGACC CCGTGCTGCC 720 

TCAACAGGAG GGTGGGGGAA CAGCTCAACA ATGGCTGATG . GGCGCTCCTG GTGTTGATAG 780 

AGATGGAACT TGGACTTGGA GGCCTCTCGA CGCTGTCCCA CTGCCCCTGG CCTAGGCGGC 840 

AGCCTGCCCT GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT 900 

CCCTGGGCTC CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT 960 

CCCCCGCCGG CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC 1020 

GGCCGCCGCC GCAGCCTTCT. CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC 1080 

CCCGCGGGGG CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG 1140 

CGCGGGGCTG CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC 1200 

GCTCCGACGA GCTCGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC 1260 

CACACGACCT CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT 1320 

CCCGGCCCGT CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG 1380. 

ACGTCAACAG CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG' 1440 

GCTGAGGGCT CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG 1500 

GGACCCTCCC GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC 1560 

TGAGAGGCCC CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA 1620 

CTAGCAGCCC CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG 1680 

CTATGGAGCC CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG 1740 

GGACCCCTCC TCTGATGAAC ACTACAGTGG CTGAGGCATC PJSCCCCCGCC CAGGCCCTGT 1800 

AGGGACAGCA TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA I860 
CTGGCCTGTA CTCACTCATG GGAGCTGGCC CC 

Seq ID NO: 60S Protein sequence 
Protein Accession #: NP__003 967.1 

1 ■ 11 21 ' ' 31 41 51 

I I ' ■ - I I I 1 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS -60 

PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120 

RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 

Seq ID NO: 606 DNA sequence 

Nucleic Acid Accession 8: NM_057160.1 
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Coding sequence: 1..714' 



PCT/US02/12476 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



ATGCCCGGCC 
CACCTGGGTG 
TGGCCCACCC 
GCGCCCCGCA 
CACCTGCCGG 
CAGCCTTCTC 
CGCGCGGCGC 
CGCCTGCGCT 
CTGGTGCGTT 
AGCCTGGCCA; 
AGCCAGCCCT 
ACCTGGAGAA 
GCTCCAGGGC 
CAGAGTCCCA 
TACCGGTGGG 
AGAGCCCTCA 
TTCGGACCCA V 
CTGATGAACA 
TTGAAGGACA 
TCACTCATGG 



11 
I 

TGATCTCAGC 
CCCTCTTTCT 
TGGCCGCTCT 
GCCCTGCCCC 
GGGGACGCAC 
GGCCCGCGCC 
GGGCTGGGGG 
CGCAGCTGGT 
TCCGCTTCTG 
GCCTACTGGG 
GCTGCCGACC 
CCGTGGACCG 
TTTGCAGACT 
CTAGCCAGCG 
TGATGGATAT 
CCCTGCGGAT. 
CTTCTCACAG 
CTACAGTGGC 
CATATTGCAG 
GAGCTGGCCC 



21 
I 

CCGAGGACAG 
CCCTGAGGCT 
GGCTCTGCTG 
CCGCGAAGGC 
GGCCCGCTGG 
CCCGCCGCCT 
CCCGGGCAGC 
GCCGGTGCGC 
CAGCGGCTCC. 
CGCCGGGGCC 
CACGCGCTAC 
CCTCTCCGCC 
GGACCCTTAC 
GCCTCAGCCA 
CATCCCCGAA 
CCCAGCCTAA 
ACTCTGGCAC 
TGAGGCATCA 
TTGCTTGGTT' 
C 



31 
I 

CCCCTCCTTG 
CCACTTGGTC 
AGCAGCGTCG 
CCCCCGCCTG 
TGCAGTGGAA 
GCACCCCCAT 
CGCGCTCGGG 
GCGCTCGGCC 
TGCCGCCGCG 
CTGCGACCGC 
GAAGCGGTCT 
ACCGCCTGCG 
CGGTGGCTCT 
GGGACGAAGG 
CAGGTGAAGG. 
AAGACACCAG. 
TGGCCAGGCC 
GCCCCCGCCC 
GAAAGTGCCT 



41 
i 

AGGTCCTTCC 
TCTCCGCGCA 
CAGAGGCCTC 
TCCTGGCGTC 
GAGCCCGGOG 
CTGCTCTTCC 
CAGCGGGGGC 
TGGGCCACCG 
CGCGCTCTCC 
CCCCGGGCTC 
CCTTCATGGA 
GCTGCCTGGG 
TCCTGCCTGG 
CCTCAAAGCT 
GACAACTGAC 
AGACCTCAGC 
TCGAACCTGG. 
AGGCCCTGTA 
GTGCTGGAAC 



51 • 
I 

TCCCCAAGCC 

GCCTGCCCTG 

CCTGGGCTCC 

CCCCGCCGGC • 

GCOGCCGCCG 

CCGCGGGGGC 

GCGGGGCTGC 

CTCCGACGAG 

ACACGACCTC. 

CCGGCCCGTC 

CGTCAACAGC 

CTGAGGGCTC 

GACCCTCCCG. 

GAGAGGCCCC 

TAGCAGCCCC 

TATGGAGCCC 

GACCCCTCCT. 

GGGACAGCAT. 

TGGCCTGTAC 



Seq ID NOs 607 Protein sequence 
Protein Accession NP_476501.1 
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MPGLISARGQ PLLEVLPPQA HLGALFLPEA PLGLSAQPAL WPTLAALALL SSVAEASLGS 
APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP- APPSALPRGG 
.RAARAGGPGS RARAAGARGC RLRSQLVPVR' ALGLGBRSDE LVRFRFCSGS CRRARSPHDL 
SLASLLGAGA LRPPPGSRPV- SQPCCRPTRY EAVSFMDVNS TWRTVDRLSA TACGCLG 

Seq ID HO: 60S DNA sequence 

Nucleic Acid Accession #: NM_0570 90.1 

Coding sequence: 2 9.. 715 



1 
I 

CTGATGGGCG 
GTCCCACTGC 
GTGGCCCACC 
CGCGCCCCGC 
CCACCTGCCG 
GCAGCCTTCT 
CCGCGCGGCG 
CCGCCTGCGC 
GCTGGTGCGT- 
CAGCCTGGCC 
CAGCCAGCCC 
CACCTGGAGA 
CGCTCCAGGG 
GCAGAGTCCC 
CTACCGGTGG 
CAGAGCCCTC. 
CTTCGGACCC 
TCTGATGAAC 
TTTGAAGGAC 
. CTCACTCATG 



11 
I 

CTCCTGGTGT 
CCCTGGCCTA 
CTGGCCGCTC 
AGCCCTGCCC 
GGGGGACGCA 
CGGCCCGCGC 
CGGGCTGGGG 
TCGCAGCTGG 
TTCCGCTTCT 
AGCCTACTGG 
TGCTGCCGAC 
ACCGTGGACC. 
CTTTGCAGAC 
ACTAGCCAGC 
GTGATGGATA 
ACCCTGCGGA 
ACTTCTCACA 
ACTACAGTGG 
ACATATTGCA 
GGAGCTGGCC 



21 
I 

TGATAGAGAT 
CGCGGCAGGC 
TGGCTCTGCT 
CCCGCGAAGG 
CGGCCCGCTG 
CCCCGCCGCC 
GCCCGGGCAG 
TGCCGGTGCG: 
GCAGCGGCTC 
GCGCCGGGGC 
CCACGCGCTA- 
GCCTCTCCGC 
TGGACCCTTA 
GGCCTCAGCC- 
TCATCCCCGA 
TCCCAGCCTA' 
GACTCTGGCA 
CTGAGGCATC 
GTTGCTTGGT 
CC 



31 
I 

GGAACTTGGA 
TCCACTTGGT 
GAGCAGCGTC 
CCCCCCGCCT 
GTGCAGTGGA 
TGCACCCCCA 
CCGCGCTCGG 
CGCGCTCGGC 
CTGCCGCCGC 
CCTGCGACCG 
CGAAGCGGTC 
CACCGCCTGC 
CCGGTGGCTC 
AGGGACGAAG 
ACAGGTGAAG 
AAAGACACCA 
CTGGCCAGGC 
AGCCCCCGCC 
TGAAAGTGCC 



41 

I 

CTTGGAGGCC 
CTCTCCGCGC 
GCAGAGGCCT 
GTCCTGGCGT' 
AGAGCCCGGC 
TCTGCTCTTC 
GCAGQGGGGG 
CTGGGCCACC 
GCGCGCTCTC 
CCCCCGGGCT 
TCCTTCATGG 
GGCTGCCTGG 
TTCCTGCCTG 
GCCTCAAAGC 
GGACAACTGA 
GAGACCTCAG 
CTCGAACCTG 
CAGGCCCTGT 
TGTGCTGGAA 



51 
I 

TCTCCACGCT 
AGCCTGCCCT 
CCCTGGGCTC 
CCCCCGCCGG 
GGCCGCCGCC 
CCCGCGGGGG 
CGCGGGGCTG 
GCTCCGACGA 
CACACGACCT 
CCCGGCCCGT 
ACGTCAACAG 
GCTGAGGGCT 
GGACCCTCCC 
TGAGAGGCCC. 
CTAGCAGCCC. 
CTATGGAGCC 
GGACCCCTCC 
AGGGACAGCA 
CTGGCCTGTA 



Seq ID NO: 609 Protein sequence 
Protein Accession ft: NP_476431.1 



11 



21 



41 



51 



31 

! ! I I I 

MELGLGGIiST • LSHCPWPRRQ APLGLSAQPA LWPTLAALAL. LSSVAEASLG SAPRSPAPRE 
GPPPVLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG 
SRARAAGARG CRLRSQLVPV RALGLGHRSD ELVRFRPCSG SCRRARSPHD LSLASLLGAG 
ALRPPPGSRP VSQPCCRPTR YEAVSPMDVN STWRTVDRLS ATACGCLG 

Seq ID NO: 610 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1746 



ATGCCACTGA 
GCCTACCATG 
GGGGCACGCA 
CTCAACACGC 
GCCCTGAGGA 
GGCTCGCTGC 
rrCCAGGGCC 
CAGCCGGCCC 
CTGGAATACA 



11 

I 

AGCATTATCT 
GCTGCCCTAG 
TTGTGGCGGT 
ACATCACTGA 
TTGAGAAGAA 
GCTATCTCAG 
TGGACAGCCT 
ACTTCTCCCA 
TCCCTGACGG 



21 
I 

CCTTTTGCTG 
CGAGTGTACC 
GCCCACCCCT 
ACTCAATGAG 
TGAGCTGTCG 
CCTCGCCAAC 
TGAGTCTCTC 
GTGCAGCAAC 
AGCCTTCGAC 



31 
I 

GTGGGCTGCC 
TGCTCCAGGG 
CTGCCCTGGA 
TCCCCGTTCC 
CGCATCACGC 
AACAAGCTGC 
CTTCTGTCCA 
CTCAAGGAGC 
CACCTGGTAG 



41 
I 

AAGCCTGGGG 
CCTCCCAGGT 
ACGCCATGAG 
TCAATATCTC 
CTGGGGCCTT 
AGGTTCTGCC 
GTAACCAGCT 
TGCAGTTGCA 
GACTCACGAA 



51 
I 

TGCAGGGTTG 
GGAGTGCACC 
CCTGCAGATC 
AGCCCTCATC 
CCGAAACCTG 
CATCGGCCTC 
GTTGCAGATC 
CGGCAACCAC 
GCTCAATCTG 



60 
120 
180 
240 
300 
360 
420 . 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 
GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT TGATGGGCTT 
GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC 
TTCCACAACA ACCACAACCr CCAGAGACTC .TACCTGTCCA ACAACCACAT. CTCCCAGCTG. 
5 CCACCCAGCA- TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 
CTGAAGGAGC TCTCTCTGGG GATCTTOGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 
* " TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA-. GCAACCTCCG CCAGTTGCAG 
GTCCTGATTC TTAGCCGCAA .TCAGATCAGC TTCATCTCCC CGGGTGCCTT CAACGGGCTA 
ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC ": AGGACCTGGA CGGGAATGTC 

10 TTCCGCATGT TGGCCAACCT- GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC 
CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA" TCCAGCTGCA GAACAACCAG 
CTGGAGAACT- TGCCCCTCGG CATCTTCGAT CACCTGGGGA- AACTGTGTGA GCTGCGGCTG, 
- TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC. TCCGCAACTG GCTCCTGCTC 
AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 

15 GGCCAGTCCC TCATTATCAT CAATGTCAAC GTTGCTGTTC- CAAGCGTCCA TGTCCCTGAG 
- GTGCCTAGTT. ACCCAG AAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 
TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG' ACTACACTGA- TCTGACTACC 
ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG- CGGGCTGGCC 
ATTGCCGCCA TTGTAATTGG ■ CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 

20 TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT gtcctgatgc agatgaaggc. acccaatgag - 

TGTTAAAGAG GCAGGCTGGA- GCAGGGCTGG GGAATGATGC GACTGGAGGA' CCTGGGAATT 
TCATCTTTCT GCCTCCACCC CTGGGTCCAT . GGAGCTTTCC CGTGATTGCT CTTTCTGGCC 
CTAGATAAAG. GTGTGCCTAC CTCTTCCTGA- CTTGCCTGAT TCTCCCGTAG AGAAGCAGGT 
CGTGCCGGAC CTTCCTACAA TCAGGAAGAT AGATCCAACT GGCCATGGCA AAAGCCCTGG 
25 GGATTTCCGA TTCATACCCC TGGGCTTCCT TCGAGAGGGC TCTTCCTCCA AATCCTCCCC 
ACCTGTCCTC CAAGAACAGC CTTCCCTGCG CCCAGGCCCC CTCCGGGCCT CTGTAGACTC. 
AGTTAGTCCA- CAGCCTGCTC ACTTCGTGGG AATAGTTCTC CGCTGAGATA GCCCCTCTCG 
• CCTAAGTATT ATGTAAGTTG ATTTCCCTTC TTTTGTTTCT CTTGTTTGTG CTATGGCTTG 
ACCCAGCATG TCCCCTCAAA TGAAAGTTCT CCCCTTGATT' TTCTGCTCCT GAAGGCAGGG 
30 TGAGTTCTCT, CCTCAAAGAA GACTTCAAAC CATTTAACTG GTTTCTTAAG AGCCGTCAAT . 

CAGCCTGGTT- TTGGGGATGC TATGAAAGAG . AGAAGGAAAA TCATGCCGCT- CAGTTCCTGG 
AGACAGAAGA GCCGTCATCA- GTGTCTCACT TGTGATTTTT ATCTGGAAAA GGAAGAAACA 
CCCCAGCACA- GCAAGCTCAG- CCTTTTAGAG AAGGATATTT CCAAACTGCA- AACTTTGCTT 
TGAAAAGTTT AGCCCTTTAA- GGAATGAAAT- CATGTAGAAT TTTGGACTTC TAAAAACATT 
35 AAAAT CAGCT T ATT AATACG GGATAGAGAA AGAAATCTGG TGCGTGGGGG' TCCCTGTGTT 
CACCCCTAGA GTTTGTTTTA AAATTTTTAA" TTGAAGCATG TGAAGTGTAC STGCAGAAAA 
GTGGGAACAT- GATAGTGTAT GGCTTGGTGG ATTTTCACAA- ACTGAACATA- CCTGTGTAAT. 
CAGCATCTAG ACCCAGACCC. AGAGCATCAC AAATATCCCC. CATCCTGGGC TTTTCCCAGA 
GGAGATGGGG" GCTTCTGAAG" ATGGACTTAC, CTGGGACCTG CCCCCCATGA GCCAGGACGG 
40 ' TCCCCCCACA GTCAGCCTGT GCAAAGGCCC CGTGGCCAGG GGTGGAGGAG AATATGTGGG - 
TGTGGACAGG ATGGGAGACT GTGGCCTGAA CAGGAGATTT' TATTATATCT GGAGACCCTG 
AGAGACCCTG AGACCTGGGG CACCATGGCT GGCCAGGTCA GAAGCATCCT GACTGCAGAG 
GTCCGTGCAG CCACACCCTC TTCCCTGCCA GCAAGTTGTC TGCGGCTCAT CGGAGGCCCC- 
TCCGCCTGGA GCCTTCTATG GACGTGATAT GCCTGTATCT GTTTTTAATT TTCATTCTTC 
45 ACTTAGGGGA AGTGAAATCG CTCAGAGATG AGATCCTTTA ATTGAAAACG AAGTGTAACG 
GAATCTAGTG ■ TCTTTCTAAT GTGGTAAAAT TCTCCATCAA CATCACAGTC AGCTGGCAGC 
- . TGAACTTGAG AATCTCACTT ACAGCAGGCG ACACGGGGGT ACACCGATGG GTCACACTGG - 

GTCTGGGGGC TCCCTGGAGC TCCTCCTGCG TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 
TCCAGGGTTA TTCTCCTCCT CGAGTCACAG TCACACGAAT ACCTGCCTTC TCTGGCTTTC.- 
50 CTGCTATACA CATATTCACA TGGCGCTCAA GAAGTTAGGC TCATGGCAAC GTGTGTCTTT 
CTCTGGACAA CTGGCCCAGT- TTACAGTGAA ATGGAGAATT TCAGGTCTCC ACGTCTGCCC 
AGGAAAGAAC TTCAGCTGAC TCCACGGGGA TCTGGAAATC CACGACCAAT CCCGATCGGC- 
TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGCTTTGG AAATCCACCA CCAATCCCGA 
TCGGCTCTTA- TTAGCTCCCC GCTCCACAAG ACACCTGTGA TCTGGAAATC TACCACCAAT 
55 CCCGATCGGC TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGACATCC TCCAGGGCCA 
CAGGAGCACG TGCTGACCAG TTTTCCCTTC CAGTTCCTGC ACAAAAAGTG TCCAGAGGGC 
TGTTTGCAAA CACTAGTGCA' CTTTGTAGCT TTTCACCCTC TGTCCCAGGG AATCTAGGAG 
AGATGAGGCC CGTCAGAGTC AAGAGATGTC ATCCCCCCAG GGTCTCCAAG GCATTTCCAC 
■: ACTATTGGTG GCACCTGGAG GACATGCACC' AAGGCTTGCC AGAGCCAACA GGAAGTGAGC 
60 CCAGAGCATG GCACATGAGC ATCACCCGCT GATGGTGGCC TGCTGTGCCT GGTGCCAACA 
GGGGCATCCC GGCCCGTACC CCTCCAGACA GGAAGCATGG GTTTGCCCAC AGACCTGTCG 
GGTGCTCCTG TGAGTGGCCT CCAGATGTCT TTGTGCATAG GCACAAGTGG GCCAGGGCTG 
GAGGGAGGTG GGAAACCTCA TCATCCGGTG GGCCCTGCCA ATCTTAACCC AGAACCCTTA 
GGTATTCCTG GCAGTAGCCA TGACATTGGA GCACCTTCCT CTCCAGCCAG AGGCTGACCT 
65 GAGGGCCACT GTCCTCAGAT" GACACCACCC AGGAGCACCC TAGGTGAGGG GTGAGGGCCC 
CCTTATGTGA ACCTCTTGCC TCTTCCTTTC TCCCATCAGA GTGGTTGGAT GGAGCCATTG 
GCCTCCTTTT CTTCAGCGGG CCCTTCAACC TCTCTGCACC ATGTTGTCTG GCTGAGGAGC 
' TACTAGAAAA GCTGAGTGGA GTCTCCTTTC CAACAGGATG ATGCATTTGC TCAATTCTCA 
GGGCTGGAAT GAGCCGGCTG GTCCCCCAGA AAGCTGGAGT GGGGTACAGA GTTCAGTTTT 
70 • CCTCTCTGTT TACAGCTCCT. TGACAGTCCC ACGCCCATCT GGAGTGGGAG CTGGGAGTTA 
GTGTTGGAGA- AGAAACAACA AAAGCCAATT AGAACCACTA TTTTTAAAAA GTGCTTACTG 
TGCACAGATA CTCTTCAAGC ACTGGACGTG GATTCTCTCT CTAGCCCTCA GCACCCCTGC 
GGTAGGAGTG CCGCCTCTAC CCACTTGTGA TGGGGTACAG AGGCACTTGC TCTTCTGCAT 
GGTGTTCAAT AGGCTGGGAG TTTTATTTAT CTCTTCAAAC TTTGTACAAG AGCTCATGGC 
75 TTGTCTTGGG CTTTCGTCAT TAAACCAAAG GAAATGGAAG CCATTCCCCT GTTGCTCTCC 
TTAGTCTTGG TCATCAGAAC CTCACTTGGT ACCATATAGA TCAAAAGCTT TGTAACCACA 
GGAAAAAATA AACTCTTCCA TCCCTTAAAG AATAGAATAG TTTGTCCCTC TCATGGGAAT 
TGGGCTGTAT GTATATTGTT .CTTCCTCCTT AGAATTTAGA> GATACAAGAG TTCTACTTAG 
AACTTTTCAT GGACACAATT TCCACAACCT TTCAGATGCT GATGTAGAGC TATTGGGAAA 
80 GAACTTCCAA ACTCAGGAAG TTTGCAGAGA GCAGACAGCT AGAGATAACT CGGGACCCAG 
AGTTGGTCGA CAGATGTTAG ATGTATCCTA GCTTTTAGCC ATAAACCACT CAAAGATTCA 
GCCCCCAGAT CCCACAGTCA GAACTGAATC TGCGTTGTTG GGAAGCCAGC AGTGGCCTTG 
GGAAGGAAGC CATGGCTGTG GTTCAGAGAG GGTGGGCTGG C^CCACTT CCGGGGAAAA 
CTCCTTCCGC CCCAGGTTTC TTCTTCTCTT AAGGAGAGAT TGTTCTCACC AACCCGCTGC 
85- CTTCATGCTG CCTTCAAAGC TAGATCATGT TTGCCTTGCT tagagaatta ctgcaaatca 

gccccagtgc ttggcgatgc atttacagat ttctaggccc tcagggtttt gtagagtgtg 
agccctggtg ggcagggttg gggggtctgt cttctgctgg atgctgcttg taatccattt 
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GGTGTACAGA ATCAACAATA AATAATATAC ATGTAT 

' Seq ID NOr 611. Protein sequence . 
Protein Accession. ft: BAB84587.1 

1 ■ 11 21 31 41 51 

T i i.i i i 

MPLKHYLLLL VGOQAWGAGL AYHGCPSECT CSRASQVECT GARIVAVPTP. LPWNAMSLQI * 60 

LNTHITELNE SPFLNISALI ALRIEKNELS RITPGAFRNL GSLRYLSLAN. NKLQVLPIGL 120 

FQGLDSLESL LLSSNQLLQ1 QPAHFSQCSN LKELQLHGNH LEYIPDGAFD HLVGLTKLNL- 180 

GKNSLTHISP RVFQHLGNLQ VLRLYENRLT- DIPMGTFDGL VNLQELALQQ NQIGLLSPGL 240 

FHNNHNLQRL YLSNNHISQL PPSIFMQLPQ LNRLTLFGNS LKELSLGIFG" PMPNLRELWL 300 

YDNHISSLPD. NVFSNLRQLQ VLILSRNQIS FISPGAFNGL TELRELSLHT NALQDLDGNV 360 

• FRMLANLQNU SLQNNRLRQL PGNIFANVNG LMAIQLQKNQ. LENLPLGIFD HLGKLCELRL 420 

YDNPWRCDSD ILPLRNWLLL NQPRLGTDTV- PVCFSPANVR GQSLIIINVN VAVPSVHVPE 480 

VPSYPETPWY PDTPSYPDTT SVSSTTELTS PVEDYTDLTT IQVTDDRSVW GMTQAQSGLA 540 
IAAIVIGIVA" LACSLAACVG CCCCKKRSQA- VLKQMKAPNE C 

Seq ID NO: '612 DNA sequence 
Nucleic Acid Accession ft: XM_X>98151 
Coding Bequence: 1 . -447 ■" 

1 11 . 21 ' 31 41 51 

i r i . i i i 

ATGATGCATT TGCTCAATTC TCAGGGCTGG AATGAGCCGG CTGGTCCCCC AGAAAGCTGG 60 

AGTGGGGTAC AGAGTTCAGT TTTCCTCTCT GTTTACAGCT CCTTGACAGT CCCACGCCCA 120 

TCTGGAGTGG GAGCTGGGAG TCAGTGTTGG AGAAGAAACA ACAAAAGCCA ATTAGAACCA 180 

; CT ATTTTTAA AAAGTGCTTA CTGTGCACAG ATACTCTTCA AGCACTGGAC GTGGATTCTC 240 

TCTCTAGCCC TCAGCACCCC TGCGGTAGGA; GTGCCGCCTC TACCCACTTG TGATGGGGTA" 300 

CAGAGGCACT TGCTCTTCTG CATGGTGTTC. AATAGGCTGG GAGTTTTATT TATCTCTTCA 360 

AACTTTGTAC AAGAGCTCAT GGCTTGTCTT GGGCTTTCGT CATTAAACCA AAGGAAATGG 420 
AAGCCATTCC CCTGTTGCTC TCCTTAG 

Seq ID NO: 613 .Protein sequence 
Protein Accession ft: XP_098151 

1 11 21' 31 41 51 

I | I I I I 

MMHLLNSOGW NEPAGPPESW SGVQSSVFLS . VYSSLTVPRP SGVGAGSQCW RRNNKSQLEP 60 

LFLKSAYCAQ ILFKHWTWIL SLALSTPAVG VPPLPTCDGV QRHLLFCMVF NRLGVLFISS 120 
NFVQELMACL GLSSLNQRKW KPFPCCSP 

Seq ID NO: 614 DNA sequence 

Nucleic Acid Accession ft: NMJ)026S8.1 

Coding sequence: 77.. 1372 . 

1 11 21 31 41 51 

GTCCCCGCAG CGCCGTCGCG CCCTCCTGCC GCAGGCCACC GAGGCCGCCG CCGTCTAGCG 60 

CCCCGACCTC GCCACCATGA GAGCCCTGCT GGCGCGCCTG CTTCTCTGCG TCCTGGTCGT 120 

GAGCGACTCC AAAGGCAGCA ATGAACTTCA TCAAGTTCCA" TCGAACTGTG ACTGTCTAAA 180 

TGGAGGAACA TGTGTGTCCA ACAAGTACTT CTCCAACATT CACTGGTGCA ACTGCCCAAA 240 

GAAATTCGGA GGGCAGCACT GTGAAATAGA TAAGTCAAAA ACCTGCTATG AGGGGAATGG. 300 

TCACTTTTAC CGAGGAAAGG CCAGCACTGA CACCATGGGC CGGCCCTGCC TGCCCTGGAA - 360 

CTCTGCCACT GTCCTTCAGC AAACGTACCA TGCCCACAGA TCTGATGCTC TTCAGCTGGG 420 

CCTGGGGAAA CATAATTACT GCAGGAACCC AGACAACCGG AGGCGACCCT GGTGCTATGT . 480 

GCAGGTGGGC CTAAAGCCGC TTGTCCAAGA ' GTGCATGGTG CATGACTGCG CAGATGGAAA 540 

AAAGCCCTCC TCTCCTCCAG AAGAATTAAA . ATTTCAGTGT GGCCAAAAGA CTCTGAGGCC 600 

CCGCTTTAAG ATTATTGGGG GAGAATTCAC CACCATCGAG AACCAGCCCT GGTTTGCGGC 660 

CATCTACAGG AGGCACCGGG GGGGCTCTGT CACCTACGTG TGTGGAGGCA GCCTCATCAG 720 

CCCTTGCTGG GTGATCAGCG CCACACACTG CTTCATTGAT TACCCAAAGA AGGAGGACTA 780 

CATCGTCTAC CTGGGTCGCT . CAAGGCTTAA CTCCAACACG CAAGGGGAGA TGAAGTTTGA 840 

GGTGGAAAAC CTCATCCTAC . ACAAGGACTA CAGCGCTGAC ACGCTTGCTC ACCACAACGA 900 

CATTGCCTTG CTGAAGATCC GTTCCAAGGA GGGCAGGTGT GCGCAGCCAT CCCGGACTAT 960 

ACAGACCATC TGCCTGCCCT CGATGTATAA CGATCCCCAG TTTGGCACAA GCTGTGAGAT 1020 

CACTGGCTTT GGAAAAGAGA ATTCTACCGA CTATCTCTAT CCGGAGCAGC TGAAAATGAC 1080. 

TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTACG GCTCTGAAGT 1140 

CACCACCAAA- ATGCTATGTG CTGCTGACCC .CCAATGGAAA ACAGATTCCT GCCAGGGAGA 1200 

CTCAGGGGGA CCCCTCGTCT GTTCCCTCCA AGGCCGCATG ACTTTGACTG GAATTGTGAG 1260 

CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT 1320 

CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC 1380 

AGGGAGGAAA CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 1440 

TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAGATGGAT TTGCCTGTGG 1500 

CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC 1560 

CAGACCCTCT GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 1620 

TGTCTTTTTC TGGACTGAAG CCTG CAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 1680 

GGCTCGAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 1740 

AATGAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 1800 

AATGTGGGAG: CAGCGGTTTG GGGAGCAGAG ACACTAACGA CTTCAGGGCA GGGCTCTGAT 1860 

ATTCCATGAA TGTATCAGGA AATATATATG TGTGTGTATG TTTGCACACT TGTTGTGTGG 1920 

GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT 1980 

AAACTGTGTG GACTGTGATG CCACACAGAG . TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 2040 . 

CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 2100 

ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAGTT 2160 

ATCCCTTCCT TTTAGCCTAG TTCATCCAAT . CCTCACTGGG TGGGGTGAGG ACCACTCCTT 2220 

ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 2280 
ATCAATAAAA . TGTGATTTTT CTGA . 
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Seq ID NO: 615 Protein sequence 
Protein Accession #: NP_002649.1 

1 11 21 31 |1 51 

iffiALLARLLL .CVLWSDSKG SNELHQVPSN CDCLNGGTCV SNKYFSNIHW " CNCPKKFGGQ 60 

HCEIDKSKTC YEGNGHFYRG KASTDTMGRP. CLPWNSATVL QQTYHAHRSD ALQLGLGKHN .120 

YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PEELKFQCGQ KTLRPRPKI I 160 

GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG' GSLISPCWVl SATHCFIDYP KKEDYIVYLG 240 

RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 300 

PSMYNDPQFG TSCE1TGFGK ENSTDYLYPE. QLKMTWKLI SHRECQQPHY YGSEVTTKMl* 360 

CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSVJGRGC ALKDKPGVYT RVSHFLPWIR 420 
. L . 



Seq ID NO: 616 DNA sequence 

Nucleic Acid Accession #: NM_024422.1 

Coding sequence: 202.. 2907 

1 11 21 31 41 SI' 

CGCCAAAGGA AAAGCCCCTT GGATGAGAGG. CAGGCGCTTC. AGAGAAGCTA AGAAAAGCAC 60 

CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC- 120 

GCTCCGGCCG-CGGCCCTCGC. CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 180 

GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 240 

CTCTGCCGGC TGCTCCTGCT GACCCTCGCG ; ATCTTAATAT TTGCCAGTGA TGCCTGCAAA > 300. 

AATGTGACAT TACATGTTCC . CTCCAAACTA - GATGCCGAGA AACTTGTTGG TAGAGTTAAC " 360 

CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA" GTGATCCTGA CTTCCAAATT.. 420 

TTGGAGGATG GTTCAGTCTA TACAACAAAT. ACTATTCTAT TGTCCTCGGA GAAGAGAAGT 480 

TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA' AGAAAATATT, TGTCTTTTTG 540 

GAGCATCAAA CAAAGGTCCT AAAGAAAAGA' CATACTAAAG AAAAAGTTCT- AAGGCGCGCC 600 

AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG " ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA • 660 

CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 720 

AGAGGTCCTG GAGTTGACCA AGAACCTCGG AATTTATTTT ATGTGGAGAG AGACACTGGA • 780 

AACTTGTATT- GTACTCGTCC TGTAGATCGT GAGCAGTATG . AATCTTTTGA GATAATTGCC; 840 

- TGCCCCTAAT AATCAAAATA-. 900 

CTTATACTTT' TACAATTTTT 960 



TTTGCAACAA CTCCAGATGG GT AT ACT CCA. GAACTTCCAC 
GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA 

GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 1020 

" - - — ' AGGTGCCACC ATCACCCACC 1080 

CATCATCTCA' GCTAGACAGA 1140 

ACATGGATGG TCAGTATTTT 1200 

ATGTAAATGA V CCACTTGCCA 1260 

ATACAGTTGA TGTGGAAATC, 1320 

CTAACTGGAG • AGCTAATTAT;- 1380 



GACACGATGC ACACACGCCT GAAGTACTCC \ ATCATTGGGC 
CTATTTTCTA TGCATCCAAC TACAGGCGTG' ATCACCACAA 
GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG 
GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATG 
ACATTTACTC GTACTTCTTA TGTG ACATCA GTGGAAGAAA 
TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG 

ACCATTTTAA AGGGCAATGA AAATGGCAAT. TTTAAAATTG TAACAGATGC- CAAAACCAAT- 1440 

GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 1500 

CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG- 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT >■ 1620 

CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 1680 

AAAGCATATG ACCCAGAAAC AAGAAGTAGC . AGTGGCATAA GGTATAAGAA- ATTAACTGAT 1740 

CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 1800 

GATAGAGAGG CAGAGACCAT . CAAAAATGGC ATATATAATA TTACAGTCCT" TGCATCAGAC 1860 

CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGACGT GAATGATAAC 1920 

AGCCCATTCA TACCTAAAAA- GACAGTGATC ATCTGCAAAC CCACCATGTC" ATCTGCGGAG 1980 

ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG; 2040 

AGTTCTACTT- CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA- 2100 

CGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 2160 

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC' 2220 

GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGCG GTGGAGGAGT. ACAACTTGGA 2280 

AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTTTGCAT • CCTGTTTACG 2340 

CTGGTCTGTG GGGCTTCTGG GACGTCTAAA CAACCAAAAG TAATTCCTGA TGATTTAGCC 2400 

CAGCAGAACC TAATTGTATC AAACACAGAA GCTCCTGGAG ATGACAAAGT GTATTCTGCG 2460 

AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA 2520 

TCAGGAATCA.AAAACGGAGG TCAGGAGACC ATCGAAATGG TGAAAGGAGG ACACCAGACC 2580 

TCGGAATCCT GCCGGGGGGC TGGCCACCAT CACACCCTGG ACTCCTGCAG GGGAGGACAC 2640 

ACGGAGGTGG ACAACTGCAG ATACACTTAC TCGGAGTGGC ACAGTTTTAC TCAGCCCCGT 2700 

CTTGGTGAAA' AAGTGTATCT GTGTAATCAA GATGAAAATC ACAAGCATGC CCAAGACTAT 2760 

GTCCTGACAT ATAACTATGA AGGAAGAGGA TCGGTGGCTG GGTCTGTAGG TTGTTGCAGT 2820 

GAACGACAAG AAGAAGATGG GCTTGAATTT TTGGATAATT TGGAGCCCAA ATTTAGGACA 2880 

CTAGCAGAAG CATGCATGAA GAGATGAGTG TGTTCTAATA AGTCTCTGAA AGCCAGTGGC 2 940 

' TTTATGACTT TTAAAAAAAA TTACAAACCA AGAATTTTTT AAAGCAGAAG ATGCTATTTG 3000 

. TGGGGGTTTT TCTCTCATTA TTTGGATGGA ATCTCTTTGG TCAAATGCAC ATTTACAGAG 3060 

AGACACTATA AACAAGTACA CAAATTTTTC AATTTTTACA TATTTTTAAA TTACTTATCT 3120 

TCTATCCAAG GAGGTCTACA GAGAAATTAA AGTCTGCCTT ATTTGTTACA TTTGGGTATA 3180 

ATGACAACAG CCAATTTATA GTGCAATAAA ATGTAATTAA TTCAAGTCCT TATTATAGAC 3240 

TATTTGAAGC ACAACCTAAT GGAAAATTGT AGAGACCTTG CTTTAACATT ATCTCCAGTT 3300 

AATTAAGTGT TCATGTGGTG CTTGGAAACT GTTGTTTTCC TGAACATCTA AAGTGTGTAG 3360 

ACTGCATTCT TGCTATTATT TTATTCTTGT AATGTGACCT. TTTCACTGTG CAAAGGGAGA 3420 
TTTCTAGCCA GGCATTGACT, ATTACAATTT CATT 

Seq ID NO: 617 Protein sequence 
Protein Accession. ft: NP_077740.l 

t ll r .21 . 31 41 . 51 

'JcEAARPSGSW NCSALCRLLLL TLAIIiIFASD ACKNVTLHVP SKLDAEKLVG RVNLKECFTA 60 

ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSFTILLSN TENQEKKKIF VFLEHQTKVL 120 
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WO 02/086443 

KKRHTKEKVL RRAKRRWAPI PCSMLENSLG PFPbFLQQVQ 
EPRNLFYVER DTGNLYCTRP VDREQYESFE IIAPATTPDG 
PIPTSBTYTF TIFENCRVGT TVGQVCATDK DBPDTMHTRL 
TGVITTTSSQ IiDRELIDKYQ LKIKVQDMDG QYFGLQTTST 
VTSVEENTVD VEILRVTVED KDLVNTANWR ANYTILKGNE 
KPLNYEEKQQ MILQIGWNE APFSREASPR SAMSTATVTV 
KENAEVGTTS NGYKAYDPET RSSSGIRYKK LTDPTGWVTI 
KNGIYNITVL ASDQGGRTCT - GTLGII LQDV NDNSPFIPKK 
EPIHGPPFDF- SLESSTSEVQ RMWRLKAIND TAARLSYQND 
VTSLDVTLCD CITENDCTHR VDPRIGGGGV QLGKWAILAI 

' TSKQPKVIPD DLAQQNLIVS- NTEAPGDDKV YSANGPTTQT. 

. QETIEMVKGG HQTSBSCRGA GHHHTLDSCR GGHTEVDNCR 
CNQDENHKHA QDYVLTYKYE GRGSVAGSVG 
R 

Seq ID NO; 618 DNA sequence 
Nucleic Acid Accession ft: NM_00494 9.1 
Coding sequence: 202.. 2745 



PCT/US02/12476 



SDTAQNYTIY 
YTPELPLPLI 
KYSIIGQVPP 
CIINIDDVND 
NGNFKIVTDA 
NVEDQDEGPS 
DENTGSIKVF 
TVTICKFTMS. 
PPPGSYWPI- 
LLGIALLFCI- 
VGASAQGVCG, 
YTYSEWHSFT 
LEFLDNLEPK 



YSIRGPGVDO. 

IKIEDENDNY 

SPTLFSMHPT. 

HLPTFTRTSY 

FCTNEGVLCW 

CNPPIQTVRM- 

RSLDREAETI.. 

SAEIVAVDPD- 

TVRDRLGMSS 

LFTLVCGASG* 

TVGSGIKNGG' 

QPRLGEKVYL 

FRTLAEACMK 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



1 

I ' 

CGCCAAAGGA- 
CTCTCCGCGC 
GCTCCGGCCG 
GACCTGCCCC 
CTCTGCCGGC ' 
AATGTGACAT 
CTGAAAGAGT 
TTGGAGGATG 
TTTACCATAT 
GAGCATCAAA 
AAGAGAAGAT 

. CTTTTCCTTC 
AGAGGTCCTG* 
AACTTGTATT.. 
TTTGCAACAA' 
GAGGATGAAA- 
GAAAATTGCA 
GACAOGATGC 
CTATTTTCTA 
GAGTTAATTG 
GGTCTACAGA 
ACATTTACTC 
TTACGAGTTA 
ACCATTTTAA 
GAAGGAGTTC 
CAAATTGGTG 
AGCACAGCAA 
CCAATACAGA 
AAAGCATATG. 
CCAACAGGGT 
GATAGAGAGG 
CAAGGAGGGA 
AGCCCATTCA 
ATTGTTGCGG 
AGTTCTACTT 
CGTCTTTCCT 
GATAGACTTG 
GAAAATGACT 
AAGTGGGCCA 
CTGGTCTGTG 
CAGCAGAACC 
AAITGGCTTCA 
TCAGGAATCA 
TCGGAATCCT 
ACGGAGGTGG 
CTTGGTGAAG 
GTATCTGTGT 
CTATGAAGGA 
AGATGGGCTT 
CATGAAGAGA 
AAAAAATTAC 
TCATTATTTG 
AGTACACAAA 

' TCTACAGAGA 
TTTATAGTGC 
CCTAATGGAA 
GTGGTGCTTG 
ATTATTTTAT 
TTGACTATTA 



It 
I 

AAAGCCCCTT 
GCCCCACCTC 
CGGCCCTCGC 
GAGCCCTCTC 
TGCTCCTGCT 
TAGATGTTCC ' 
GCTTTACAGC 
GTTCAGTCTA 
TACTTTCCAA- 
CAAAGGTCCT 
GGGCTCCAAT. 
AACAGGTTCA 
GAGTTGACCA 
GTACTCGTCC 
CTCCAGATGG 
ATGAT AACTA 
GAGTGGGCAC 
ACACACGCCT 
TGCATCCAAC: 
ACAAGTACCA 
CAACTTCAAC 
GTACTTCTTA- 
CTGTTGAGGA 
AGGGCAATGA 
TTTGTGTAGT 
TAGTTAATGA- 
CAGTTACTGT- 
CTGTTCGCAT- 
ACCCAGAAAC 
GGGTCACCAT 
CAGAGACCAT 
GAACATGTAC 
TACCTAAAAA 
TTGATCCTGA 
CAGAAGTACA- 
ATCAGAATGA- 
GCATGTCTAG. 
GCACACATCG 
TCCTTGCAAT 
GGGCTTCTGG 
TAATTGTATC 
CAACCCAAAC 
AAAAOGGAGG 
GCCGGGGGGC 
ACAACTGCAG' 
AATCCATTAG 
AATCAAG ATG : 
AGAGGATCGG- 
GAATTTTTGG 
TGAGTGTGTT. 
AAACCAAGAA 
GATGGAATCT 
TTTTTCAATT' 
AATTAAAGTC 
AATAAAATGT 
AATTGTAGAG 
GAAACTGTTG 
TCTTGTAATG 
CAATTTCATT 



21 
I 

GGATGAGAGG 
CTCCGCCTCG 
CCCGCGGAGC 
CATGGAGGCA 
GACCCTCGCG 
CTCCAAACTA 
TGCAAATCTA 
TACAACAAAT ■' 
CACTGAGAAC 
AAAGAAAAGA- 
TCCTTGTTCG; 

atctgacacg: 
agaacctcgg 

TGTAGATCGT. 
GTATACTCCA 
CCCAATTTTT 
TACTGTGGGA 
GAAGTACTCC 

tacaggcgtg; 

GTTGAAAATA 
TTGTATCATT 
TGTGACATCA 
TAAGGACTTA 
AAATGGCAAT 
TAAGCCTTTG 
AGCTCCATTT 
TAATGTAGAA 
GAAAGAAAAT 
AAGAAGTAGC 
TGATGAAAAT 
CAAAAATGGC 
GGGGACACTG 
GACAGTGATC 
TGAGCCTATC 
GAGAATGTGG 
TCCTCCATTT 
TGTCACTTCA 
TGTAGATCC7V 
ATTGTTGGGC 
GACGTCTAAA 
AAACACAGAA 
TGTGGGCGCT 
TCAGGAGACC 
TGGCCACCAT 
ATACACTTAC 
AGGACACACT 
AAAATCACAA 
TGGCTGGGTC 
ATAATTTGGA 
CTAATAAGTC 
TTTTTTAAAG 
CTTTGGTCAA 
TTTACATATT 
TGCCTTATTT 
AATTAATTCA 
ACCTTGCTTT 
TTTTCCTGAA 
TGACCTTTTC 



31 
I 

CAGGCGCTTC 
CGCTCCTCCT. 
CCTCCTACCC 
GCCCGCCCCT 
ATCTTAATAT. 
GATGCCGAGA" 
ATTCATTCAA 
ACT ATT CT AT 

caagaaaaga' 
catactaaag 
atgct ag aaa ' 
gcccaaaact; 

AATTTATTTT 
GAGCAGTATG 
GAACTTCCAC 
ACAGAAGAAA 
CAAGTGTGTG 
ATCATTGGGC 
ATCACCACAA 
AAAGTACAAG 
AACATTGATG 
GTGGAAGAAA 
GTGAATACTG. 
TTTAAAATTG 
AATTATGAAG 
TCCAGAGAGG 
GATCAGGATG 
GCAGAAGTGG 
AGTGGCATAA 
ACAGGATCAA 
ATATATAATA 
GGCATTATAC 
ATCTGCAAAC 
CATGGCCCAC 
AGACTGAAAG 
GGCTCATATG 
TTGGATGTTA 
AGGATTGGCG 
ATAGCATTGC 
CAACCAAAAG. 
GCTCCTGGAG 
TCTGCTCAGG 
ATCGAAATGG 
CACACCCTGG 
TCGGAGTGGC 
CTGATTAAAA 
GCATGCCCAA 
TGTAGGTTGT 
GCCCAAATTT 
TCTGAAAGCC 
CAGAAGATGC 
ATGCACATTT 
TTTAAATTAC 
GTTACATTTG 
AGTCCTTATT 
AACATTATCT 
CATCTAAAGT 
ACTGTGCAAA 



41 
I 

AGAGAAGCTA 
GAGCAGCGGG 
CGGCCCGACG 
CCGGCTCCTG 
TTGCCAGTGA 
AACTTGTTGG 
GTGATCCTGA" 
TGTCCTCGGA 
AGAAAATATT 
AAAAAGTTCT 
ACTCCTTGGG 
ATACCATATA 
ATGTGGAGAG 
AATCTTTTGA 
TGCCCCTAAT 
CTTATACTTT 
CTACTGACAA 
AGGTGCCACC 
CATCATCTCA* 
ACATGGATGG 1 
ATGTAAATGA 
ATACAGTTGA 
CTAACTGGAG 
TAACAGATGC 
AAAAGCAACA 
CTAGTCCAAG 
AGGGCCCTGA 
GAACAACAAG 
GGTATAAGAA 
TCAAAGTTTT 
TTACAGTCCT 
TTCAAGACGT 
CCACCATGTC 
CCTTTGACTT 
CAATTAATGA 
TAGTACCTAT 
CACTGTGTGA 
GTGGAGGAGT 
TCTTTTGCAT- 
TAATTCCTGA 
ATGACAAAGT 
GAGTTTGTGG 
TGAAAGGAGG 
ACTCCTGCAG 
ACAGTTTTAC 
ATTAAACAAT 
GACTATGTCC 
TGCAGTGAAC 
AGGACACTAG 
AGTGGCTTTA 
TATTTGTGGG 
ACAGAGAGAC 
TTATCTTCTA 
GGTATAATGA 
ATAGACTATT 
CCAGTTAATT 
GTGTAGACTG 
GGGAGATTTC 



51 
I 

AGAAAAGCAC 

CCCAGACTGC 

CTCGGCCCGC 

GAACGGAGCC 

TGCCTGCAAA 

TAGAGTTAAC 

CTTCCAAATT 

GAAGAGAAGT. 

TGTCTTTTTG 

AAGGCGCGCC 

TCCTTTTCCA ■ 

CTATTCCATA 

AGACACTGGA' 

GATAATTGCC 

AATCAAAAT A ' 

TACAATTTTT 

AGATGAGCCT 

ATCACCCACC. 

GCTAGACAGA 

TCAGTATTTT 

CCACTTGCCA 

TGTGGAAATC 

AGCTAArTAT ; 

CAAAACCAAT 

GATGATCTTG- 

ATCAGCCATG < 

GTGTAACCCT 

CAATGGATAT 

ATTAACTGAT^ 

CAGAAGCCTG 

TGCATCAGAC 

GAATGATAAC 

ATCTGCGGAG 

TAGTCTGGAG' 

TACAGCAGCA- 

AACAGTGAGA- 

CTGCATTACC 

ACAACTTGG A -' 

CCTGTTTACG- 

TGATTTAGCC 

GTATTCTGCG 

CACCGTGGGA 

ACACCAGACC 

GGGAGGACAC 

TCAGCCCCGT' 

GAAAGAAAGT 

TGACATATAA 

GACAAGAAGA 

CAGAAGCATG 

TGACTTTTAA 

GGTTTTTCTC 

ACTATAAACA 

TCCAAGGAGG 

CAACAGCCAA 

TGAAGCACAA 

AAGTGTTCAT 

CATTCTTGCT- 

TAGCCAGGCA 



60 
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Seq ID NO: 619 Protein sequence 
Protein Accession ft: NP_004940.1 



51 



1 11 21 31 41 

1 - \ ' ' : I I I I 

MEAARPSGSW NGALCRLLLL TLAZLIFASD ACKNVTLHVP SKLDAEKLVG RVNLKECFTA 



60 



425 



WO 02/086443 PCT/US02/12476 . 

ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSFTILLSN TENQEKKXIF VFLEHQTKVL 120 

KKRHTKEKVTj rrakrrwapi pcsmlenslg" pfplflqqvq sotaqnytiy YSIRGPGVDQ 100 

• EPRNLFYVER DTGNLYCTRP VDREQYESFE I IAFATTPDG YTPELPLPLI IKIED END NY 240 

PIFTEETYTF TIPENCRVGT TVGQVCATDK DEPDTMHTRL KYSIIGQVP? SPTLFSKKPT 300 

5 TGVITTTSSQ LDRELIDKYQ LKIKVQDMDG QYFGLQTTST CIDTIDDVND ELPTFTRTSY 360 

VTSVEENTVD VEILRVTVED KDLVNTANWR ANYTII/KGNE NGNFKIVTDA KTNEGVLCW 420 

KPLNYEEKQQ MILQIGWNE APFSREASPR SAMSTATVTV NVEDQDEGPE CNPPIQTVRM 480 

KENAEVGTTS NGYKAYDPET RSSSGIRYKK LTDPTGWVT I • DENTGSIKVF RSLDREAETI "540 . ; 

KNGIYNITVL ASDQGGRTCT GTLGIILQDV NDNSPFIPKX TVIICKPTMS SAEIVAVDPD 600 . . 

1 0 EPIHGPPFDF SLESSTSEVQ RMWRLKAXND TAARLSYQND PPFGSYWPI TVRDRhGHSS .660 

VTSI*DVTLCD "CITENDCTHR VDPRIGGGGV QLGKWAILAI LLGIA1»LFCI LFTLVCGASG 720 

TSKQPKVIPD DLAQQNLIVS NTEAPGDDKY YSANGFTTQT VGASAQGVCG TVGSGIKNGG 780 

QETIEMVKGG HQTSESCRGA GHHHTLDSCR GGHTEVDNCR YTYSEWHSFT QPRLGEESIR 840 
GHTLIKN 



15 



40 



70 



Seq ID NO: 620 DNA sequence 

Nucleic Acid Accession ft: NM_032545.1 

Coding sequence : 4 6 . . 7 1 8 



20 1 11 21 31 |1 51 

AAACTGATCT TCAATGCACT ' AAGAGAAGGA GACTCTCAAA CCAAAAATGA CCTGGAGGCA 60 

CCATGTCAGG CTTCTGTTTA CGGTCAGTTT GGCATTACAG ATCATCAATT TGGGAAACAG 120 

CTATCAAAGA GAGAAACATA ACGGCGGTAG. AGAGGAAGTC ACCAAGGTTG CCACTCAGAA 180. 

25 GCACCGACAG TCACCGCTCA ACTGGACCTC CAGTCATTTC GGAGAGGTGA CTGGGAGCGC' 240 

CGAGGGCTGG GGGCCGGAGG AGCCGCTCCC CTACTCCCGG GCTTTCGGAG AGGGTGCGTC 300 

CGCGCGGCCG CGCTGCTGCA GGAACGGCGG TACCTGCGTG CTGGGCAGCT TCTGCGTGTG 360 

CCCGGCCCAC TTCACCGGCC GCTACTGCGA GCATGACCAG AGGCGCAGTG AATGCGGCGC 420 

CCTGGAGCAC GGAGCCTGGA CCCTCCCCGC CTGCCACCTC TGCAGGTGCA- TCTTCGGGGC 480 

30 CCTGCACTGC CTCCCCCTCC AGACGCCTGA' CCGCTGTGAC. CCGAAAGACT TCCTGGCCTC 540 

CCACGCTCAC GGGCCGAGCG CCGGGGGCGC GCCCAGCCTG CTACTCTTGC TGCCCTGCGC 600 

ACTCCTGCAC CGCCTCCTGC GCCCGGATGC GCCCGCGCAC CCTCGGTCCC. TGGTCCCTTC 660 

CGTCCTCCAG CGGGAGCGGC GCCCCTGCGG AAGGCCGGGA" CTTGGGCATC GCCTTTAATT- 720 

TTCTATGTTG TAAATAATAG- ATGTGTTTAG TTTACCGTAA 7 GCTGAAGCAC TGGGTGAATA 7B0 

35 TTTTTATTGG GTAATAAATA TTTTCATGAA AGCGCCAAAA AAAAAAAAAA AAAAAAAAAA 840 
AAAAAA 



Seq ID NO:" 621 Protein" sequence 
Protein Accession tt : NP_115934 . 1 



! ii 21 31 41 51 

MTWRHHVRLL FTVSLALQII NLGNSYQREK LgGREEVTK VATQKHRQSP LNWTSSHFGE 60 

VTGSAEGWGP EEPLPYSRAF GEGASARPRC CRNGGTCVLG. SFCVCPAHFT GRYCEHDQRR 120 

45 SECGALEHGA WTLRACHLCR- CIFGAiaCLP LQTPDRCDPK DFIASHAHGP SAGGAPSLLL 180 
• LLPCALLHRL LRPDAPAHPR SfcVPSVLQRE RRPCGRPGLG HRL 

Seq ID NO: 622 DNA sequence 
Nucleic Acid Accession ft: FGENESH predicted 
50 Coding sequence: 1..390 

! ii 21 31 |1 51 

ATGAGGTTCA GTGTCTCAGG * CATGAGGACC GACTACCCCA GGAGTGTGCT GGCTCCTGCT 60 

55 TATGTGTCAG TCTGTCTCCT CCTCTTGTGT CCAAGGGAAG TCATCGCTCC CGCTGGCTCA 120 

GAACCATGGC TGTGCCAGCC GGCACCCAGG TGTGGAGACA AGATCTACAA CCCCTTGGAG 180 

CAGTGCTGTT ACAATGACGC CATCGTGTCC CTGAGCGAGA CCCGCCAATG TGGTCCCCCC 240 

• TGCACCTTCT - GGCCCTGCTT TGAGCTCTGC TGTCTTGATT CCTTTGGCCT CACAAACGAT 300 

TTTGTTGTGA AGCTGAAGGT TCAGGGTGTG AATTCCCAGT GCCACTCATC TCCCATCTCC 360. 
60 AGTAAATGTG AAAGAGGCCG GATATGTTAG 

Seq ID NO: 623 Protein sequence 
protein Accession ft: FGENESH predicted 

65 1 - 11 V 21 |1 41 51 

MRFSVSGMRT -DYPRSVLAPA YVSVCLLLLC PREVIAPAGS EPWLCQPAPR CGDKIYNPLE 60 
QCCYNDAIVS LSETRQCGPP CTFWPCFEIiC CLDSFGLTND FWKLKVQGV NSQCHSSPIS 120 
SKCERGRIC 



Seq ID NO: 624 DNA sequence 
Nucleic Acid Accession ft: M18728. 
Coding sequence : 51 1085 



75 1 U 21 31 |1 51 

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60 

CCTCAGCCCC 'TCCCTGCAGA TTGCATGTCC. CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120 

TTCTAACCTT CTGGAACCCA' CCCACCACTG. CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

80 ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240 

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360 

ATGCATCCCT GCTGATCCAG. AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420 

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480 

85 TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540 

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 



426 



GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA CCGAGTCCCA 720 
ACOGCAGTGA CCCAGTCACC CTGAATCTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780 
CCToGuGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 
££^c SSt TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 
TOT^ScCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 
CAGCCACTGG CCTCAATAGG ACCACAGTCA- CGATGATCAC AGTCTCTGGA AGTGCTCCTC 1020 
^SSgC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG.GTGGCTCTGA 1080 
TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT . 1140 
^£StCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA MUamn 1200 
CTCTGCTCCT GAAGCCCTAT. ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA, 1260 
ACCCTCAGGC* CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 
GCAAACCAK3 ■ GTGAGAAATT- GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA . 1380 
A^ACTC CTCATCATGA- TAAGGCTCTT ACCCCCTTTT AATTTGTCCT. TGCTTATGCC ". 14 40 
^cScTTTC G^GGCAGG ATGATGCTGT. CATTAGTATT TCACAAGAAG TAGCTTCAGA;. 1500 
GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG . 1560 
££££££ GTGCACCCAG TGACTGACAT. TAGCAGCATC TTTAACACAG CCGTGTGTrC' 620 
AAATCTACAG TGGTCCTTTT. CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 
TTTAATl"cAA CCCAgIxATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740 
GgIg^GTCT GTGCAGTTTC TGACACTTGT- TGTTGAACAT GGCTAAATAC AATGGGTATC 800 
GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860 
CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG COTAOTCCAA. 1920 
CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT ■ I960 
" ^AaItGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT ' 2040 
wSSgaatt acaaaactca GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 
ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT. «GGTCACAC 2160 
TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG- CTACATACTC CAACTGAAAT 2220 
OTMGGAAG AAGATAGATC CAATTAAAAA- AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280 
Sa«S TTCCAGTCTA CTTGAGTTAG; CATAATACAG AAGTCCCCTC TACTTTAACT 2340 
TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAa: TGTATTTATT TCTGTGGTTC 2400 
TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460 
CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 

Seq ID NO: 625 Protein sequence 
protein Accession #: AAA5 9 90 7.1 



11 



21 31 41 51 



ilGPPSAPPCR LHVPWKEVLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE 60 

SSlGYSWYKCr ERVDGNSLIV GYVIGTQQAT. PGPAYSGRET IYPNASLLIQ NVTQNDTGFY 120 

TLQVIKSDLV NEEATGQFHV" YPELPKPSIS; SNNSNPVEDK" DAVAFTCEPE VO^TYLWWV 130 

NGQSLPVSPR LQLSNGNMTL TLLSVKRNDA GSYECEIQNP ASANRSDPVT LNVL^PDVP 240 

TISPSKANYR PGENLNLSCH AASNPPAQYS WFINGTFQQS TQBLFIPNIT VNNSGSYMCQ 300 
AHNSATGLNR TTVTMITVSG SAPVLSAVAT VGITIGVLAR VALI 

Seq ID KO: 626 DNA sequence 
Nucleic Acid Accession ft: M1872B.1 
Coding sequence: 1355.. 1657 

1 11 21 31 41 SI 

IrirrTCIAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60 

cSc£c £cct£a"gA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120 

S^COT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

ATGtSgSgA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG' 240 

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG- TCTAATTGTA GGATATGTAA 300 

^GAACTCA ACAAGCTACC CCAGGGCCOG. CATACAGTGG TCGAGAGACA ATATACOCCA 360 

ATgIaTCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420 

TCATAWGTC AGATCTTGTG AATGAAGAAG CAACOGGACA GTTCCATGTA TAOCCGGAGC 480 

££ariBCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540 

SSScCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 

SScCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 

GCGTCAAAAG GAACGATGCA GGATCCTATG. AATGTGAAAT ACAGAACCCA GC8AGTGCCA 720 

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATCTCCCC ACCATTTCCC 780 

CCTCAAAGGC CAATTACCGT CCAGGGGAAA- ATCTGAACCT- CTCCTGCCAC GCAGCCTCTA 840 

aScSc SaScT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC . 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

££ScTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020 

TcScTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG <^%™A 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140 

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

ScTgScCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 260 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380 

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440 

TGCCTCTTTC SgCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500 

• SgtaACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

SSScrrTA gSSaG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGKTTC 1620 

' AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 

™£aa'ccSgCc!tG ^AATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740 

gSgGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA StTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860 

f-Tf-ACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

S^GTAT tIcCC^CCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040 

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 
S£cS CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 
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WO 02/086443 

TCTCACCTAG GTGAGCGCAT TGAGCCAGTG 
GTTAAGGAAG AAGATAGATC CAATTAAAAA 
ACACAGGAGA TTCCAGTCTA' CTTGAGTTAG 
TTTACAAAAA" AGTAACCTGA ACTAATCTGA 
TGTTTCCTTG TTCCAATTTG' ACAAAACCCA 
CTATCACTGT ACTTGTAGAG TGGTGCTGCT 
TAGCTCTATA ACT 



PCT7US02/12476 



GTGCTAAATG CTACATACTC CAACTGAAAT 
AAATTAAAAC CAATTTAAAA AAAAAAAAGA 
CATAATACAG AAGTCCCCTC TACTTTAACT 
TGTTAACCAA TGTATTTATT TCTGTGGTTC 
CTGTTCT TG T ATTGTATTGC CCAGGGGGAG 
TTAATTCATA AATCACAAAT AAAAGCCAAT 



Seq ID NO: 627 Protein sequence 
Protein Accession 8: AAA59908.1 



11 



. 21 



GGAGCTCAAG 
CCTCAGCCCC 
TTCTAACCTT 
ATGTCGCAGA 
GTTACAGCTG 
TAGGAACTCA 
ATGCATCCCT 
TCATAAAGTC 
TGCCCAAGCC 
CCTTCACCTG 
GCCTCCCGGT 
GCGTCAAAAG 
ACCGCAGTGA 
CCTCAAAGGC 
ACCCACCTGC 
TCTTTATCCC 
CAGCCACTGG 
TCCTCTCAGC 
TATAGCAGCC 
GAATTCTTCT 
CTCTGCTCCT 
ACCCTCAGGC 
GCAAACCATG 
AACAAGACTC 
TGCCTCTTTC 
GGGTAACTTA 
AGATCCTTTA 
AAATGTACAG 
TTTAATTCAA 
GGAGGAGTCT 
GCTGAGACTA 
CTGACTCATT. 
CTCTTGGTAT 
CTCTAAAAGC 
GGCTGGAATT 
ATAAAAGCCC 
TCTCACCTAG 
GTTAAGGAAG 
ACACAGGAGA 
TTTACAAAAA 
TGTTTCCTTG 
CTATCACTGT 
TAGCTCTATA 



11 
I 

CTCCTCTACA 
TCCCTGCAGA 
CTGGAACCCA 
GGGGAAGGAG 
GTACAAAGGC 
ACAAGCTACC 
GCTGATCCAG 
AGATCTTGTG 
CTCCATCTCC 
TGAACCTGAG 
CAGTCCCAGG 
GAACGATGCA 
CCCAGTCACC 
CAATTACCGT 
ACAGTACTCT 
CAACATCACT 
CCTCAATAGG; 
TGTGGCCACC 
CTGGTGTATT 
AGCTCCTCCA 
GAAGCCCTAT 
CTGAGGTGTG 
GTGAGAAATT 
CTCATCATGA 
GCTTGGCAGG 
ACAGAGTGTC 
GTGCACCCAG 
TGGTCCTTTT 
CCCAGCCATG 
GTGCAGTTTC 
AGTTGTAGAA 
CTTTATTCTA 
TACCCTCCTA 
TTTAAATGTC 
ACAAAACTCA 
CAAATGGTGG 
GTGAGCGCAT 
AAGATAGATC 
TTCCAGTCTA 
AGTAACCTGA 
TTCCAATTTG 
ACTTGTAGAG 
ACT 



21 
I 

AAGAGGTGGA 
TTGCATGTCC 
CCCACCACTG 
GTTCTTCTAC 
GAAAGAGTGG 

ccagggcccg: 
aacgtcaccc 
aatgaagaag 
agcaacaact 
gttcagaaca 

CTGCAGCTGT 
GGATCCTATG 
CTGAATGTCC 
CCAGGGGAAA- 
TGGTTTATCA 
GTGAATAATA 
ACCACAGTCA 
GTCGGCATCA 
TTCGATATTT - 
ATCCCATTTT 
ATGCTGGAGA 
TGCCACTCAG 
GACGACTTCA 
TAAGGCTCTT 
ATGATGCTGT 
AGATCTATCT 
TGACTGACAT 
CAGAGTTGGA 
CAATGCCAAA 
TGACACTTGT 
ATTAACAAAT 
TTTTAGTTGG 
ATAGTCATAC 
TGCATGCAGC 
GAGAAATGTG 
TAACTGATAA 
TGAGCCAGTG 
CAATTAAAAA 
CTTGAGTTAG 
ACTAATCTGA 
ACAAAACCCA 
TGGTGCTGCT 



31 
I 

CAGAGAAGAC 
CCTGGAAGGA 
CCAAGCTCAC 
TCGCCCACAA 
ATGGCAACAG 
CATACAGTGG 
AGAATGACAC 
CAACCGGACA 
CCAACCCCGT 
CAACCTACCT 
CCAATGGCAA 
AATGTGAAAT 
TCTATGGCCC 
ATCTGAACCT 
ATGGGACGTT 
GCGGATCCTA 
CGATGATCAC 
CGATTGGAGT 
- CAGGAAGACT 
ATCCCATGGA 
TGGACAACTC 
AGACTTCACC 
CACTATGGAC 
ACCCCCTTTT 
CATTAGTATT 
TGTCAATCCC 
TAGCAGCATC. 
CTTCTAGACT 
TAATAGAATT 
TGTTGAACAT 
GTGCTGCTTG. 
TTTGTATCTT 
TAGTAGTCAT 
CAGCCATCAA 
TCATCAGGAG 
TAGCACTAAT 
GTGCTAAATG 
AAATTAAAAC 
CATAATACAG 
TGTTAACCAA 
CTGTTCTTGT 
TTAATTCATA 



41 



51 
I 



AGCAGAGACC ATGGGACCCC 
GGTCCTGCTC: ACAGCCTCAC 
TATTGAATCC- ACGCCATTCA 
CCTGCCCCAG: AATCGTATTG * 
TCTAATTGTA GGATATGTAA 
TCGAGAGACA ATATACCCCA 
AGGATTCTAT- ACCCTACAAG 
GTTCCATGTA- TACCCGGAGC 
GGAGGACAAG GATGCTGTGG - 
GTGGTGGGTA- AATGGTCAGA 
CATGACCCTC ACTCTACTCA 
ACAGAACCCA GCGAGTGCCA 
AGATGTCCCC ACCATTTCCC 
CTCCTGCCAC GCAGCCTCTA 
CCAGCAATCC ACACAAGAGC 
TATGTGCCAA GCCCATAACT 
AGTCTCTGGA AGTGCTCCTG- 
GCTGGCCAGG GTGGCTCTGA 
GGCAGATTGG ACCAGACCCT 
ACCACTAAAA ACAAGGTCTG 
AATGAAAATT TAAAGGGAAA 
TAACTAGAGA CAGTCAAACT 
AGCTTTTCCC AAGATGTCAA 
AATTTGTCCT TGCTTATGCC 
TCACAAGAAG TAGCTTCAGA 
AACGTTTTAC ATAAAATAAG 
TTTAACACAG CCGTGTGTTC 
CACCTGTTCT CACTCCCTGT 
GCTCCCTACC AGCTGAACAG 
GGCTAAATAC AATGGGTATC 
GTTAAAATGG CTACACTCAT 
GCCTAAGGTG CGTAGTCCAA 
ACTCCCTGGT GTAGTGTATT 
ATAGTGAATG GTCTCTCTTT 
AACATCATAA CCCATGAAGG 
GCTTTAAGAT TTGGTCACAC 
CTACATACTC CAACTGAAAT 
CAATTTAAAA AAAAAAAAGA 
AAGTCCCCTC TACTTTAACT 
TGTATTTATT TCTGTGGTTC ' 
ATTGTATTGC CCAGGGGGAG 
AATCACAAAT AAAAGCCAAT 



Seq id NO: 629 Protein sequence 
Protein Accession #: AAA59909.1 



31 



41 



1 11 21 

I I I I 1 

HLTNVFISW LFPCSNLTKP TVLVLYCPGG AITVLVEWCC FNS 



Seq ID NO: 630 DMA sequence 

Nucleic Acid Accession &: NM_01663 9.1 

Coding sequence: 40. .429 



2220 
2280 
2340 
2400 
2460 
2S20 



31 41 51 

I I ... I . I I I - 

MDSFSQDVKT RLLIMIRLLP PPNIiSLLMPA" SFAWQDDAVI SISQEVASEG NLTECQIYLV 
NPNVLHKIRD PLVHPVTDIS SIFNTAVCSN VQWSFSEU)F 

Seq ID NO: 628 DNA sequence 
Nucleic Acid Accession B: M18728.1 
Coding sequence: 2370.. 2501 



60 



60 
120 
190 
240 
300 
360 
420 
480 
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1800 
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2340 
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2520 



GCGGCGGGCG 
CGGTTGCTGC 
GAGCAAGCGC 
AAGTGCATGG 
GCTGCAGCAC 
CTGACCTTCG 
GAGAAGTTCA 



11 

I V 
CAGACAGCGG 
GGCTCCTCGT 
CAGGCACCGC 
ACTGCGCGTC 
CTCCTGCCCC 
TGCTGGGGCT 
CCACCCCCAT 



21 
I 

CGGGCGCAGG 
GCTGGGGCTC 
CCCCTGCTCC 
TTGCAGGGCG 
CTTCCGGCTG 
GCTTTCTGGC 
AGAGGAGACC 



31 
I 

ACGTGCACTA 
TGGCTGGCGT 
CGCGGCAGCT 
CGACCGCACA 
CTTTGGCCCA 
TTTTTGGTCT 
GGCGGAGAGG 



41 
I 

TGGCTCGGGG 
TGCTGCGCTC 
CCTGGAGCGC 
GCGACTTCTG 
TCCTT GGGGG 
GGAGACGATG 
GCTGCCCAGC 



51 
I 

CTCGCTGCGC ' 60 

CGTGGCCGGG 120 

GGACCTGGAC 180 

CCTGGGCTGC 240 

CGCTCTGAGC 300 

CCGCAGGAGA 360 

TGTGGCGCTG 420 



428 



5 

10 
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WO 02/086443 

ATCCAGTGAC AATGTGCCCC CTGCCAGCCG GGGCTCGCCC 
TTCTAGAGCC AGTCTCTGCC TCCCAGACGC GGOGGGAGCC 
GGGGTGGGGG GCGGTGAATC ACCTCTGAGG CCTGGGCCCA 
AGGTGTCTGG TTGCCCTGCC TCTGGCTCCA GAACAGAAAG 
ACAAAACAGC TGACACTGAC TAAGGAACTG CAGCATTTGC 
CCTTCCTTAG GACCTGGGGG CCAGGCTGAC TTGGGGGGCA 
TCACTCAGAT- GTCCTGAAAT TCCACCACGG GGGTCACCCT 
TTAACACTAG GGGCTGGCCC ACTAGGAGGG. CTGGCCCTAA 
CCCAAAGCGG GGAGGAGATA TTTATTTTGG GGAGAGTTTG 
AATAAAAGAA- TCTTTAACTT TAAAAAAAAA AAAAAAAA 

Seq ID NO: 631 Protein sequence 
Protein Accession ft; NP_057723.1 



PCT/US02/12476 



ACTCATCATT 
AAGCTCCTCC 
GGGTTCAGGG 
GGAGCCTCAC 
ACAGGGGAGG 
GACTTGACAC 
GGGGGGTTAG 
GATACAGACC 
GAGGGGAGGG 



CATTCATCCA . 

AACCACAAGG 

GAACCTTCCA 

GCTGGCTCAC 

GGGGTGCCCT 

TAGGCCCCAC 

GGACCTATTT 

CCCCCAACTC 

AGAATTTATT 



51 



1 11 21 31 41 

I I 1 I ! i 

MARGSLRRLL' RLLVLGLWLA LLRSVAGEQA PGTAPCSRGS SWSADLDKCM DCASCRARPH 
SDFCLGCAAA PPAPFRLLWP ILGGALSLTF VLGLLSGFLV WRRCRRRSKF TTPIEETGGE 
GCPAVALIQ 

Seq ID NO: 632 DNA sequence 

Nucleic Acid Accession #: NMJJ03 816.1 

Coding sequence: 79.. 2538 



1 
I 

CGGCAGGGTT 
CCTGCGGAAT 
CGGTGGTTGC 
CAACAGACCT 
AGAAGAGAAG; 
AAAGAGCATA 
TATACTTACA 
CATTATCGGG 
GGACTCAGAG 
AGCTCTCATT 
TGTGGAGTTT 
CCCAGCATGA- 
GAGCTGTTCA 
GTGAGAGAAG 
ATTCGAATTG 
GGGGGTGCTG 
CGTCGGAGAC 
ATGGCATTTG 
CAAATCACTG 
ATGAATCACG 
GGAGCATCGG: 
TTAAATAAAG 
CCCTCCTGTGr 
GAATGTGAAT 
TGTGCATATG" 
GGAAAAACCA 
CCAGATGTTT 
GGCATGTGCC 
GCCCCCAAAG- 
TTCTCTGGCA- 
TGTGAGAATG 
AGTCGAGGCA 
GGGATGGTTA 
GTAGATGCTT 
GTATGTAATA 
ACTAAAGGAT 
TTGAGGGACG 
TTTATCTTCA 
ACATATGAGT 
CGACATGTTT 
GTACCAACCT 
CCGAAAGTAT 
TATAGTTCCC 
CTAATACTTT 
GAAAACAAAA 
AGTTGTGAAA 
CATCATTGAA 
TGAACATGTT 
AGTGTTTAAG 
AACATGTGAT 
TTTTTCATCA 
CATGAATAAG 
TTATTTTGAA 
TCCATTTTTA 
GAATTTCTAT 
TAAATTATAA 
GGCTATAATA 
CTTGAGAATT 
AGAATGTTTA 
CATAGAAATT 
TTACTGTGGT 



11 
I 

GGAAAATGAT 
CGGCCGAGAT 
TGTTGCTTGG 
CACATCTTTC 
CCCCTAGGCC 
TTATTCACTT 
ACAAGGAAGG 
GCTATGTGGA 
GATTGCTGCA 
TTGAGCACAT 
CCAACAAGGA 
CTCAGCTACT 
TTGTCGTAGA 
AGATGATTCT 
TGCTAGTTGG 
GTGATGTGCT 
ATGACAGTGC. 
TGGGAACAGT 
TGGAGACATT 
ATGATGGGAG 
GTTCCAGAAA 
GAGGAAA CTG 
GTAATAAGTT 
TGGACCCTTG 
GTGACTGTTG. 
GTGAGTGTGA 
TTATTCAGAA 
AGTATTATGA 
ATTGTTTCAT 
ATGAATACAA 
TACAAGAGAT 
CCAAATGTTG 
ACGAAGGCAC 
CTGTTCTGAA 
GCAATAAGAA 
ACGGAGGAAG 
GACTTCTGGT 
TCAAGAGGGA 
CAGATGGCAA 
CTCCAGTGAC 
ATGCAGCCAA 
CATCTCAGGG 
TCACTTGATT 
TTTTTTTTCT 
CACCACAAAA 
TACAAGGAAA 
TAAGTCTTAT 
ATTGCAGTGA 
TGTTATTCTG 
AATCTAATAC 
TGGACGAATT 
CAAATATTGT 
AGTACAAAAT 
TGACCTTTCA 
TATGAATCAT 
GCTTTAAGGT 
AAGCAGGAGC 
TCATGAGCAC 
CATTTACTAA 
AGGCTGGAGA 
ATCTATGAGT 



TCAACTGTGG 
AAATCAAGCA 
ACCTCCCAGA 
GCAACCTCAG 
AAACTTAATT 
TTTTTAACCT 
TGATGTTTTC 
CAGACTTCAC 
TGCAGTAAAG 
TCAGTCATCG 
TTCTCAAATT 
AATTTTCTAC 
CTGTGAAAAC 
AATAATCATC 
CTTCAAAAGA 
ATACTAAAAG 
ACTATAGGTA 
GTGAAAGCAT 
ACGAAGTATT 
AATTAT AAAA 
TTTAAAATCT 
GGTGTGCTGG 
AAGAAGGAAG 
TATCATCTTA 



31 

I 

GAGGTGGAGG. 
GCGCGCTTTC 
CCAGTCCTCG 
• ATTATAACTC 
CAAGTATCTT-. 
AAAGACCTTT 
ACTGACCATC 
AATTCATCCA 
GCGAGTTATG: 
ATGGATGATG 
GAAACTGCAA 
AGAGCTGTCT 
TATGACATGA 
TACTTGGATA k 
TGGACCAATG 
GTGCAGTGGC 
CTAAAGAAAG 
AGCCACGCAG. 
GTTGCTCATG 
TGTGGAGCAA' 
TGCAGTGCAG 

attccaaagc 
ggggaagagt 
agtacctgta 
cggttccttc 
tactgcaatg 
tgccagaata 
caagtcatct 
tctaaaggtg 
actgggaatg: 
ggaattgtgc 

TTCCAGCTAG- 
GCTGGAAAGA 
GATGTTCAGA 
GAAAATGGCT. 
GGACCTACAT 
CTAATTGTTC 
AGAAGCTACT 
AACCCTTCTA 
GAAGTTCCTA 
CAGTTCCCAT 
CCTGCCCGTC 
TCTTTTTGCA 
TTGAAAAGCC 
TAACACAGAA 
CCAGGGAATT 
GTGAGGTTAA 
AACTGTATTG 
CTTAGTTATC 
TGACTAATCA 
ATACTCTAGA 
ATGCACAAGA 
AGTGTGTGTG 
ATAACTCTTA 
GACATTCGTT 
TAATAGATCT 
TCTTCAATCA 
GAACTTTCAA 
GTCATGTAAA 
AAATGGTTTT 
GCTGTGTTAA 



41 

I 

CGACCGAGTG 
CCTCGGGGAC 
GTGCGGCGCG 
CTTGGAGATT 
ATGTTATTCA 
TGCCTGAAGA 
CCAATATACA- 
TTGCTCTTAG. 
GGATTGAACC 
TCTACAAAGA 
AGGATGAAGA 
TGCCACAGAC 
TGGGAAGAAA 
GTATGTATAT 
GAAACCTGAT 
GGGAAAAGTT 
GTTTTGGTGG 
GCGGGATTAA 
AATTGGGTCA 
AGAGCTGCAT 
AGGACTTTGA 
CTGATGAAGC 
GTGACTGTGG 
AGCTTAAATC 
CAGGAGGTAC 
GTTCrrCTCA 
ACAAAGCCTA 
TTGGCTCAAA 
ACAGATTTGG 
CTTTGTGTGG- 
CTGCTATTAT 
GATCAGATGT, 
TCTGTAGAAA 
AAAAGTGTCA 
GGGCTCCCCC 
ACAATGAAAT 
CCCTTATTGT. 
TCAGAAAGAA 
GACAGCCGGG 
TATATGCAAA 
CAAGGCCACC 
CTGCTCCTGC 
AATGTCTTCA 
TTTCTGTTGC 
AAACAGAAAC 
TACAATAACA 
TGCACTAATC 
GTGTAAGATT 
ATTAATGTAG 
GCTGCCAATA 
ATCTTGTCTG 
ACCACAATTA" 
TATTCACGCA 
GAGAAATTAA 
CACAATAGCA 
AATCAAATAT 
ATTGAACTTT 
AGCTTGCTAT 
ATATTAGACA 
CTTAAATACC 
AAATGAATTT 



51 
I 

CTGAGAGGAA 

CCTTCGTGTC 

GCCAGGCTTT 

AACTAGAGAA 

GGCTGAAGGA 

TTTTGTGGTT 

GAATCATTGr 

CGACTGTTTT 

CCTGCAGAAC 

GCCTCTGAAA 

GGAAGAGCCT 

CCGGTATGTG- 

TCAGACTGCT" 

TATGTTAAAT - 

CAACATAGTT.' 

TCTTATCACA 

AACTGCAGGA 

TGTGTTTGGA 

TAATCTTGGA- 

CATGAATTCA 

GAAGTTAACT 

CTATAGTGCT 

TACTCCAAAG 

ATTTGCTGAG 

TTTATGCCGA 

GTTCTGTC^G 

TTGCTACAAC 

AGCCAAGGCT 

CAATTGTGGT 

AAAGCTTCAG- 

TCAAACGCCT 

TCCAGATCCT 

CTTCCAGTGT 

TGGACATGGG 

AAATTGTGAG" 

GAATACTGCA 

CTGTGCTATT 

GAGATCACAA 

GAGTGTTCCT 

CAGATTTGCA 

TCCACCACAA 

ACCTCCTTTA" 

GGGAACTGAG 

AACTATGAAT 

TGAGTGTGAG 

TTTCCGTTTC 

ATGGATTTTT 

TTTGTCATTA 

TTCCTCATTG 

ATATCTAATA 

TCACTCACTA 

AGATGTCATA 

GTTACTCGCT 

TTTAATATTA 

CTATTTTAAA 

GTTGATTCAT 

TACAAAACCA 

TAAATCATTT 

CTAATATTTT 

TACAAAAAAG 

TTACTATGGC 
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WO 02/086443 " 

AGATATGGTA TGGATCGTAA AATTTTAAGC ACTAAAAATT TTTTCATAAC CTTTCA TAAT 
AAAGTTTAAT AATAGGTTTA TTAACTGAAT TTCATTAGTT TTTTAAAAGT GTTTTTGGTT 
TGTGTATATA TACATATACA AATACAACAT TTACAATAAA TAAAATACTT GAAATTCTCA 
AAAAAAAAAA AAAAAAAAAA AAAAA 



PC1YUS02/12476 



3720 

3780. 

3840 



Seq ID KO: . 633 Protein sequence 
Protein Accession #: NP_003807.1 



10 I 



MGSGARFPSG 
PYSKQVSYVI 
EGVHNSSIAIj 
DIEKETAKDE 
LLANYLDSMY 
AQLVLKKGFG 
RDCSCGAKSC 
LVDAGEECDC 
DVPEYCNGSS 
IEVNSKGDRF 
WGVDFQLGSD 
NCHCENGWAP. 
DQLWRSYFRK 
KQPQQPPSRP 



11 
I 

TLRVRWLLLI* 
QAEGKEHIIH 
SDCFGLRGLL 
EEEPPSMTQL 
IMLNIRIVLV 
GTAGMAFVGT 
IMNSGASGSR 
GTPKECELDP 
QFCQPDVPIQ 
GNCGFSGNEY 
VPDPGMVNEG 
PNCETKGYGG 
KRSQTYESDG 
PPPQPKVSSQ 



21 
I 

GLVGPVLGAA 
LERNKDLLPE 
HLENASYGIE 
LRRRRAVLPQ 
GIjEIWTNGNL 
VCSRSHAGGI 
KFSSCSAEDF 
CCEGSTCKLK 
NGYPCQNNKA 
KKCATGNALC 
TKCGAGKICR 
SVDSGPTYNE 
KNQANPSRQP 
GNLIPARPAP 



31 
! 

RPGFQQTSHL 
DFWYTYNKB 
PLQNSSHFEH 
TRYVELFIW. 
IKIVGGAGDV 
NVFGQITVET 
EXLTLNKGGN 
SFAECAYGDC 
YCYNGMOQYY 
GKLQCENVQE 
NFQCVDASVL 
MNTALRDGLL 
GSVPRHVSPV 
APPLYSSLT 



41 
I 

SSYEIITPWR 
GTLITDHPNI 
I IYRMDDVYK 
DKERYDMMGR. 
LGNFVQWREK 
FASIVAHELG- 
CLLNIPKPDE 
CKDCRFLPGG 
DAQCQVIFGS 
IPVFGIVPAI 
NYDCDVQKKC 
VFFFLIVPLI 
TPPREVPIYA 



51 
I 

LTRERREAPR- 
QNHCHYRGYV 
EPLKCGVSNK 
NQTAVREEMI 
FLITRRRHD3 
HNLGMNHDDG 
AYSAPSOGNK 
TLCRGKTSEC 
KAKAAPKDCF 
IQTPSRGTKC 
HGHGVCNSNK 
VCAIFIFIKR 
NRFAVPTYAA 



Seq ID NO: 634 DNA sequence 

Nucleic Acid Accession #: NM_00209l.l 

Coding sequence: 56. .501 



1 

AGTCTCTGCT 
CGGCAGTGAG 
AGCGGTCCCG 
CCACTGGGCG 
TGAGAGAGGG 
GAATTTGCTG 
GGCCTTGGGC 
AGGTTCAAAA 
CCCCCAGCTG 
TAAGAGACTG. 
AAATATTTGA 
CTTCTGGTTT 
TTTTTATATC 
TAAAAGCTTA 



11 
I 

CTTCCCAGCC 
CTCCCGCTGG 
CTGCCTGCGG 
GTGGGGCACT 
AGCCTGAAGC 
GGTCTCATAG 
AATCAGCAGC 
GGCAAAGTTG 
AACCAGCAAT 
AGTTCTGCAA 
CTATTCTGTA 
AAACTTGTTT 
TAGGCTACCT 
AACACAT 



21 

T. 

TCTCCGGCGC 
TCCTGCTGGC 
GCGGAGGGAC 
TAATGGGGAA 
AGCAGCTGAG 
AAGCAAAGGA 
CTTCGTGGGA 
GTAGACTCTC 
GATAATGATG 
GCATCAGTTC 
TCTTTCATCC 
GCTGTGAACA' 
GTTGGTTAGA 



31 
I 

GCTCCAAGGG 
GCTGGTCCTC 
CGTGCTGACC 
AAAGAGCACA 
AGAGTACATC 
GAACAGAAAC 
TTCAGAGGAT. 
TGCTCCAGGT. 
GCCTCTCTCA 
TACGGATCAT 
TTGACTAAAT 
ATTGTCGAAA 
TTCAAGGCCC 



41 
I 

TGCCTAGCGC 
AAGATGTACC 
GGGGAGTCTT 
AGGTGGGAAG 
CACCAGCCAC 
AGCAGCAACT. 
TCTCAACGTG 
AAAGAGAAAA 
CAACAAGATT 
TCGTGATTTT 
AGAGTCTTCC 
CGAGCTGTTA 



51 
I 

.GGACCATGCG 

CCCGGGGGOG 

CGCGCGGCAA 

CTTCTGTTTC 

AAGCTGCAAG.; 

CTCAACCCAA 

TCAAAGATGT 

AAGGAAGGAA 

ACAAAACCCC' 

TCCTTGTGCA- 

CAAGCAGCAT 

AATTAATGCT- 

CCATTCACAA 



Seq ID NO: 635 Protein sequence 
Protein Accession fh NP_Q02082.1 



1 • ii 21 31 41 51 

I I I I I I 

MRGSELPLVL LALVLCLAPR GRAVPLPAGG GTVLTKMYPR GNHWAVGHLM GKKSTGESSS 
VSERGSLKQQ LREYIRWEEA ARNLLGLIEA KENRNHQPPQ ' PKALGNQQPS WDSEDSSNFK 
DVGSKGKVGR LSAPGSQREG RNPQLNQQ 

Seq ID NO: . 636 DNA sequence 

Nucleic Acid Accession #: NMJ116522.1 

Coding sequence: 265.. 1299 



1 
I 

GCGGAAGCAG 
CTGGCAAAAG 
TTTTCTCCTC 
CCGCACCCCA 
rCGGGGAAGT 
TGCCTCGTGG 
AGCGGAGATG 
GCCACCCTCA 
ACCATCCTCT 
AACACCCAAA 
TACACCTGCT 
CAAGTATCTC 
ATTAGCCTCA 
TCTCCCAAAG 
CGGGAACAGT 
CGGAGAGTAA 
GTCCCCGTGG 
TTCCAGTGGT 
AACAGACCTT 
TACACTTGCG 
CCAGGCGCCG 



CGGGAAAGGC 
CCAATCAGAT 
GGGAGGGGAA 
CCTTGCAGAT 



11 

A 

CGAGGAGGGA 
CCGAGGCTGG 
CCCGCGCCTC 
CCCACTTCCT 
TGTGGCTGTC 
TCGTGTCTCT 
CCACCTTCCC 
GGTGCACTAT 
ATGCTGGGAA 
CGCAGTACAG 
CGGTGCAGAC 
CCAAAATTGT 
CCTGCATAGC 
CGGTTGGCTT 
CAGGGGACTA 
AGGTCACCGT 
GACAAAAGGG 
ACAAGGATGA 
TCCTCTCAAA 
TGGCCTCCAA 
TCAGCGAGGT 
TGGTCTTGCA" 
TGCCGCCACC 
ATATACAAAT 
CAAAGAATAC 
ATTTAGGTAC 



21 
I 

GCCCCCTTTG 
ATTTGGGGGA 
CCGGTCGCCG 
GTGCTCGCCC 
GAGAATGGGG 
CAGGCTGCTG 
CAAAGCTATG 
TGACAACCGG 
TGACAAGTGG 
CATCGAGATC 
AGACAACCAC 
AGAGATTTCT 
AACTGGTAGA 
TGTGAGTGAA 
CGAGTGCAGT 
GAACTATCCA 
GACACTGCAG 
CAAAAGACTG 
ACTCATCTTC 
CAAGCTGGGC 
GAGCAACGGC 
CCTGCTTCTC 
ACCACCACCA 
GAAATTAGAA 
TTTGGGGGGA 
AATGGAGTTT 



31 

I 

GCCGTCCTCC 
GGAATATTAG 
CGGGTTCACC 
GGGGGGCGTG 
GTCTGTGGGT 
TTCCTTGTAC 
GACAACGTGA 
GTCACCCGGG 
TGCCTGGATC 
CAGAACGTGG 
CCAAAGACCT 
TCAGATATCT 
CCAGAGCCTA 
GACGAATACT 
GCCTCCAATG 
CCATACATTT 
TGTGAAGCCT 
ATTGAAGGAA 
TTCAATGTCT 
CACACCAATG 
ACGTCGAGGA 
AAATTTTGAT 
ACACAACAGC 
GAAACACAGC 
AAAGAGTTTT 
TCTTTTCCCA 



41 
I 

GTGGAACCGG 
ACTCGGAGGA 
GCTCAGTCCC 
TGCCGTGCGG 
ACCTGTTCCT 
CCACAGGAGT 
CGGTCCGGCA 
TGGCCTGGCT 
CTCGCGTGGT 
ATGTGTATGA 
CTAGGGTCCA 
CCATTAATGA 
CGGTTACTTG 
TGGAAATTCA 
ACGTGGCCGC 
CAGAAGCCAA 
CAGCAGTCCC 
AGAAAGGGGT 
CTGAACATGA 
CCAGCATCAT 
GGGCAGGCTG 
GTGAGTGCCA 
AATGGCAACA 
CTCATGGGAC 
AAAAAAGAAA 
AACGGGAAGA 



51 

I- 

TTTTCCGAGG 

GTCTGCGCGC 

CGCGCTCGCT. 

CTGCCGGAGT 

GCCCTGGAAG 

GCCCGTGCGC 

GGGGGAGAGC 

AAACCGCAGC 

CCTTCTGAGC 

CGAGGGCCCT 

CCTCATTGTG 

AGGGAACAAT 

GAGA CA CATC 

GGGCATCACC 

GCCCGTGGTA 

GGGTACAGGT 

CTCAGCAGAA 

GAAAGTGGAA 

CTATGGGAAC 

GCTATTTGGT 

CGTCTGGCTG 

CTTCCCCACC . 

CCGACAGCAA 

AGAAATTTGA 

TTGAAAATTG 

ACACAGCACA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



. 60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 . 
240 
300 . 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



430 



5 

15 
20 
25 
30 
35 



40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO.02/086443 , . - 

CCOGGCTTGG ACCCACTGCA. AGCTGCATCG TGCAACCTCT TTGGTGCCAG TGTGGGCAAG ! 
GGCTCAGCCT CTCTGCCCAC AGACTGCCCC CACGTGGAAC ATTCTGGAGC TGGCCATCCC 
AAATTCAATC AGTCCATAGA GACGAACAGA ATGAGACCTT CCGGCCCAAG CGTGGQSCIT 
CCGGCCCAAG CGTGGCGCTG CGGGCACTTT GGTAGACTGT GCCACCACGG CGTGTGTTGT 
GAAACGTGAA ATAAAAAGAG CAAAAAAAAA AAAAAAAAA 

Seq ID NO: 637 Protein sequence 
Protein Accession H: NP_057606.1 .* ' 



PCT/US02/12476 



KGVCGYLFLP 
NRVTRVAWLN 
NHPKTSRVHL. 
SEDEYLEIQG 
LQCSASAVPS 
U3HTNASIML 



11 . i -. 

I 

WKCLWVSLR 
RSTILYAGND 
IVQVSPKIVE 
ITREQSGDYE 
AEFQWYKDDK 
FGPGAVSEVS 



21 * 31 41 51 

I I I 1 

LLFLVPTGVP VRSGDATFPK AMDNVTVRQG ESATLRCTID 
. KHCLDPRWL LSNTQTQYSI" EIQNVDVYDE GPYTCSVQTD 
ISSDISINEG NNISLTCIAT GRFEPTVTWR HISPKAVGFV 
CSASMDVAAP WRRVKVTVN YPPYISEAKG TGVPVGQKGT 
RLIEGKKGVK VENRPFLSKL IPFNVSEHDY GNYTCVASNK 
NGTSRRAGCV WLLPLLVLHL LLKF 



Seq ID NO: 638 DNA sequence 

Nucleic Acid Accession #: NM_01226l.l 

Coding sequence: 203. .1045 



GATTTGCTCT 
ACAGAATACG 
CACTCCAGCG 
CCTCATTCGG 
ACTTCGAGTT 
GGAAAATCTC 
TGGGACGACG 
GGCCAGCAAC 
TGAGGTGAAG 
CGCATATGCA 

ggcgacttgg; 
caaagacgca 
cacccccgct 
tgatcogcag 

. TATCTCAGAT 
GGAAGAAACC 
CGCGATTTAC 
ATCCCAGTAT 
CCAACTGGAT 
CATAGCTACA 
AACCCACGGA 
ATGCTGGGGA 
TGACTCTCCA 
TTGAAAACAT 
TGCTCCCTTG 
TCATGCTCCC 
GTTTAGTGAT 
AAAACGACTA 
GGGGGACCTG 
TTCTCTGGC 



11 
I 

GCCAGCAGCT 
CGCTCCCTCC 
GQGACTTTGA 
GGCACTGCGA 
CTCCTGATGT. 
TCAGGCCTTT 
TGTCTCATGG 
TACGTAGATC 
GGCCGCTGTG 
CTCAAAATGC 
AGGCTGAGCA 
GTCAGTGCTG 
GGGAAGTCCT 
AAGACGGTCA 
TTTGTCTTCA 
TTGCCCCTGA 
CACGTCCACX 
AAGCACATGG' 
CAGGTAGAAC 
ATCAAACAGG 
AGGGGGAGAC 
GGAGGGGAGG 
AAGAGCAATA 
GCTTCTTTGA 
GACACAGCTG: 
TGCAGCAAGA 
TGTCTTGGGA 
ATGTAACTAT 
AAGAATCAAT 



21 
I 

GTCGGTGCCG 
CTCCCCCTTC 
GGGATTCCCT 
GTATGGATCT 
TGTTCCATAC 
CCACTAACCC 
CAGAGTTTGC 
TGATCACAGA 
GCCACAGCCA 
TCTTTGTAAA 
AAGTGCAGTT 
GGAAGCACAC 
ATGAGTGTCA 
CCATGATCCT: 
GTGAAGAGCA 
TTTTGGGGCT 
ACAAAATGAC 
GCTAGAGGCC 
AACAAAAGCA. 
CCTGGGTATC 
TCTTTCGGAT. 
AGGGTCTCAG 
AATGCCACTT 
GGAGGAAACC 
GCTTATCCTA 
CCCCTGAAAG " 
ATGTTTCACT 
GCAGAGTTGT 
CTGTGTGAGT 



31 
I 

CGCTCGACAC 
TCTGTCCCCC 
CTCTGGCGGC 
CCAAGGAAGA' 
AATGGCTCAA 
TGAAAAAGAT 
AGCCAAATTT 
ACAGGCCGAT 
GTCGGAGCTG 
GGAAAGCCAC 
TGTCTACGAC 
AGCCAACTCG 
AGCTCAACAA 
GTCTGCGGTC 
TAAATGCCCA 
CATCTTGGGC 
TGCCAACCAG 
GTTAGGCAGG 
CTTTTCCATC 
TGAGGCTTGC 
TTGTAGGGTG 
ACAGCTTTCG 
GGAGCTGTAT 
CCTTTAGGTT 
TACAGTTGTC 
TGATTCATGC 
GCTACCCGCA 
TTGGACTTCT 
CTGTTTTTCA 



41 
I 

CGAGTCCTAG 
GCCTCTCGCT 
CTCTGCAGCA 
GGGGTCCCCA 
ATCATGGCAG 
ATATTTGTGG 
ATTGTACCTT 
ATCGCATTGA- 
CAAGTGTTCT 
AACATGTCCA 
TCCTCGGAGA 
CACCACCTCT 
ACCATTTCAC 
CACATCCAAC 
GTGGATGAGC 
CTCGTCATCA 
GTGCAGATCC 
CACCCCCTAT 
TTGTACACGA 
TTGGCTTGTG 
AAATGGCAAT 
TGCTCATGGT 
CTGGCCCCAA 
CAGAAGAATA 
AATGCACACA 
TTCTGGCTGG 
TCCAGCGACT 
TCCTGTGCCA 
AAATGAAATA 



Seq ID NO: 639 Protein sequence 
protein Accession #: NP_036393.1 



11 



KDLOGRGVPS IDRIiRVLLML 
EFAAKPIVPY DVWASNYVDL 
FVKESHNMSK GPEATWRLSK 
EOQAQQTISL ASSDPQKTVT 
LGLILGLVIM VTLAIYHVHH 



21 
I 

FHTMAQIMAB 
ITEQADIALT 
VQFVYDSSEK 
MILSAVHIQP 
KMTANQVQIP 



31 



41 



51 



Seq' ID NO: ' 640 DNA sequence 

Nucleic Acid Accession #: NM_002 993.1 

Coding sequence: 64.. 408 



1 

I 

GGCACGAGCC 
ACTATGAGCC 
GCGCTGCTCG 
GTCTCTGCTG 
CCCAAAACGA 
GTGGTAGCCT 
AAGAAAGTCA 
ACCATGCATC 
CAGTAAGAAT 
. GAAGAGTGTG 
CTAATATAGT 
CAATTGACCA 
TGAAGATAAC 
ATTTCGTATG 
ACTCACTCTT 



•11 

•I 

AGTCTCCGCG 
TCCCGTCCAG 
CGCTGCTGCT 
TGCTGACAGA 
TTGGTAAACT 
CCCTGAAGAA 
TCCAGAAAAT 
ATAAAATTGC 
AAGAAGGAAG 
GGGGAAAGCC 
ATTTCCACTA 
TATTGTGAGC 
TATTGTATTT 
GAAATAATGT 
CTCATAAAAT 



21 
1 

CCTCCACCCA 
CCGCGCGGCC 
CCTGCTGACG 
GCTGCGTTGC 
GCAGGTGTTC 
CGGGAAGCAA 
TTTGGACAGT 
CCAGTCTTCA 
GGTTGGTTTT 
TACGCTTCTC 
TTTACTGTTA 
AAAGAATCAC 
CT AT CATACA 
TTTATTAGTG 
AGGAAATATT 



31 

I 

GCTCAGGAAC 
CGTGTCCCGG 
CCGCCGGGGC 
ACTTGTTTAC 
CCCGCAGGCC 
GTTTGTCTGG 
GGAAACAAGA 
GCGGAGCAGT 
TTTCCATTTT 
CCTGAAGTTT 
TTTTACCTGA 
TGGTTATTAG 
TTCCTTAAAG 
TGCTGTTGAG 
TTAGTTCTGT 



41 

I 

CCGCGAACCC 
GTCCTTCGGG 
CCCTCGCCAG 
GCGTTACGCT 
CGCAGTGCTC 
ACCCGGAAGC 
AAAACTGAGT 
TTTCTGGAGA 
CTACATGGAT 
ACAGCTCAGC 
TAAGTTATTG 
TCTTTCAATG 
TCTTACCGAA 
GGAGGTATCC 
TTTCTTGGGG 



1620 
1680 
1740 
1800 



51 

I 

CTAGGCGCTC 

CACCCCGGCC 

GCACAGCCGG " 

GCATCGACAG- 

AACAAGAAGT 

TGCGGGAAAA ' 

ATGATGTGTG 

CCCGGGGAGC 

GGGTGGATCG 

AGGGACCTGA 

AAACCCACTT ' 

CTGCCTTGGT 

TGGCCTCTAG 

CTTTTGACAT 

GGGAGCAACT 

TGGTAACACT 

CTCGGGACAG 

TCCTGCTCCC 

GATACACCAA 

TCCATGCTTA 

TATTCTCTCC 

GGCTTGGCTT 

AGTTTAGGGA 

TGGGGTGCTT 

GAATACAACC ■ 

CATTCTGCAT 

GCAGCACCAG 

GGTCCAAGTC 

AAACACACTA 



QEVENLSGLS TNPEKDIFW RENGTTCLMA 
RGAEVKGRCG HSQSELQVFW VDRAYALKML 
THFKDAVSAG KKTANSHHLS ALVTPAGKSY 
FDIISDFVFS EEHKCPVDER EQLEETLPLI 
RDRSQYKHMG 



60 
120 
180 
240 

3.00 



60 
120 
180 
240 
300 
360 
420 
.480 • 
540 
600 
660 
720 
7B0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



60 
120 
180 
240 



51 
I 

TCTCTTGACC 
CTCCTTGTGC 
CGCTGGTCCT 
GAGAGTAAAC 
CAAGGTGGAA- 
. CCCTTTTCTA* 
AACAAAAAAG 
TCCCTGGACC 
TCCCTACTTT 
TAATGAAGTA 
AACCCTTTGG 
AATATTGAAT - 
AAGGCTGTGG 
TGTTGTTCTT 
AATATGTTAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780. 
840 
900 



431 



WO 02/086443 

TCTTTACCCT AGGATGCTAT TTAAGTTGTA CTGTATTAGA ACACTGGGTG TGTCATACCG 960 

TTATCTGTGC AGAATATATT TCCTTATTCA GAATTTCTAA AAATTTAAGT TCTGTAAGGG 1020 

CTAATATATT CTCTTCCTAT GGTTTTAGAT GTTTGATGTC TTCTTAGTAT GGCA7AATGT 1060 

CATGATTTAC TCATTAAACT TTGATTTTGT ATGCTATTTT- TTCACTATAG GATGACTATA _ 1140 

ATTCTGGTCA CTAAATATAC ACTTTAGATA GATGAAGAAG CCCAAAAACA GATAAATTCC 1200 

TGATTGCTAA TTTACATAGA AATGTATTCT CTTGGTTTTT TAAATAAAAG CAAAATTAAC 1260 

AATGATCTCT. GCTCTGCAAA GTTTTGAAAA TATATTTGAA CAATTTGAAT ATAAATTCAT 1320 

CATTTAGTCC TCAAAATATA TACAGCATTG CTAAGATTTT CAGATATCTA TTGTGGATCT 1360 

TTTAAAGGTT TTGACCATTT TGTTATGAGG AATTATACAT GTATCACATT CACTATATTA 1440 

AAATTGCACT TTTATTTTTT CCTGTGTGTC ATGTTGGTTT TTGGTACTTG TATTGTCATT IS 00 
TGGAGAAACA ATAAAAGATT TCTAAACCAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 641 Protein sequence 
Protein Accession 8: NP_002934.1 

1 , 11 21 t 31 41 51 

MSLPSSRAAR VPGPSGSLCA LLALLLLLTP PGPLASAGPV SAVLTELRCT CLRVTLRVNP 60 
KTIGKLOVFP AGPQCSKVEV VASLKNGKQV CLDPEAPFLK KVIQKILDSG NKKN 

Seq ID NO: 642 DNA sequence 

Nucleic Acid Accession 8: NM_013271.1 

Coding sequencet 27.. 809 

1 XI . 21 31 41 51 

TCCGGAGCCA GGCTCGCTGG GGCAGCATGG CGGGGTCGCC GCTGCTCTGG GGGCCGCGGG' 60 

CCGGGGGCGT CGGCCTTTTG. GTGCTGCTGC TGCTCGGCCT GTTTCGGCCG CCCCCCGCGC 120 

TCTGCGCGCG GCCGGTAAAG GAACCCCGCG GCCTAAGCGC AGCGTCTCCG CCCTTGGCTG 180 

AGACTGGCGC TCCTCGCCGC TTCCGGCGGT. CAGTGCCCCG AGGTGAGGCG GCGGGGGCGG 240 

TGCAGGAGCT GGCGCGGGCG CTGGCGCATC TGCTGGAGGC CGAACGTCAG GAGCGGGCGC 300 

GGGCCGAGGC GCAGGAGGCT GAGGATCAGC PJ3GCGCGCGT CCTGGCGCAG CTGCTGCGCG 360 

TCTGGGGCGC CCCCCGCAAC TCTGATCCGG . CTCTGGGCCT GGACGACGAC CCCGACGCGC 420 

CTGCAGCGCA GCTCGCTCGC GCTCTGCTCC GCGCCCGCCT TGACCCTGCC GCCCTAGCAG 480 

CCCAGCTTGT; CCCCGCGCCC GTCCCCGCCG CGGCGCTCCG. ACCCCGGCCC CCGGTCTACG 540 

ACGACGGCCC: CGCGGGCCCG GATGCTGAGG AGGCAGGCGA- CGAGACACCC GACGTGGACC 600 

CCGAGCTGTT, GAGGTACTTG CTGGGACGGA TTCTTGCGGG- AAGCGCGGAC TCCGAGGGGG • 660 

TGGCAGCCCC GCGCCGCCTC CGCCGTGCCG CCGACCACG* TGTGGGCTCT- GAGCTGCCGC 720 

CTGAGGGCGT GCTGGGGGCG CTGCTGCGTG TGAAACGCCT, AGAGACCCCG GCGCCCCAGG 7 B0 

TGCCTGCACG CCGCCTCTTGv CCACCCTGAG CACTGCCCGG ATCCCGTGCA CCCTGGGACC 840 

CAGAAGTGCC CCCGCCATCC CGCCACCAGG ACTTCTCCCC GCCAGCACGT CCAGAGCAAC 900 

TTACCCCGGC CAGCCAGCCC TCTCACCCGA GGATCCCTAC CCCCTGGCCC ACAATAACAT 960 
GATCTGAGC 

Seq ID NO: 643 Protein sequence 
Protein Accession ft: NP__0 374 03.1 

1 11 . 1 21 31 41 51 

MAGSPLLWGP RAGGVGLLVL LLLGLFRPPP ALCARPVKEP RGLSAASPPL AETGAPRRFR 60 

RSVPRGEAAG AVQELARALA HLLEAERQER- ARAEAQEAED QQARVLAQLL RVWGAPRNSD 120 

PALGLDDDPD APAAQLARAL LRARLDPAAL AAQbVPAPVP AAALRPRPPV YDDGPAGPDA 160 

EEAGDETPDV DPELLRYLLG RILAGSADSE GVAAPRRLRR AADHDVGSEL PPEGVLGALL 240 
RVKRLETPAP QVPARRLLPP - . 

Seq ID NO: 644 DNA sequence 
Nucleic Acid Accession #: NM_002214 
Coding sequence: 681. .2990 

1 11 21 31 41 51 

CCCAGAGCCG CCTCCCCCTG TTGCTGGCAT CCCGAGCTTC CTCCCTTGCC AGCCAGGACG 60 

CTGCCGACTT GTCTTTGCCC GCTGCTCCGC AGACGGGGCT GCAAAGCTGC AACTAATGGT. 120 

GTTGGCCTCC CTGCCCACCT GTGGAAGCAA CTGCGCTGAT- TGATGCGCCA CAGACTTTTT 180 

TCCCCTCGAC CTCGCCGGCG TACCCTCCCA' CAGATCCAGC ATCACCCAGT GAATGTACAT- 240 

TAGGGTGGTT. TCCCCCCCAG CTTCGGGCTT TGTTTGGGTT TGATTGTGTT TGGCTCTTCG 300 

CTAAGCTGAT TTATGCAGCA GAAGCCCCAC CGGCTGGAGA' GAAACAAAAG CTCTTTTCTT 360 

TGTCCCGGAG CAGGCTGCGG AGCCCTTGCA GAGCCCTCTC TCCAGTCGCC GCCGGGCCCT 420 

TCGCCGTCGA AGGAGGTGCT TCTCGCGGAG ACCGCGGGAC CCGCCGTGCC GAGCCGGGAG 480 

GGCCGTAGGG GCCCTGAGAT GCCGAGCGGT GCCCGGGCCC GCTTACCTGC ACCGCTTGCT S40 

CCGAGCCGCG GGGTCCGCCT GCTAGGCCTG CGGAAAACGT CCTAGCGACA CTCGCCCGCG 600 

GGCCCCGAGG TCGCCCGGGA GGCCGAGCCC GCGTCCGGAA GGCAGCCAGG CGGCGGGCGC 660 

GGGGCGGGCT GTTTTGCATT .ATGTGCGGCT CGGCCCTGGC TTTTTTTACC GCTGCATTTG 720 

7CTGCCTGCA AAACGACCGG CGAGGTCCCG CCTCGTTCCT CTGGGCAGCC TGGGTGTTTT 780 

CACTTGTTCT TGGACTGGGC CAAGGTGAAG. ■ ACAATAGATG TGCATCTTCA AATGCAGCAT 840 

CCTGTGCCAG GTGCCTTGCG CTGGGTCCAG AATGTGGATG' GTGTGTTCAA GAGGATTTCA 900 

TTTCAGGTGG ATCAAGAAGT GAACGTTGTG ATATTGTTTC CAATTTAATA AGCAAAGGCT 960 

GCTCAGTTGA TTCAATAGAA TACCCATCTG TGCATGTTAT AATACCCACT GAAAATGAAA 1020 

TTAATACCCA GGTGACACCA GGAGAAGTGT CTATCCAGCT GCGTCCAGGA GCCGAAGCTA .1080 

ATTTTATGCT GAAAGTTCAT CCTCTGAAGA AATATCCTGT GGATCTTTAT TATCTTGTTG 1140 

ATGTCTCAGC ATCAATGCAC AATAATATAG AAAAATTAAA TTCCGTTGGA AACGATTTAT 1200 

CTAGAAAAAT GGCATTTTTC TCCCGTGACT TTCGTCTTGG ATTTGGCTCA TACGTTGATA 1260 

AAACAGTTTC ACCATACATT AGCATCCACC CCGAAAGGAT TCATAATCAA TGCAGTGACT 1320 

ACAATTTAGA CTGCATGCCT. CCCCATGGAT ACATCCATGT GCTGTCTTTG ACAGAGAACA 1380 

TCACTGAGTT TGAGAAAGCA GTTCATAGAC AGAAGATCTC TGGAAACATA GATACACCAG 1440 

A^GAGGTTT TGACGCCATG CTTCAGGCAG CTGTCTGTGA AAGTCATATC GGATGGCGAA 1500 

LtG^GCTAA AAGATTGCTG CTGGTGATGA CAGATCAGAC GTCTCATCTC GCTCTTGATA 1560 



432 
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WO 02/086443 

GCAAATTGGC AGGCATAGTG GTGCCCAATG ACGGAAACTG T CATCT GAAA 
ACGTCAAATC GACAACCATG GAACACCCCT CACTA6GCCA ACTTTCAGAG 
ACAACAACAT "TAATGTCATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG 
TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT 
ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT 
ACCAGGTACA AGGCATCTAT- TTTAACATTA CCGCCATCTG TCCAGATGGG 

CAGGCATGGA" AGGATGCAGA aacgtgacga gcaatgatga agttcttttc 

TTACAATGAA AAAATGTGAT. GTCACAGGAG GAAAAAACTA TGCAATAATC 
GTTTTAATGA AACCGCTAAA" ATTCATATAC ACAGAAACTG CAGCTGTCAG 
ACAGAGGACC • TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG 
GTGATGAGAA" TAAATGTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT 
ACAAGGATCA GCCTGTTTGC AGTGGTOGAG GAGTTTGTGT TTGTGGGAAA 
ACAAAATTAA GCTTGGAAAA* GTGTATGGAA AATACTGTGA AAAGGATGAC 
CATATCACCA TGGAAATCTG TGTGCTGGGC ATGGAGAGTG TGAAGCAGGC 
GCTTCAGTGG CTGGGAAGGT. GATCGATGCC AGTGCCCTTC AGCAGCAGCC 
TCAATTCAAA GGGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA 
GCACCGATCC CAGGAGCATC GGCCGCTTCT GTGAACACTG CCCCACCTGT 
GCAAGGAAAA CTGGAATTGT ATGCAATGCC TTCACCCTCA CAATTTGTCT 
TTGATCAGTG CAAAACCTCA TGTGCTCTCA TGGAACAACA GCATTATOTC 
CAGAATGTTT. CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC 
TCTTGATTGG GTTGCTTAAA GTCCTGATCA * TTAGACAGGT GATACTACAA 
ATAAAATTAA GTCCTCATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT 
TGCAAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGTGA GAAGCCTGAA 
TGGATATCAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA 
TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT 
AAGTCACAGG AGGAGACAAA TTGCTCACGG TCATGCCAGT • TGCTGGTTGT 
GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT 
AAAATGTGTC TTACTACTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT 
ACTTATTTAG ATCAGCATAG ; AATGTAGATC CTCTGAAGAG CACTGATTAC 
TACCTGTTAT CCCTACGCTT- CCCAGAGAGA* ACAATGCTGT GAGAGAGTTT 
CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA 
TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA 
ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC: 
AAAGATTATT. GCTTTTTAAA GTGTGTAGTT. TTATGCATGT GTGTTTATGG 
TTTGCAAGAT GGATACTAAT. TCCAGCATTC TCTCCTCTTT GCCTTTATGT 
TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA . TTAATTAAGT 
TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT 
GAATGTTAA ... 

Seq ID KO: 645 Protein sequence 
Protein AcceaBion ft: NP_002205 



PCTAJS02/12476, 



i 
I 

HCGSAIAFFT 
LGPECGWCVQ 
GEVSIQliRPG 
SRDFRLGFGS 
VHRQKISGNI 
VPNDGNCHLK 
TIAGEIESKA 
NVTSNDEVLF 
CVDETFLDSK 
VYGKYCEKDD 
CSGRGTCVCG 
CALMEQQHYV 
DYRVSASKKD 



11 
I 

AAFVCLiQNDR 
EDFISGGSRS 
AEANFMLKVH 
YVDKTVSPYI 
DTPEGGFDAM 
NNVYVKSTTM 
ANLNNLWEA 
NVTVTMXKCD 
CFQCDENKCH 
FSCPYHHGNL 
RCECTDPRSI 
DQTSECFSSP 
KLILQSVCTR 



21 
I 

RGPASFLWAA 
ERCDIVSNLI 
PLKKYPVDLY 
SIHPERIHNQ 
LQAAVCESHI 
EHPSLGQLSE 
YQKLISEVKV 
VTGGKNYAII 
FDEDQFSSES 
CAGHGECEAG 
GRFCEHCPTC 
SYLRIFFIIF 
AVTYRREKPE 



31 
I 

WVFSLVLGLG 
SKGCSVDSIE 
YLVDVSASMH 
CSDYNLDCMP 
GWRKEAKRLL 
KLIDNNINVI 
QVENQVQGIY 
KPIGFKETAK 
CKSHKDQPVC 
RCQCFSGWEG 
YTACKENWNC 
IVTFLIGLLK 
EIKMDISKIiN 



41 

I 

QGEDNRCASS 
YPSVHVIIPT 
NNIEKLNSVG 
PHGYIHVLSL 
LVMTDQTSHL 
FAVQGKQFHW 
FNITAICPDG 
IHIHRNCSCQ 
SGRGVCVCGK 
DRCQCPSAAA 
MQCLHPHNLS 
VLIIRQVILQ 
AHETFRCNF 



Seq ID KO: 646 DNA sequence 

Nucleic Acid Accession 8: NM_003318.1 

Coding sequence: 1..2574 



1 
I 

atggaatccg 
agagacatta 
atttctgctg 
aacccagagg: 

GATGCTCTTT 
GATAAATATG 
GCTATTCAAG 
AAATTTGCTT 
AAAAGTAAAC 
GAAATTGCCC 
AAGAATTTAT 
CATTTACAGA 
TTATATGGAG 
CAAACTAACA 
AGCCCAGATT 
ACCTCTAGAT 
TCCTGTGAAT 
TCAGATGAAA 
GAATCAAGTC 
GAGAGTAACC 
GCTGCATCTT 
AAACATACCA 
ACATCTAAAT 



11 
I 

AGGATTTAAG 
AAAATAAGTT 
ATACTACAGA 
ACTGGTTGAG 
TAAATAAATT 
GCCAAAATGA 
AGCCAGATGA 
TTGTTCATAT 
AACTTCTTCA 
TGCGGAATTT 
CAGCATCTAC 
ATAGGAACAA 
AGAACATGCC 
AAACTAAACA 
GTGATGTGAA 
CAGAATGCCG 
TAAGAAATTT 
AGAGTTCTGA 
TTCTAGCTAA 
AGAAACAGTG 
CAAATCACTG 
CTTTTGAGCA 
GGTTTGACCC 



21 
I 

TGGCAGAGAA 
TAAAAATGAA 
TAACTCGGGA 
TTTGTTGCTC 
GATTGGTCGT 
GAGTTTTGCT 
TGCACGTGAC 
ATCTTTTGCA 
AAAAGCTGTA 
AAACCTCCAA 
GGTATTAACT 
CAGTTGTGAT 
ACCACAAGAT 
GTCATGCCGA 
GACAGATGAT- 
AGATTTGGTT 
AAAGTCTGTT 
ACTTATTATT ■ 
ATTAGAAGAA- 
GCAATCTAAG 
GCAGATTCCG 
ACCTGTCTTT 
AAAATCTATT 



31 
I 

TTGACAATTG 
GACCTTACTG 
ACTGTTAACC 
AAACTAGAGA 
TACAGTCAAG 
AGAATTCAAG 
TACTTTCAAA 
CAATTTGAAC 
GAACGTGGAG 
AAAAAGCAGC 
GCCCAAGAAT 
TCCAGAGGAC 
GCAGAAATAG 
TTTGGAAGAG 
TCAGTTGTAC 
GTGCCTGGAT 
CAAAATAGTC 
ACTGATTCAA 
ACTAAAGAGT 
AGAAAGTCAG 
GAGTTAGCCC 
TCAGTTTCAA 
TGTAAGACAC 



41 
I 

ATTCCATAAT 
ATGAACTAAG 
AAATTATGAT 
AAAACAGTGT 
CAATTGAAGC 
TGAGATTTGC 
TGGCCAGAGC 
TGTCACAAGG 
CAGTACCACT 
TGCTTTCAGA 
CATTTTCCGG 
AGACTACTAA 
GTTACCGGAA 
TCCCAGTTAA 
CTTGTTTTAT 
CTAAACCAAG 
ATTTCAAGGA 
TAACCCTGAA 
ATCAAGAACC 
AGTGTATTAA 
GAAAAGTTAA 
AACAGTCACC 
CAAGCAGCAA 



AACAACGTCT- 

AAATTAATAG. 

TATAAGGATC 

GCAAACCTCA • 

CAGGTGGAAA 

TCCAGAAAGC " 

AATGTAACAG. 

AAACCTATTG 

TGTGAGGACA 

TGTTTCCAGT 

TGCAAGTCAC 

TGTTCATGTC 

TTTTCTTGTC 

AGATGCCAAT 

CAGCACTGTG . 

AGGTGTGAGT 

TATACAGCCT 

CAGGCTATAC " 

GACCAAACTT 

ATAGTTACAT • 

TGGAATAGTA 

AAGTTGATTC 

GAAATAAAAA 

AAAAAGATTT 

ATAATTTTAA 

ACACTCGAAC. 

TTTTCAGAGA 

GTAATATATA 

ACTTTACAGG 

AGCATTGTGT 

ATCTGGCAAG 

GAACAGCTAG. 

CTTAATCTTA 

TTTGCTTATT 

TTTGTTTTCT 

GCTAAGTTAC 

TTTAAAAGCT 



51 
I 

NAASCARCLA 
ENEINTQVTP 
NDLSRKMAFF 
TENITEFEKA 
ALDSKLAGIV 
YKDLLPLLPG 
SRKPGMEGCR 
CEDNRGPKGK 
CSCHKIKLGK 
QHCVNSKGQV 
QAILDQCKTS 
WNSNKIKSSS 



51- 
I 

GAACAAAGTG 
CTTGAATAAA 
GATGGCAAAC 
TCCGCTAAGT 
GCTTCCCCCA 
TGAATTAAAA 
AAACTGCAAG 
TAATGTCAAA 
AGAAATGCTG 
GGAGGAAAAG 
TTCACTTGGG 
AGCCAGGTTT 
TTCATTGAGA 
CCTTCTAAAT 
GAAAAGACAA 
TGGAAATGAT 
ACCTCTGGTG . 
GAATAAAACG 
AGAGGTTCCA 
CCAGAATCCT 
TACAGAGCAG 
ACCAATATCA 
TACCTTGGAT 
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WO 02/086443 

GATTACATGA GCTGTTTTAG AACTCCAGTT GTAAAGAATG ACTTTCCACC 
TTGTCAACAC CTTATGGCCA ACCTGCCTGT TTCCAGCAGC AACAGCATCA 
ACTCCACTTC AAAATTTACA GGTTTTAGCA TCTTCTTCAG CAAATGAATG 
AAAGGAAGAA TTTATTCCAT TTTAAAGCAG ATAGGAAGTG GAGGTTCAAG 
CAGGTGTTAA • ATGAAAAG AA ACAGATATAT GCTATAAAAT ATGTGAACTT 
GATAACCAAA CTCTTGATAG TTACCGGAAC GAAATAGCTT ATTTGAATAA 
CACAGTGATA AGATCATCCG ACTTTATGAT TATGAAATCA CGGACCAGTA 
GTAATGGAGT GTGGAAATAT TGATCTTAAT AGTTGGCTTA AAAAGAAAAA 
CCATGGGAAC GCAAGAGTTA CTGGAAAAAT ATGTTAGAGG CAGTTCACAC 
CATGGCATTG • TTCACAGTGA TCTTAAACCA GCTAACTTTC TGATAGTTGA 
AAGCTAATTG ATTTTCGGAT TGCAAACCAA ATGCAACCAG ATACAACAAG 
GATTCTCAGG TTGGCACAGT- TAATTATATG CCACCAGAAG CAATCAAAGA 
TCCAGAGAGA* ATGGGAAATC TAAGTCAAAG ATAAGCCCCA AAAGTGATGT 
GGATGTATTT TGTACTATAT GACTTACGGG AAAACACCAT TTCAGCAGAT 
A7TTCTAAAT TACATGCCAT AATTGATCCT AATCATGAAA TTGAATTTCC 
GAGAAAGATC TTCAAGATGT GTTAAAGTGT TGTTTAAAAA GGGACCCAAA 
TCCATTCCTG AGCTCCTGGC TCATCCCTAT- GTTCAAATTC AAACTCATCC 
ATGGCCAAGG GAACCACTGA AGAAATGAAA- TATGTTCTGG GCCAACTTGT 
TCTCCTAACT CCATTTTGAA: AGCTGCTAAA ACTTTATATG AACACTATAG 
AGTCATAATT CTTCATCCTC CAAGACTTTT GAAAAAAAAA GGGGAAAAAA 

•Seq lb NO: 647 Protein sequence 
Protein Accession ft: NP_003309.1 



PCT/US02/12476 



TGCTTGTCAG 

AATACTTGCC 

CATTTOGGTT ' 

CAAGGTATTT. 

AGAAGAAGCA 

ACTACAACAA ■ 

CATCTACATG 

ATCCATTGAT 

AATCCATCAA 

TGGAATGCTA 

TGTTGTTAAA 

TATGTCTTCC 

TTGGTCCTTA 

AATTAATCAG.. 

CGATATTCCA 

ACAGAGGATA 

AGTTAACCAA . 

TGGTCTGAAT 

TGGTGGTGAA 

ATGA 



MESEDLSGRE 
NPEDWLSLLL 
AIQEPDDARD 
EIALRNLNLQ. 
LYGENMPPQD 
TSRSECRDLV. 
ESSLLAKLEE. 
KHTTFEQPVF- 
LSTPYGQPAC 
QVLNEKKQIY 
VMECGNIDLN 
KLIDFGIANQ. 
GCILYYMTYG 
SIPEUiAHPY 
SHNSSSSKTF 



11 
I 

LTIDSIMNKV 
KLEKNSVPLS 
YFQMARANCK 
KKQbhSEEEKr 
AEIGYRNSLR 
VPGSKPSGND" 
*TKEYQEPEVP 
SVSKQSPPIS' 
FQQQQHQILA 
AI K YVNLEEA V 
SWLKKKKSID 
MQPDTTSWK 
KTPFQQIINQ 
VQIQTHPVNQ 
EKKRGKK 



21 
I 

RDIKNKFKNE 
DALLNKLIGR 
KFAFVHISFA 
KNLSASTVLT 
QTNKTKQSCP 
SCELRNLKSV 
ESNQKQWQSK 
TSKWFDPKSI- 
TPLQNLQVIA- 
DNOTLDSYRN 
PWERKSYWKN 
DSQVGTVNYM 
ISKLHAIIDP 
MAKGTTEEMK 



31 



41 



I 

DLTDEL5LKK 
YSQAIEAXiPP 
QFELSQGNVK 
AQBSFSGSLG 
FGRVPVNLLN 
QNSHFKEPLV. 
RKSECINQNP. 
CKTPSSNTLD 
SSSANECISV' 
EIAYLNKLQQ 
MLEAVHTIHQ. 
PPEAIKDMSS 
NHEIEFPDIP 
YVLGQLVGLN 



I 

ISADTTDNSG 
DKYGQHESFA 
KSKQLLQKAV 
HLONRKNSCD 
SPDCDVKTDD 
SDEKSSELII 
AASSNHWQIP 
DYMSCFRTPV 
KGRIYSILKQ 
HSDKIIRLYD 
HGIVHSDLKP. 
SRENGKSKSK 
EKDLQDVLKC 
SPNSILKAAK 



51 
1 

TVNQIMMMAN 

RIQVRFAELK 

ERGAVPLEML 

SRGQTTKARF 

SWPCFMKRQ 

TDSITIiKNKT 

ELARKVNTEQ .-. 

VKNDFPPACQ' 

IGSGGSSKVF 

YEITDQYIYM 

ANFLIVDGML 

ISPKSDVWSL 

CLKRDPKQRI 

TLYEHYSGGE 



Seq ID NO: 648 DNA sequence 
Nucleic Acid Accession ft: NM_015507 
Coding sequence: 241.. 1902 



CCGCAGAGGA 
CGCCCCTGCC 
CGAGTGGAGC 
GGGTCCGGCC 
ATGCCTCTGC 
GGGAACGCGG 
TGTCACTATG 
TGTGAAGCTA 
AGATGCTTTC 
AAACCCCGGC 
CTCAGTGGCC 
ATAAACTGTC 
TCAGGACTCC 
GGTAAAGTCA 
AAATGTCACA 
AATGAATGTA 
GGGTCCTTCA 
ATCCCTGAAA ; 
AAGAAGTTGC 
CCAGAACCCA 
ATAGTTTCCA 
GAGGGGCTTG 
AGCCTGCGAG 
CTGGTCCAAA 
GACTGCAGCT 
TGGAATCCTG. 
GGTCACAAGA- 
AACTTCTGTT 
TTTGTGAAAA- 
TGGAAGACAG 
GAAGCAGAAC 
TCAGGCTTAT 
TTGACTTTGT 
TTAGAATTAC 
TCTTGTATAA ■ 
TTTCTGAATC 
CAGTATATCT 
TAGAAAAAAA 
TATGACATCA 



11 

! 

GCCTCGGCCA 
GCGGTGCCTG 
GGAGGACCCG 
GGCGCCCTCC 
CCTGGAGCCT 
CCAGTGCAAG 
GAACTAAACT 
CATGCGAACC 
CAGGATACAC 
CATGCCAACA 
ACATGCTCAT 
AGTACAGCTG 
GCCTGGCCCC 
TCTGTCCCTA 
TTGGTTTCGA 

ctatggatag: 
agtgtaaatg 

ATTCTGTGAA 
TTGCTCACAA 
CCAGGACTCC 
GAGGCGGGAA 
AGGATGAGAA 
GAGATGTGTT 
GGAAAGCGCT 
TCAATCATGG 
CTGATCGAGA 
AAGACATTGG 
TGCTCTTTGA 
ACAGTAACAA 
GGAAAATTCA 
GTGGCAAGGG 
GTCCAGATAG 
ATGTCAGTTC 
TAGCTGAAAA 
GATATGCCAA 
TTTCCACATT 
GATTTGTATA 
AGCACAGAGA 
AAGATAGACT 



21 
I 

GGCTAGCCAG 
GCCTCCCCTC 
AGCGGCTGAG 
CGAGGGGGGC 
TGCGCTCCCG 
GCATCACGGG 
GGCCTGCTGC 
TGGATGTAAG 
CGGGAAAACC 
CAGATGTGTG 
GCCAGATGCT 
TGAAGACACA 
AAATGGAAGA 
CAATCGAAGA 
ACTGCAATAT 
CCATACGTGC 
CAAGCAGGGA 
GGAAGTCCTC 
AAACAGCATG 
TACCCCTAAG 
CTCTCATGGA 
AAGAGAAGAG 
TTTCCCTAAG 
AACTTCCAAA 
GATCTGTGAC 
TAATGCTATT 
CCGATTGAAA 
TTACCGGCTG 
TGCCCTGGCA 
GTTGTATCAA 
CAAAACCGGC 
CCTTTTATCT 
CCTGGTTTTT 
ATTGTAATGT 
TATTTGCTTT 
ATATTATAAA 
AGTAAGTTGA 
AATGTTTAAC 
TTTGCCTAAG 



31 

i 

GGCGCCCCCA 
CCAGACTGCA 
GAGAGAGGAG 
TCAGGAGGAG 
CTGCTGCTCT 
TTGTTAGCAT: 
TACGGCTGGA 
TTTGGTGAGT 
TGCAGTCAAG 
AATACACACG 
ACGTGTGTGA 
GAAGAAGGGC 
GACTGTCTAG 
TGTGTGAACA 
ATCAGTGGAC 
AGCCACCATG 
TATAAAGGCA 
AGAGCACCTG 
AAAAAGAAGG 
GTGAACTTGC 
GGTAAAAAAG 
AAAGCCCTGA 
GTGAATGAAG 
CTGGAACATA 
TGGAAACAGG 
GGCTTCTATA 
CTTCTCCTAC 
GCCGGAGACA 
TGGGAGAAGA 
GGAACTGATG 
GAAATCGCAG 
GTGGATGACT 
TTGATATTGC 
ACCAACAGAA 
AAATATCATA 
ATATGGAAAT 
TGAGCTTCTC 
TGTTTGACTC 
TGGCTTAGCT 



41 
I 

GCCCCTCCCC 
GGGACAGCAC 
GCGGCGGCTT 
GAAGGAGGAC 
CCTGGGTGGC 
CGGCACGTCA 
GAAGAAACAG 
GCGTGGGACC 
ATGTGAArGA 
GAAGCTACAA 
ACTCTAGGAC 
CACAGTGCCT 
ATATTGATGA 
CATTTGGAAG. 
GATATGACTG 
CCAATTGC7T 
ATGGACTTCG 
GTACCATCAA 
CAAAAATTAA 
AGCCCTTCAA 
GGAATGAAGA 
AGAATGACAT 
CAGGTGAATT 
AAGATTTAAA 
ATAGAGAAGA 
TGGCAGTTCC 
CTGACCTGCA 
AAGTCGGGAA 
CCACGAGTGA 
CTACCAAAAG 
TGGATGGCGT 
GAATGTTACT 
ATCATAGGAC 
ATATTATTGT 
TCACTGTATC 
GTCAGTTTAT 
TCTACAACAT 
TTATGATACT 
GGGTCTTTCA 



51 
I 

AGGCCGCGAG 
CCGGTAACTG 
AGCTGCTACG 
CCGTGCGAGA 
AGGTGGTTTC 
GCCTGGGGTC 
CAAGGGAGTC 
AAACAAATGC 
GTGTGGAATG 
GTGCTTTTGC 
ATGTGCCATG 
GTGTCCATCC 
ATGTGCCTCT 
CTACrACTGC 
TATAGATATA ■ 
CAATACCCAA 
GTGTTCTGCT' 
AGACAGAATC 
AAATGTTACC 
CTATGAAGAG 
GAAAATGAAA 
AGAGGAGCGA 
CGGCCTGATT 
TATCTCGGTT 
TGATTTTGAC 
GGCCTTGGCA 
ACCCCAAAGC 
ACTTCGAGTG 
GGATGAAAAG 
CATCATTTTT 
CTTGCTTGTT 
ATCTTTATAT 
CTCTGGCATT 
AAGATGCCTT 
TTCTCAGTCA 
CTCCCCTCCT 
TTCTAGAAAA 
TCTTGGAAAC 
TAGCCAAACT 
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TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 



PCT/US02/12476 



Seq ID NO: 649 Protein sequence 
Protein Accession tf: NP_056322 
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1 
I 

MPLPWSLALP 
CEATCEPGCK 
LSGHMLMPDA 
GKVICPYNRR 
GSFKCKCKQG 
PEPTRTPTPK 
SLRGDVFFPK 
WNPADRDNAI 
PVKNSNNALA" 
SGLCPDSLLS 



11 

I 

LLLSMVAGGF 
FGECVGPKKC 
TCVNSRTCAM 
CVNTFGSYYC 
YKGNGLRCSA' 
VNLQPFNYEE 
VNEAGEFGLI 
GFYMAVPALA 
WEKTTSEDEK 
VDD 



21 
I 

GNAASARHHG 
RCFPGYTGKT 
INCQYSCEDT 
KCHIGFELCY 
IPENSVKEVL 
IVSRGGNSHG 
LVQRKALTSK 
GHKKDIGRLK 
WKTGKIQLYQ 



31 
I 

LLASARQPGV 
CSQDVNECGM 
EEGPQCLCPS 
ISGRYDCIDI- 
RAPGTIKDRIv 
CKKGNEEKMK 
LKHKDLNISV 
LLXiPDLQPQS 
GTDATKSIIF 



41 
I 

CHYGTKLACC 
KPRPCQHRCV 
SGLR1APNGR 
NECTMDSHTC 
KKLLAHKNSM 
EGLEDEKREE 
DCSFNHGICD 
NFCLLFDYRL 
EAERGKGKTG 



51 
I 

YGWRRNSKGV 
NTHGSYXCFC 
DCLDIDECAS 
SHHANCFNTQ 
KKKAKIKNVT 
KALKNDIEER 
WKQDREDDFD' 
AGDKVGKLRV 
EIAVDGVLLV 



Seq ID NO: 650 DNA sequence 

Nucleic Acid Accession #: NM_003506.1 

Coding sequence: 259.-2379 



GCAGCTCCAG 
TTAGACGGGG 
CTCATTTTCA 1 
ATCTTTGGAT 
ATCAGGAATT 
CTCCTAAGAG. 
ATGGCCTACA- 
GCGGTGGAAA 
ACTTTCCTCT 
TGTCGTAAAC 
ATCCGATGGC 
GTAACTTTTG 
AGAGACATTG 
TTTCTGGGAA" 
CTAGAGTTTG" 
TTCACATTCC 
ATATATTACT 
GGCGATAGCA 
CTAGGCTCTC 
GCTGGCACTG 
TGGAGTTGTG 
CCAGGTTTCC 
GGAGTTTGCT 
CTGTGCCTTT 
CATGTTCGAC 
ATTCGAATTG 
TACGTCTATG 
CGTCAGTACC 
TTATTTATGA 
GGAAGCAAAA 
CCAATCAGTG 
TCTAAAGTTA 
TCCAAATCCA 
ATTACTAGCC 
ACATCAATGA 
TGTGGTGAAC 
GGGAAGGGCC 
AAGAGTGATA 
TCAGAACCAA 
AGAAAAGAGC 
CAGAAGCAAA 
TACGTTCTTC 
TCAAGAATAA 
AAATGTGCAG 
CCTTTTCTAT 
TTTACCTTTT 
GTATCTTTTT 
ATTCAAGTAT 
ATTTCTAAGA 
GTCTTATAAT- 
TGTGATTTTT 
GGTGCTTACT 
ATATTTAAAA 
GGCCAAGTGC 
AACCACTTAC 
TACATTTTGT 



11 
I 

TCCCGGACGC 
ACGGGAAGGG. 
GGAAAGCCTG' 
GGGGATCTTC 
TGAAGAAAAT- 
GGCACAGTCT 
ACATGACGTT 
TGGAGCATTT 
GCAAAGGATT 
TTTGTGAGAA' 
CTGAGGAGCT 
ATCCACACAC 
GATTTTGGTG- 
TTGACCAGTG 
CAAAAAGTTT 
TTACTTTTTT 
CTGTCTGTTA 
CAGCCTGCAA 
AAAATAAGGC 
TGTGGTGGGT 
AAGCCATCGA 
TGACTGTTAT 
TTGTTGGCCT 
GTGTGTTTGT 
AAGTCATACA 
GAGTCTTCAG 
AGCAAGTGAA 
ATATCCCATG 
TAAAATACCT- 
AGACATGCAC 
AAAGTCGAAG 
AACACAAAAA- 
TGGGAACCAG 
ATGATTACCT 
GAGAGGTGAA 
CTGCCTCGCC 
AGGCAGGCAG 
TTACTGACAC 
GCAGCCTCAA 
AGGGAGGTGG 
TTTGTGTTAC 
TTTTGCACTT 
TATGACTCAT 
GTTAATAATA 
TTATGAAGAT 
TGATATAAAA 
ATACATATTT 
TTTTATCATG 
AAATTGTAAA 
AGGAATTTAA 
ATAGTCTCGT ' 
CAAAGAGTGT 
TAAAATGTCC 
AATTGACTTC 
AGTTGCTTAT 
ATTATACAGT 



21 
1 

AACCCCGGAG 
ACAGCGGCCT 
AAAATGAGTA 
TGAGGATGCA' 
GGAGATGTTT 
CTTCACCTGT 
TTTCCCTAAT- 
TCTTCCTCTC' 
TGTACCAACC 
AGTATATTCT 
TGAATGTGAC 
AGAATTTCTT 
TCCAAGGCAT 
TGCGCCTCCA 
TATTGGAACA* 
AATTGATGTT 
CAGCATTGTA 
TAAGGCAGAT 
TTGCACCGTT 
GATTCTTACC 
GCAAAAAGCA 
GCTTCTTGCT 
TTATGACCTG 
TGGGCTCTCT 
ACATGATGGC 
CGGCTTGTAT 
CAGGATTACC 
TCCTTATCAG 
GATGACATTA 
AGAATGGGCT 
AGTACTACAG 
GAAGCACTAT 
CACAGGAGCT 
AGGACAAGAA 
AGCGGACGGA 
AGCAGCATCC 
TGTATCTGAA 
TGGCCTGGCA- 
AGGTTCCACA 
TTGTCATTCA 
ACTGGAAGTG 
AAAGTTGCAT 
TTCACACAAA 
TTTTTTTAAT 
TCTACTCTTG 
TCAAGATATT 
GAAAATAAGC 
CTATTGTGAT 
ATAGTCTTCT 
CTTTAAAAAC 
TTTAGGAATT 
CCACTATTGA 
TAAAGGGTTA 
CCTTTTTTAA 
ATTTTTTGTT 
ACCTTTCTCA 



31 

I 

CCGTCTCAGG 
TOGACCGCCC 
-AAATAGTGAA 
AAGAGTGATT 
ACATTTTTGT 
GAACCAATT A- 
CTGATGGGTC 
GCAAATCTGG 
TGCATAGAAC 
GATTGCAAAA 
AGATTACAAT 
GGTCCTCAGA 
CTTAAGACTT 
TGCCCCAACA 
GTTTCAATAT. 
AGAAGATTCA 
TCTCTTATGT 
GAGAAGCTAG 
TTGTTCATGC 
ATTACTTGGT 
GTGTGGTTTC 
CTGAACAAAG 
GATGCTTCTC 
CTTCTTTTAG 
CGGAACCAAG 
CTTGTGCCAT 
TGGGAGATAA" 
GCAAAAGCAA 
ATTGTTGGCA 
GGGTTTTTTA 
GAATCATGTG 
AAACCAAGTT 
ACAGCAAATC 
ACTTTGACAG 
GCTAGCACCC 
ATCTCCAGAC 
AGTGCGCGGA. 
CAGAGCAACA 
TCTCTGCTTG 
GATACTTGAA 
ACCTATGCAC 
TGCCTACTGT 
GGTTAATGAC 
AGTGTGGGAG 
GTAAGAGTAT 
TCTTTGCTGA 
TTATATGTAT 
ATTTTAGCAC 
TTTAT ACTGT 
CCACTTATTG 
TCACAGATCT 
TTGTATTATG 
GTAGACAAAA 
TGTTTCATGA 
TTAACTTTTG. 
GACATTTTGT 



41 
I 

TCCCTGGGGG 
CCCGAGTAAT 
ATGAGGAATT 
CATCCAAGCC 
TGACGTGTAT 
CTGTTCCCAG 
ATTATGACCA 
AATGTTCACC 
AAATTCATGT 
AATTAATTGA 
ACTGTGATGA 
AGAAAACAGA 
CTGGGGGACA 
TGTATTTTAA 
TTTGTCTTTG 
GATACCCAGA- 
ACTTCATTGG: 
AACTTGGTGA 
TTTTGTATTT 
TCTTAGCTGC 
ATGCTGTTGC 
TTGAAGGAGA 
GCTACTTTGT. 
CTGGCATTAT. 
AAAAACTAAA- 
TAGTGACACT 
CTTGGGTCTC 
AAGCTCGACC 
TCTCTGCTGT 
AACGAAATCG 
AGTTTTTCTT 
CACACAAGCT: 
ATGGCACTTC 
AAATCCAAAC 
CCAGGTTAAG 
TCTCTGGGGA 
GTGAAGGAAG 
ATTTGCAGGT 
TTCACCCAGT 
GAACATTTTC 
TGTTTTGTAA 
TATACTGGAA 
AACAATATAC 
GACAGAGTTA 
TTTAAGATGT. 
AGTATTTAAA 
TTGAACTTTT. 
TTTGGTAGCT 
AAAAAAAGAT 
ATACCTTACC 
AAATTATGTA 
CTGCTCACTG 
TGTTAGTCTT 
CCACXCATTG 
TTTCTTAACA 
AG 



51 
I 

GAACGGTGGG 

TGACCCAGGA 

TGAACATTTT 

ATGTGGTAAA 

TTTTCTACCC 

ATGTATGAAA 

GAGTATTGCC: 

AAACATTGAA'. 

GGTTCCACCT- 

CACTTTTGGG" 

GACTGTTCCT ■ 

ACAAGTCCAA. 

AGGATATAAG-- 

AAGTGATGAG . 

TGCAACTCTG .- 

GAGACCAATT. 

ATTTTTGCTG ; 

CACTGTTGTC 

TTTCACAATG 

AGGAAGAAAA 

ATGGGGAACA 

CAACATTAGT 

ACTCTTGCCA 

TTCCTTAAAT 

GAAATTTATG. 

TCTCGGATGT 

TGATCATTGT 

AGAATTGGCT 

CTTCTGGGTT 

CAAGAGAGAT- 

AAAGCACAAT •" 

GAAGGTCATT: 

TGCAGTAGCA 

CTCACCAGAA 

AGAACAGGAC 

ACAGGTCGAC 

GATTAGTCCA. 

CCCCAGTTCT- 

TTCAGGAGTG 

TCTCGTTACT 

GAATCACTGT 

AAAATAGAGT 

CTGAAAACAG 

GAGGAATCTT 

ACTATGCTAT 

TCTTATCCTT- 

TTGAAATCCT- 

TTTACACTGA 

ATACCAAAAA ' 

ATCTAAAATG 

ACTGAAATAA 

ATCCTTCTGC 

TTGTATATTA • 

ATTGTATTAT " 

TTTAGAATAT 
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2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940, 
3000 
3060 
3120 
3180 
3240 
3300 



Seq ID NO: 651 Protein sequence 
Protein Accession #: NP_003497.l 
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MEKFTFLLTC IFLPLLRGHS LFTCSPITVP RCMKMAYNMT FFPNLKGHYD QSIAAVEMSH 60 

FLPLANLECS PNISTFLCKA FVPTCIEQIH WPPCRXLCE KVYSDCKKLI DTFGIRWPEE 120 

LECDRLQYCD ETVPVTFDPH TEFLGPQKXT EQVQRDIGPW CPRHUCTSGG QGYKFLGIDQ- 180. 

CAPPCPNMYF KSDELEFAKS FIGTVSIFCL CATLFTFLTF LIDVRRFRYP ERPIIYYSVC 240 

YSIVSLMYFI GFLLGDSTAC NKADEKLELG DTWLGSQNK ACTVI*FMLLY FFTMAGTVWW 300 

VILTITWFLiA AGRKWSCEAI EQKAVWFHAV AKGTPGFLTV. MLLALNKVEG DNISGVCFVG 360 

LYDLDASRYP VLLPLCLCVF VGLSLLLAGI ISLNHVRQVI QHDGRNQEKL KKFMIRIGVF 420 

SGLYLVPLVT LLGCYVYEQV. NRITWEITWV SDHCRQYHIP CPYQAKAKAR PEIALFMIKY, 480 

LMTLIVGISA VFWVGSKKTC TEWAGFFKRN RKRDPISESR RVLQESCEFF LKHNSKVKHK- . 540 

KXKYKPSSHK LKVISKSMGT STGATANHGT SAVAITSHDY LGQETLTEIQ TSPBTSMREV 600 

KADGASTPRL REQDCGEPAS PAASISRLSG EQVDGKGQAG SVSESARSEG RISPKSDITD - 660 
TGLAQSNNLQ VPSSSEPSSL KGSTSLLVHP VSGVRKEQGG GCHSDT 

Seq ID NO: 652 .DNA sequence 

Nucleic Acid Accession g: NK_014791.1 

Coding sequence: 171.. 2126 

1 11 21 31 |1 51 

TTGGCGGGCG GAAGCGGCCA CAACCCGGCG ATCGAAAAGA TTGTTAGGAA CGCCGTACCA 60 

GCCGCGTCTC.. TCAGGACAGC AGGCCCCTGT CCTTCTGTCG GGCGCCGCTC AGCCGTGCCC 120 

TCCGCCCCTC AGGTTCTTTT. TCTAATTCCA AATAAACTTG CAAGAGGACT ATGAAAGATT 180 

ATGATGAACT TCTCAAATAT TATGAATTAC ATGAAACTAT TGGGACAGGT GGCTTTGCAA 240 

AGGTCAAACT. TGCCTGCCAT- ATCCTTACTG GAGAGATGGT AGCTATAAAA ATCATGGATA 300 

AAAACACACT AGGGAGTGAT TTGCCCCGGA TCAAAACGGA GATTGAGGCC TTGAAGAACC 360 

TGAGACATCA GCATATATGT- CAACTCTACC ATGTGCTAGA GACAGCCAAC AAAATATTCA 420 

TGGTTCTTGA GTACTGCCCT GGAGGAGAGC TGTTTGACTA TATAATTTCC CAGGATCGCC 480 

TGTCAGAAGA GGAGACCCGG GTTGTCTTCC GTCAGATAGT ATCTGCTGTT GCTTATGTGC 540 

ACAGCCAGGG CTATGCTCAC AGGGACCTCA AGCCAGAAAA TTTGCTGTTT- GATGAATATC 600 

ATAAATTAAA GCTGATTGAC TTTGGTCTCT GTGCAAAACC CAAGGGTAAC AAGGATTACC 660 

ATCTACAGAC ATGCTGTGGG AGTCTGGCTT ATGCAGCACC TGAGTTAATA CAAGGCAAAT 720 

CATATCTTGG ATCAGAGGCA-- GATGTTTGGA GCATGGGCAT ACTGTTATAT GTTCTTATGT 780 

GTGGATTTCT ACCATTTGAT GA TGATAA TG TAATGGCTTT ATACAAGAAG ATTATGAGAG: 840 

GAAAATATGA TGTTCCCAAG TGGCTCTCTC CCAGTAGCAT TCTGCTTCTT CAACAAATGC 900 

TGCAGGTGGA CCCAAAGAAA- CGGATTTCTA TGAAAAATCT ATTGAACCAT. CCCTGGATCA 960 

TGCAAGATTA CAACTATCCT. GTTGAGTGGC AAAGCAAGAA TCCTTTTATT CACCTCGATG 1020 

ATGATTGCGT AACAGAACTT TCTGTACATC ACAGAAACAA CAGGCAAACA : . ATGGAGGATT 1080 

TAATTTCACT.'gTGGCAGTAT. GATCACCTCA CGGCTACCTA TCTTCTGCTT CTAGCCAAGA 1140 

AGGCTCGGGG AAAACCAGTT CGTTTAAGGC TTTCTTCTTT - CTCCTGTGGA CAAGCCAGTG 1200 

CTACCCCATT CACAGACATC AAGTCAAATA' ATTGGAGTCT GGAAGATGTG ACCGCAAGTG 1260 

ATAAAAATTA TGTGGCGGGA TTAATAGACT ATGATTGGTG TGAAGATGAT. TTATCAACAG. 1320 

GTGCTGCTAC TCCCCGAACA TCACAGTTTA CCAAGTACTG GACAGAATCA AATGGGGTGG 1380 

AATCTAAATC ATTAACTCCA GCCTTATGCA GAACACCTGC AAATAAATTA AAGAACAAAG 1440 

AAAATGTATA TACTCCTAAG TCTGCTGTAA AGAATGAAGA GTACTTTATG" TTTCCTGAGC 1500 

CAAAGACTCC AGTTAATAAG AACCAGCATA AGAGAGAAAT ACTCACTACG CCAAATCGTT 1560 

ACACTACACC CTCAAAAGCT AGAAACCAGT GCCTGAAAGA AACTCCAATT AAAATACCAG 1620 

TAAATTCAAC AGGAACAGAC AAGTTAATGA CAGGTGTCAT TAGCCCTGAG AGGCGGTGCC 1680 

GCTCAGTGGA ATTGGATCTC AACCAAGCAC ATATGGAGGA GACTCCAAAA AGAAAGGGAG 1740 

CCAAAGTGTT TGGGAGCCTT GAAAGGGGGT TGGATAAGGT TATCACTGTG CTCACCAGGA 1800 

GCAAAAGGAA GGGTTCTGCC AGAGACGGGC CCAGAAGACT AAAGCTTCAC TATAATGTGA 1860 

CTACAACTAG ATTAGTGAAT ' CCAGATCAAC. TGTTGAATGA AATAATGTCT ATTCTTCCAA 1920 

AGAAGCATGT TGACTTTGTA CAAAAGGGTT ATACACTGAA GTGTCAAACA CAGTCAGATT 1980 

TTGGGAAAGT GACAATGCAA TTTGAATTAG AAGTGTGCCA GCTTCAAAAA CCCGATGTGG 2040 

TGGGTATCAG GAGGCAGCGG CTTAAGGGCG ATGCCTGGGT TTACAAAAGA TTAGTGGAAG . 2100 

ACATCCTATC TAGCTGCAAG. GTATAATTGA TGGATTCTTC CATCCTGCCG GATGAGTGTG 2160 

GGTGTGATAC AGCCTACATA AAGACTGTTA TGATCGCTTT GATTTTAAAG TTCATTGGAA 2220 

CTACCAACTT GTTTCTAAAG - AGCTATCTTA AGACCAATAT« CTCTTTGTTT TTAAACAAAA 2280 

GATATTATTT TGTGTATGAA TCTAAATCAA GCCCATCTGT CATTATGTTA CTGTCTTTTT- 2340 
TAATCATGTG GTTTTGTATA TTAATAATTG TTGACTTTCT TAGATTCACT TCCATATGTG .2400 

AATGTAAGCT . CTTAACTATG TCTCTTTGTA ATGTGTAATT TCTTTCTGAA ATAAAACCAT 2460 
TTGTGAATAT 

Seq ID NO: 653 Protein sequence 
protein Accession tf: NP_055606.1 

1 11 . 21 31 41 51 

MKDYDELLKY YELHETIGTG GFAKVKLACH ILTGEMVAIK IMDKNTLGSD LPRIKTEIEA ■ 60 

LKNLRHQHIC QLYHVLETAN KIPMVLEYCP GGELPDYIIS QDRLSEEETR WFRQIVSAV 120 

AYVHSQGYAH RDLKPENLLF DEYHKLKLID FGLCAKPKGN KDYHLQTCCG SLAYAAPELI 180 

QGKSYLGSEA DVWSMGILLY VLMCGFLPFD DDNVMALYKK IMRGKYDVPK WLSPSSILLL 240 

QQMLQVDPKK RISMKNLLNH PWIMODYNYP VEWQSKNPFI HLDDDCVTEL SVHHRNNRQT . 300 

MEDLISLWQY DHLTATYLLL' LAKKARGKPV RLRLSSFSCG QASATPFTDI KSNNWSLEDV 360 

TASDKNYVAG LIDYDWCEDD LSTGAATPRT SQFTKYWTES NGVESKSLTP ALCRTPANKL 420 

KNKEHVYTPK SAVKNEEYFM FPEPKTPVNK NQHKREILTT PNRYTTPSKA RNQCLKETPI . 480 

KIPVNSTGTD KLMTGVISPE RRCRSVELDL NQAHMEETPK RKGAKVFGSL ERGLDKVITV 540 

LTRSKRXGSA RDGPRRLKLH YNVTTTRI/VN PDQLLNEIMS ILPKKEVDFV QKGYTLKCQT 600 
QSDFGKVTMQ PELEVCQLQK PDWGIRRQR LKGDAWVYKR LVEDILSSCK V 

Seq ID NO: 654 DNA sequence 
Nucleic Acid Accession ft: NM_000532 
Coding sequence: 88.. 990 

1 11 21 31 41 51 

GCAGAGCACA GCATCGTCGG GACCAGACTC GTCTCAGGCC AGTTGCAGCC TTCTCAGCCA 60 

AACGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAG TGATTTGCTT TTGCCTCCTA 120 
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GGCATCACCT GTGCCATACC AGTTAAACAG GCTGATTCTG GAAGTTCTGA 
CTTTACAACA AATACCCAGA TGCTGTGGCC ACATGGCTAA ACCCTGACCC 
CAGAATCTCC TAGCCCCACA GACCCTTCCA AGTAAGTCCA ACGAAAGCCA 
GATGATATGG ATGATGAAGA TGATGATGAC CATGTGGACA GCCAGGACTC 
AACGACTCTG ATGATGTAGA TGACACTGAT GATTCTCACC AGTCTGATGA 
TCTGATGAAT CTGATGAACT GGTCACTGAT TTTCCCACGG ACCTGCCAGC 
TTCACTCCAG TTGTCCCCAC AGTAGACACA TATGATGGCC GAGGTGATAG 
GGACTGAGGT CAAAATCTAA GAAGTTTCGC' AGACCTGACA TCCAGTACCC 
GACGAGGACA • TCACCTCACA CATGGAAAGC GAGGAGTTGA ATGGTGCATA 
CCCGTTGCCC AGGACCTGAA CGCGCCTTCT GATTGGGACA GCCGTGGGAA 
GAAACGAGTC AGCTGGATGA CCAGAGTGCT GAAACCCACA GCCACAAGCA 
TATAAGCGGA-AAGCCAATGA TGAGAGCAAT GAGCATTCCG ATGTGATTGA 
CTTTCCAAAG : TCAGCCGTGA- ATTCCACAGC CATGAATTTC ACAGCCATGA 
GTTGTAGACC CCAAAAGTAA GGAAGAAGAT AAACACCTGA AATTTCGTAT. 
TTAGATAGTG CATCTTCTGA GGTCAATTAA- AAGGAGAAAA AATACAATTT 
ATTTAGTCAA AAGAAAAAAT GCTTTATAGC AAAATGAAAG AGAACATGAA 
CTCAGTTTAT. TGGTTGAATG- TGTATCTATT TGAGTCTGGA AATAACTAAT 
ATTAGTTTAG TTTGTGGCTT CATGGAAACT CCCTGTAAAC TAAAAGCTTC 
CTATGTTCAT TCTATAGAAG AAATGCAAAC: TATCACTGTA TTTTAATATT 
TCATGAATAG AAATTTATGT AGAAGCAAAC. AAAATACTTT TACCCACTTA 
ATAACATTTT ATGTCACTAT AATCTTTTGT TTTTTAAGTT AGTGTATATT- 
TATCTTTTTG TGGTGTGAAT AAATCTTTTA TCTTGAATGT. AATAAGAATT 
AATTGCTTAT TTGTTTTCCC ACGGTTGTCC AGCAATTAAT AAAACATAAC 
GCCTAAAAAA" AAAAAAAAAA" AAAA 

Seq ID NO: 6S5 Protein sequence ■ 
Protein Accession #: NP_000573 



PCTAJS02/12476 



GGAAAAGCAG 
ATCTCAGAAG 
TGACCACATG- 
CATTGACTCG 
GTCTCACCAT 
AACCGAAGTT 
TGTGGTTTAT 
TGATGCTACA 
CAAGGCCATC 
GGACAGTTAT 
GTCCAGATTA 
TAGTCAGGAA 
AGATATGCTG 
TTCTCATGAA 
CTCACTTTGC 
ATGCTTCTTT 
GTGTTTGATA 
AGGGTTATGT 
TGTTATTCTC 
AAAAGAGAAT 
TTGTTGTGAT 
TGGTGGTGTC 
CTTTTTTACT 
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11 21 31 41 51 

I I | l I 

LGITCAIPVK QADSGSSEEK QIiYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 
MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH HSDESDELVT 
VFTPWPTVD. TYDGRGDSW YGhRSKSKKF RRPDIQYPDA TDEDITSHME 
IPVAQDLNAP SDWDSRGKDS YETSQLDDQS AETHSHKQSR LYKRKANDES 
ELSKVSREFH SHSFHSHEDM LWDPKSKEE DKHLKFRISH ELDSASSEVN 



Seq ID NO: 656 DNA sequence 

Nucleic Acid Accession »: NM_003108.1 

Coding sequence: 76.. 1401 



1 
I 

GGGGTGGGAG 
GCCCTGCAAC 
CGGGAGGCGC 
GAGAGCGACC 
TTCATGGTATv 
AACGCCGAGA 
ATCCCGTTCA 
TACAAGTACC 
CAGAGCCCAG 
GGTGCCAAGA 
GCGGGCGCCA' 
GACGACTACG. 
ACGGTCAAGT. 
CAGCTGCAGA 
CTGCAGCCGC 
CCCGCCAGCC 
GAGGTGCGGG 
AAGAACATCA 
TCGCGCTCGG 
GAGGACGCCG 
GCCAGCGAGC 
GATAAGGATT 
TACTGCACGC 
GACCTGGTGT 
AGCTGGGTTC 
ATGATGGTGG 
ATATTGATAA- 
TTAAAGTGAA 
TCCTTTATCG 
AAAATGTGTT 
GAGGGGGCGG 
GTCGGTCTTT 
TCTAGGGAGT 
TTTTTAACAA 



11 

I 

GGGGAGGGGG 
GGATCATGGT- 
TGGACACGGA 
CAGACTGGTG" 
GGTCCAAGAT 
TCTCCAAGAG, 
TCCGGGAGGC 
GGCCCCGGAA 
AGAAGAGCGC 
CCTCCAAGGG 
AGGCGGGCGC 
TGCTGGGCAG 
GCGTGTTTCT 
TCAAACAGGA 
CGGGGCAGCA 
CTACGCTGAG 
CCGGCGCGAC 
CCAAGCAGCA 
TGTCCACCTC 
ACGACCTGAT 
AGCAGCTGGG 
TGGATTCGTT 
CGGAGCTGAG 
TCACATATTG 
CTTGGGAGGA 
TGTTGATGGT 
GATGTCGTGA 
ATGAGTAGTT 
TGTCTCAAGG 
TTTGTAATTA 
CGCGGCGGAG 
GAAGTCTGGA 
TGGTGGAGAT 
AAAAAGGG 



21 
I 

ACCTCCGCAC 
GCAGCAGGCG. 
GGAGGGCGAA 
CAAGACGGCG 
CGAACGCAGG 
GCTGGGCAAG 
GGAGCGGCTG 
AAAGCCCAAA 
GGCCGGCGGC 
CTCCAGCAAG 
GGGCAAGGCG 
CCTGCGCGTG 
GGATGAGGAC 
GGCGGACGAG 
GCCGTCGCAG- 
CAGCTCGGCG 
CTCGGGCGCC 
CCCGCCGCCG 
CTCGTCCAGC 
GTTCGACCTG 
GGGCGGCGCG 
CAGCGAGGGC 
CGAGATGATC 
AAAGGCGCCC 
AGTTGTAGTG 
GGCGGTGGTA 
CGCAAAGAAA 
TTTAAACATT 
TAGTTGCATA 
CTATTTCTTT 
GGGAGGTAGG 
AGACGTCTGC 
ATTTTTTTTT 



31 
I 

GAGACCCAGC 
GAGAGCTTGG 
TTCATGGCTT 
TCGGGCCACA 
AAGATCATGG 
CGCTGGAAAA 
CGGCTCAAGC 
ATGGACCCCT 
GGCGGCGGGA 
AAATGCGGCA 
GCCCAGTCCG 
AGCGGCTCGG 
GACGACGACG 
GAGGACGAGG 
CTGCTGAGAC 
GAGTCCCCCG 
GGGGGCGGCA 
CTCGCGCAGC 
AGCAGCGGCA 
AGCTTGAATT 
GCGGCCGGGA 
AGCCTGGGCT 
GCGGGGGACT 
GCTGCTCGCT 
GTGATGATGA 
GGGTGGAGGG 
TTGGAAAACA 
TTTCCTGTCC 
CCTAGTCTGG 
TTCCTGAAAT 
ACCCGCTCCG 
AGAGGACCCT 
CTTAAGAGAA 



41 
I 

GGCCCGGGTT 
AAGCGGAGAG 
GCAGCCCGGT 
TCAAGCGGCC 
AGCAGTCTCC 
TGCTGAAGGA 
ACATGGCCGA 
CGGCCAAGCC 
GCGCGGGCGG 
AGCTCAAGGC 
GGGACTACGG 
GCGGCGGCGG 
ACGACGACGA 
AACCACCGCA 
GCTACAACGT 
AGGGAGCGAG 
GCCGCCTCTA 
CCGCGCTGTC 
GCAGGAGCGG 
TCTCTCAAAG 
ACCTGTCCCT 
CCCACTTCGA 
GGCTGGAGGC 
CTTTCTCTCG 
TGATGATGAT 
GAGAGAAGAA 
TGATGAAAAT 

tttttttgtc 
agttgtgatt 

TCGTGATTGC 
GAAGGCGCTG 
TTTGGCAGCA 
CTTAAAGAAC 



51 
I 

GGAGCGTCCA 

CAACCTGCCC 

GGCCCTGGAC 

GATGAACGCG 

GGACATGCAC 

CAGCGAGAAG 

CTACCCCGAC 

CAGCGCCAGC 

AGGCGCGGGC 

CCCCGCGGCC 

GGGCGCGGGC 

CGCGGGCAAG 

CGACGAGCTG. 

CCAGCAGCTC 

CGCCAAAGTG, 

CCTCTACGAC 

CTACAGCTTC . 

GCCCGCGTCC 

CAGCAGCGGC 

CGCGCACAGC 

GTCGCTGGTG 

GTTCCCCGAC 

GAACTTCTCC 

GAGGGTGCAG 

AATGATGATG 

GATGCTGATG 

TTTGGTGGAG 

CCCCCTCCCT 

ATTTTCCCAA 

AACAAAGGCA 

TTTGAAGCTT 

CAACTGTTAC 

TGGTGATTTT 



Seq ID NO: 657 Protein sequence 
Protein Accession 8: NP_003099.1 
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31 41 51 

MVQQAESLEA ESNLPREALD TEEGEFMACS PVALDESDPD WCKTASGHIK R ^^MVWS 
7l^lMEQ SPDMHNAEIS KRLGKRWKML KDSEKIPFIR EAERLRLKHM ADYPDYKYRP 
RKKPKMDPSA KPSASQSPEK SAAGGGGGSA GGGAGGAKTS KGSSKKCGKL ^PAAAGAKA 
GAGKAAQSGD YGGAGDDYVL GSLRVSGSGG GGAGKTVKCV FLDEDDDDDD DDDELQLQIK 
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180 
.240 
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QEPDEEDEEP PHQQIiLQPPG QQPSQLLRRY NVAKVPASPT LSSSAESPEG ASLYDEVRAG 300 

ATSGAGGGSR LYYSFKNITX QHPPPLAQPA LSPASSRSVS TSSSSSSGSS SGSSGSDADD 360 

LMFDLSLNFS QSAHSASEQQ LGGGAAAGNL SLSLVDKDLD SFSEGSLGSH FEFPDYCTPE . 420 
LSEMIAGDWIi eanfsdlvft y 

Seq ID NO: 65B DNA sequence 
Nucleic Acid Accession 8 : NMJJ01719 
Coding sequence: 123.. 1418 

1, V 11 .21 31 |1 -51 

GGGCGCAGCG GGGCCCGTCT GCAGCAAGTG ACCGACGGCC GGGACGGCCG CCTGCCCCCT 60 

CTGCCACCTG GGGCGGTGCG GGCCCGGAGC CCGGAGCCCG- GGTAGCGCGT AGAGCCGGCG 120 

CGATGCACGT GCGCTCACTG. CGAGCTGCGG CGCCGCACAG CTTCGTGGCG CTCTGGGCAC 180 

CCCTGTTCCT GCTGOGCTCC GCCCTGGCOG ACTTCAGCCT GGACAACGAG GTGCACTCGA 240 

GCTTCATCCA CCGGCGCCTC CGCAGCCAGQ AGCGGCGGGA GATGCAGCGC GAGATCCTCT 300 

CCATTTTGGG' CTTGCCCCAC CGCCCGCGCC CGCACCTCCA GGGCAAGCAC AACTCGGCAC 360 

CCATGTTCAT GCTGGACCTG TACAAOGCCA TGGCGGTGGA- GGAGGGCGGC GGGCCCGGCG 420 

GCCAGGGCTT CTCCTACCCC TACAAGGCCG TCTTCAGTAC CCAGGGCCCC OTCTGGCCA 480 

GCCTGCAAGA TAGCCATTTC CTCACCGACG CCGACATGGT CATGAGCTTC GTCAACCTCG- 540 

TGGAACATGA CAAGGAATTC TTCCACCCAC GCTACCACCA TGGAGAGTTC CGGTTTGATC 600 

TTTCCAAGAT CCCAGAAGGG GAAGCTGTCA ■ OGGCAGCCGA ATTCCGGATC TACAAGGACT 660 

ACATCCGGGA ACGCTTCGAC AATGAGACGT TCCGGATCAG- CGTTTATCAG GTGCTCCAGG 720 

AGCACTTGGG CAGGGAATCG GATCTCTTCC TGCTCGACAG CCGTACCCTC TGGGCCTCGG 780 

AGGAGGGCTG GCTGGTGTTT GACATCACAG CCACCAGCAA CCACTGGGTG GTCAATCCGC 840 

GGCACAACCT GGGCCTGCAG.. CTCTCGGTGG AGACGCTGGA TGGGCAGAGC ATCAACCCCA 900 

AGTTGGOGGG CCTGATTGGG CGGCACGGGC CCCAGAACAA. GCAGCCCTTC ATGGTGGCTT- 960 

TCTTCAAGGC CACGGAGGTC CACTTCCGCA GCATCCGGTC CAOGGGGAGC AAACAGCGCA 1020 

GCCAGAACCG CTCCAAGACG CCCAAGAACC AGGAAGCCCT GCGGATGGCC AACGTGGCAG. 1080 

AGAACAGCAG-- CAGCGACCAG AGGCAGGCCT. GTAAGAAGCA CGAGCTGTAT GTCAGCTTCC 1140 

GAGACCTGGG CTGGCAGGAC TGGATCATCG CGCCTGAAGG CTACGCCGCC TACTACTGTG 1200 

AGGGGGAGTG < TGCCTTCCCT CTGAACTCCT ACATGAACGC CACCAACCAC GCCATCGTGC 1260 

AGACGCTGGT.. CCACTTCATC AACCCGGAAA CGGTGCCCAA* GCCCTGCTGT ^ C ^ G( = 122 0 

AGCTCAATGC CATCTCCGTC CTCTACTTCG ATGACAGCTC CAACGTCATC CTGAAGAAAT 1380 

ACAGAAACAT GGTGGTCCGG GCCTGTGGCT GCCACTAGCT. CCTCCGAGAA TTCAGACCCT 1440 

TTGGGGCX1AA GTTTTTCTGG ATCCTCCATT GCTCGCCTTG GCCAGGAACC AGCAGACCAA 1500 

CTGCCTTTTG TGAGACCTTC CCCTCCCTAT CCGCAACTTT AAAGGTGTGA GAGTATTAGG 1560 

AAACATGAGC AGCATATGGC TTTTGATCAG TTTTTCAGTG GCAGCATCCA ATGAACAAGA 1620 

TCCTACAAGC TGTGCAGGCA AAACCTAGCA" GGAAAAAAAA ACAACGCATA AAGAAAAATG 1680 

GCCGGGCCAG GTCATTGGCT GGGAAGTCTC AGCCATGCAC GGACTCGTTT CCAGAGGTAA- 1740 

TTATGAGCGC CTACCAGCCA GGCCACCCAG CCGTGGGAGG AAGGGGGCGT GGCAAGGGGT 1800 

GGGCACATTG GTGTCTGTGC GAAAGGAAAA TTGACCCGGA AGTTCCTGTA ATAAATGTCA 1860 
CAATAAAACG AATGAATG 

Seq ID NO; 659 Protein sequence 
Protein Accession #: NP_001710 

1 11 21 31. 41 51 

KHVRSLRAAA PHSFVALWAP LFLLRSALAD PSLDNEVHSS FIHRRLRSQE RREMQREILS 60 

ILGLPHRPRP HLQGKHNSAP MFMLDLYNAM AVEEGGGPGG QGFSYPYKAV FSTQGPPLAS 120 

LQDSHPLTDA DMVMSFVNLV' EHDKEFPHPR YHHREFRFDL SKIPEGEAVT AAEFR1YKDY 180 

IRERFDNETP RISVYQWJE HLGRESDLFL LDSRTLWASE EGWLVFDITA. TSNHWWNPR 240 

HNLGLQLSVE TLDGQSINPK LAGLIGRHGP QNKQPFMVAF FKATEVHFRS IRSTGSKQRS. 300 

QNRSKTPKNQ EALRMANVAE NSSSDQRQAC KKHELYVSFR DLGWQDWIIA PEGYAAYYCE 360 

GECAFPLNSY MNATNHAIVQ TLVHFXNPET VPKPCCAPTQ LNAISVLYFD DSSNVIIiKKY 420 
RNMWRACGC H 



Seq ID NO: 660 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 211.. 1895 

1 U f j 1 J 1 I 1 

GGATCTGAGG GGCGCCCAGT CACTTCCTCC ACGTTCTCGT GCTGGGCGGG AGGAGCGGAT 60 

GGGGCTTGGG AGGCAGCCTG CTCTCCAGTC CCTATCCACC CACAGGTTTT TTGGGTCGGA 120 

GAGGAATTAT CTGATAAAAT TCCTGGGTTA ATATTTTTAA AAACGGAGAG . TTTTTAAAAA . 180 

TGATTTTTTT CCCTCGAAAA" TGACCTTTTT ATGCTTCGAA GCAGTTTGTC AACCAGCATA 240 

GTGCTTTTTC TTTTCTCTTC TTTTTCTACG ATAAATGAAA GCATTTCTTC AAGAAAAAGG 300 

CACAGGTTCC TTGAACAGCT GGATTCTGAT GGCACCATTA CTATAGAGGA GCAGATTGTC 360 

• CTTGTGCTGA AAGCGAAAGT ACAATGTGAA CTCAACATCA CAGCTCAACT CCAGGAGGGA 420 

GAAGGTAATT GTTTCCCTGA ATGGGATGGA CTCATTTGTT GGCCCAGAGG AACAGTGGGG 480 

AAAATATCGG CTGTTCCATG' CCCTCCTTAT- ATTTATGACT TCAACCATAA AGGAGTTGCT. 540 

TTCCGACACT GTAACCCCAA TGGAACATGG GATTTTATGC ACAGCTTAAA TAAAACATGG • 600 

GCCAATTATT CAGACTGCCT TCGCTTTCTG CAGCCAGATA TCAGCATAGG AAAGCAAGAA . 660 

TTCTTTGAAC GCCTCTATGT AATGTATACC GTTGGCTACT CCATCTCTTT TGGTTCCTTG 720 

GCTGTGGCTA TTCTCATCAT TGGTTACTTC AGACGATTGC ATTGCACTAG GAACTATATC 780 

CACATGCACT TATTTGTGTC TTTCATGCTG AGAGCTACAA GCATCTTTGT CAAAGACAGA 840 

GTAGTCCATG CTCACATAGG AGTAAAGGAG CTGGAGTCCC TAATAATGCA GGATGACCCA 900 

CAAAATTCCA TTGAGGCAAC TTCTGTGGAC AAATCACAAT ATATCGGGTG CAAGATTGCT 960 

GTTGTGATGT TTATTTACTT CCTGGCTACA AATTATTATT GGATCCTGGT GGAAGGTCTC 1020 

TACCTGCATA ATCTCATCTT TGTGGCTTTC TTTTCGGACA CCAAATACCT GTGGGGCTTC 1080 

ATCTTGATAG GCTGGGGGTT TCCAGCAGCA TTTGTTGCAG CATGGGCTGT GGCACGAGCA 1140 

ACTCTGGCTG ATGCGAGGTG CTGGGAACTT AGTGCTGGAG ACATCAAGTG GATTTATCAA . 1200 

GCACCGATCT TAGCAGCTAT TGGGCTGAAT TTTATTCTGT TTCTGAATAC GGTTAGAGTT 1260 

CTAGCTACCA AAATCTGGGA GACCAATGCA GTTGGGCATG ACACAAGGAA GC^TACAGG 1320 

AAACTGGCCA AATCGACACT GGTCCTGGTC CTAGTCTTTG GAGTGCATTA CATCGTGTTC 1380 
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WO 02/086443 

GTATGCCTGC CTCACTCCTT CACTGGGCTC GGGTGGGAGA 
TTCTTCAACT CCTTTCAGGG TTTCTTTGTG TCTATCATCT 

GTTCAGGCAG aggtgaagaa gatgtggagt cggtggaatc 

ACACCGCCAT GTGGCAGCCG CAGATGCGGC TCAGTGCTCA 
AGCAGCCAGT CACAGGTGGC GGCCAGCACA CGCATGGTGC 
AAGATCGCCA' GCAGACAGCC TGACAGCCAC ATCACTTTAC 
TCAGAGCAGG- ACTGCCTGCC ACACTCTTTC CACGAGGAGA 
CAGGGAGATG. ATATTCTAAT GGAGAAGCCT TCCAGGCCTA 
GAAGGATGCC AAGGAGAAAC TGAGGATGTT CTCTGA 

Seq ID NO: 661 Protein sequence 
Protein Accession Eos sequence 



PCTAJS02/12476 



i 

I- 

MLRSSLSTSI 
LNITAQLQEG 
DPMHSLNKTW 
RRLHCTRNYT 
KSQYIGCKIA 
FVAAWAVARA 
VGHDTRKQYR- 
SIIYCYCNGE 
RMVLI SGKAA' 
SRPMESNPDT 



11 
I 

VLFLFSSFST 
EGNCFPEWDG 
ANYSDCLRFL 
HMHLFVSFML 
WMF I Y FLAT 
TLADARCWEL 
KLAKSTLVLV- 
VQAEVKKMWS 
KIASRQPDSH- 
EGCQGETEDV 



21 
I 

INESISSRKR 
LICWPRGTVG 
QPDISIGKQB 
RATSIFVKDR 
NYYWILVEGI/ 
SAGDIKWIYQ. 
LVFGVHYIVF 
RWNLSVDWKR 
ITLPGYVWSN 
L 



31 
I 

HRFLEQLDSD 
KISAVPCPPY- 
FFERLYVMYT 
WHAHIGVKE 
YLHNLIFVAF 
APILAAIGLN 
VCLPHSFTGL 
TPPCGSRRCG 
SEQDCLPHSF 



TCCGCATGCA 
ACTGCTACTG 
TCTCCGTGGA 
CCACCGTGAC 
TTATCTCTGG 
CTGGCTATGT 
CCAAGGAAGA 
TGGAATCTAA 



41 
I 

GTiriEEQIV 
IYOFNHKGVA 
VGYSISFGSL 
LESLIMQDDP 
FSDTKYLWGF 
FILFLNTVRV 
GWEIRMHCEL- 
SVLTTVTHST 



CTGTGAGCTC 
CAATGGAGAG 
CTGGAAAAGG 
GCACAGCACC 
CAAAGCTGCC 
CTGGAGTAAC 
TAGTGGGAGG 
CCCAGACACT 



51 
I 

LVLKAXVQCE 

FRHCNPNGTW.. 

AVAIL I IGYF 

QNSIEATSVD. 

ILIGWGFPAA 

LATKIWETNA 

FFNSFCGFFV 

SSQSQVAAST 

QGDDILMEKP 



1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seq ID NO: 662 DNA sequence 
Nucleic Acid Accession #: NMJ)0504B 
Coding sequence: 143.. 1795 



1 

I 

GGCCGGTGGC 
TGGGCCAGCC 
TCTTCCTACA 
GCTAATGCTC 
TATAGAGGAG; 
AGCTCAACTC 
GCCCAGAGGA' 
CAACCATAAA 
CAGCTTAAAT 
CAGCATAGGA 
CATCTCTTTT 
TTGCACTAGG ' 
CATCTTTGTC 
AATAATGCAG 
TATCGGGTGC 
GATCCTGGTG. 
CAAATACCTG 
ATGGGCTGTG" 
CATCAAGTGG 
TCTGAATACG. 
CACAAGGAAG" 
AGTGCATTAC 
CCGCATGCAC 
CTGCTACTGC 
CTCCGTGGAC 
CACCGTGACG 
TATCTCTGGC 
TGGCTATGTC 
CAAGGAAGAT 
GGAATCTAAC 
CATTTGTGGC 
ATACTCCTAT 
TTTTAGGCTC 
GGAGTAGTTT 
GCTCTGTGAT 
GCTGTAGCTT 
ATTTTCCTTT - 
ATTTATTTTG 
GATCTAAGAA 
TTATAACAAT 
ACATCCCTTC 
TTCTTTGTAA 
TTGATTTTGT 
TTTGAGCTGT 
T 



11 
I 

CCGGGCCCGA 
AAGTTGGCAA 
GCCGTTCCGG 
GGCAGCTGCC 
CAGATTGTCC 
CAGGAGGGAG 
ACAGTGGGGA 
GGAGTTGCTT 
AAAACATGGG 
AAGCAAGAAT 
GGTTCCTTGG- 
AACTATATCC 
AAAGACAGAG . 
GATGACCCAC. 
AAGATTGCTG 
GAAGGTCTCT- 
TGGGGCTTCA 
GCACGAGCAA 
ATTTATCAAG. 
GTTAGAGTTC 
CAATACAGGA 
ATCGTGTTCG 
TGTGAGCTCT 
AATGGAGAGG 
TGGAAAAGGA 
CACAGCACCA 
AAAGCTGCCA 
TGGAGTAACT 
AGTGGGAGGC 
CCAGACACTG 
TGACTTTCAT 
GCTTGAGTTC 
CATGAATTGG 
ATTACCTTCT 
TGTTCATTTT 
TCTCTCATAT 
TAGAAACTAG 
CCTGTGCATA 
CAAGTACTTG 
TACATGTGTT 
TTTTGAATGG 
ACCATGTCAT 
TTGTAATGTA 
TACTACATTG 



21 
I 

CCACCCCAGC 
CTTGGAAGCT 
GCATGGCCGG 
TCCTGGCCAG 
TTGTGCTGAA 
AAGGTAATTG 
AAATATCGGC 
TCCGACACTG 
CCAATTATTC 
TCTTTGAACG 
CTGTGGCTAT 
ACATGCACTT 
TAGTCCATGC 
AAAATTCCAT 
TTGTGATGTT 
ACCTGCATAA 
TCTTGATAGG 
CTCTGGCTGA 
CACCGATCTT 
TAGCTACCAA 
AACTGGCCAA 
TATGCCTGCC 
TCTTCAACTC 
TTCAGGCAGA 
CACCGCCATG 
GCAGCCAGTC 
AGATCGCCAG 
CAGAGCAGGA 
AGGGAGATGA 
AAG GATGCCA- 
GGGCTGGTCC 
AAAGGCTGAA' 
CTCCTGTAAA 
ATTGGCATCA 
TTTCTGCTAC 
ATATCACCCT 
TATTCTCTTA 
GGAGCAATTA 
CTGGAAAATT 
TTTGGGAACA 
CCTCTTTGTG 
GTGGAAAGAT 
TTTTGATAGC 
TACATGGCAT 



31 
\ 

TGCGCGTCGT 
TCTCCCGGGC 
GCTGGGGGCG 
AGCCCAGCTG 
.AGCGAAAGTA 
TTTCCCTGAA 
TGTTCCATGC 
TAACCCCAAT 

agactgcctt: 

CCTCTATGTA 
TCTCATCATT 
ATTTGTGTCT. 
TCACATAGGA 
TGAGGCAACT 
TATTTACTTC 
TCTCATCTTT 
CTGGGGGTTT 
TGCGAGGTGC 
AGCAGCTATT 
AATCTGGGAG 
ATCGACACTG. 
TCACTCCTTC 
CTTTCAGGGT 
GGTGAAGAAG 
TGGCAGCCGC 
ACAGGTGGCG 
CAGACAGCCT 
CTGCCTGCCA 
TATTCTAATG 
AGGAGAAACT 
AATGGCTGGT 
AATTCAGTTA 
TACTAACGAC 
AGTTTTCCTC 
TTTTGGGTAG 
AAATATAATG 
TTTCTTACTT 
GGATCTAAAA 
AGTTGGCTGG 
AGGAAAATTT 
ACCAGCCAGA 
TTCCTCAGTT 
AAATCATGCT 
GTGGGATCAA 



41 
i 

TACTGGCCAC 
TCTGGAGGAG 
TCGCTCCACG 
GATTCTGATG 
CAATGTGAAC 
TGGGATGGAC 
CCTCCTTATA 
GGAACATGGG 
CGCTTTCTGC 
ATGTATACCG 
GGTTACTTCA 
TTCATGCTGA 
GTAAAGGAGC 
TCTGTGGACA 
CTGGCTACAA 
GTGGCTTTCT 
CCAGCAGCAT 
TGGGAACTTA 
GGGCTGAATT 
ACCAATGCAG 
GTCCTGGTCC 
ACTGGGCTCG 
TTCTTTGTGT 
ATGTGGAGTC 
AGATGCGGCT 
GCCAGCACAC 
GACAGCCACA 
CACTCTTTCC 
GAGAAGCCTT 
GAGGATGTTC 
TGTGTGAGAG- 
AGGTGTTACT 
ATGAAAATGC 
TAAATTAATG 
AAAAAAGATT- 
AAGATCTTTT- 
TAATGTACTT 
AAATATATGG 
ACATTGATAA 
CTCAAAAAAG 
CCTCAGGTCT. 
AGTGAGCTTG 
GCATCTATAT 
TTAAAAATTT 



51 
I 

AAGTTTGCTC 

GGTCCCTGCT 

TCTGGGGTTG 

GCACCATTAC 

TCAACATCAC 

TCATTTGTTG 

TTTATGACTT** 

ATTTTATGCA 

AGCCAGATAT 

TTGGCTACTC 

GACGATTGCA 

GAGCTACAAG. 

TGGAGTCCCT 

AATCACAAT A 

ATT ATTATTG 

TTTCGGACAC 

TTGTTGCAGC * 

GTGCTGGAGA 

TTATTCTGTT- 

TTGGGCATGA ■ 

TAGTCTTTGG 

GGTGGGAGAT, 

CTATCATCTA- 

GGTGGAATCT-- 

CAGTGCTCAC- 

GCATGGTGCT 

TCACTTTACC 

ACGAGGAGAC " 

CCAGGCCTAT 

TCTGAATGGA 

GGCTTGGCTG 

TAATAATAGT 

AAGTGTCAAT 

TATGGTATTT 

CAATTGCTTG . 

AGTGTGTATC 

CTATCACTGC 

GAAGATAAAA 

AATAATGCAT 

AATATTTCAC 

TCACTCTTTC 

TGTCTGCAAA 1 

CTTTTTCTTG 

GTTTTAAAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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2040 
2100 
2160 
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2280 
2340 
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2460 
2520 
2580 
2 640 



Seq ID NO: 663 Protein sequence 
Protein Accession fh NP_005039 



11 



21 



31 



51 



41 

, , I 1 1 

MAGLGASI/HV WGWLMLGSCL LARAQLDSDG TITIEEQIVL VLXAKVQCEL NITAQLQEGE 
GNCFPEWDGL ICWPRGTVGK ISAVPCPPYI YDFNHKGVAF RHCNPNGTWD FMHSLNKTWA 
NYSDCLRFLQ PDISIGKQEF FERLYVMYTV GYSISFGSLA VAILIIGYFR RLHCTRNYIH 
MHLFVSFMLR ATSIPVKDRV VHAHIGVKEL ESLIKQDDPQ NSIEATSVDK SQYIGCKIAV 
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WO 02/086443 

* VMFIYFLATN YYWILVEGLY 
LADARCWELS AGDIKWIYQA 

' LAKSTLVLVL VFGVHYIVFV 
QAEVKKMWSH -WNLSVDWKRT 
IASRQPDSHI TLPGYVWSNS 
GCQGSTEDVL 



PCT/US02/12476 



LHNLIFVAF? SDTKYLWGPI LIGWGFPAAF 
PILAAIGLNF ILFLNTVHVL ATKIWETNAV 
CLPHSFTGLG WEIRMHCELF FNSFQGFFVS 
PPCGSRRCGS VLTTVTHSTS SQSQVAASTR 
EQDCLPHSFH EETKEDSGRQ GDDILMEKPS 



Seq ID NO: 664 DNA sequence 
Nucleic Acid Accession ft: NM_012152 
Coding sequence: 43.. 1104 



CTTCTTTAAA 
GACAAGCACA 
GSAACAAAGC 
TCTAATTCTC 
TACCTGTTGG 
ATGTTTAACA 
GGGCTTCTGG. 
AGGCACATGT 
CTGCTCATTT 
TGGAATTGCC 
TACCTTGTTT 
CTGCGGATCT' 
TCCATCAGCC 
GCGTTTGTGG 
AGGCAGTGTG 
GTCGTGAACC 
ATGATCTGCT 
GTCCTCAGCA 
GTCTGCAATA 
GTCTTAGG 



11 
I 

TTTCTTTCTA 
TGGACTTTTT 
TTGTGATTGT 
TGGTCATCGC 
CTAATTTAGC 
CAGGCCCAGT- 
ACAGTAGCTT 
CAATCATGAG 
TGCTTGTCTG 
TCTGCAACAT 
TCTGGACAGT 
ACGTGTACGT 
GCCGGAGGAC 
TATGCTGGAC 
GCGTGCAGCA- 
CCATCATCTA 
GCTTCTCTCA 
GGAGTGACAC 
AAAGCACTTC 



21 
I 

GGATGTTCAC 
TTATAATAGG 
TTTGTGTGTT 
GGCAGTGATC 
TGCTGCCGAT 
TTCAAAAACT. 
GACTGCTTCC 
GATGCGGGTC 
GGCCATCGCC 
CTCTGCCTGC 
GTCCAACCTC 
CAAGAGGAAA 
ACCCATGAAG- 
CCCGGGCCTG 
TGTGAAAAGG- 
CTCCTACAAG 
GGAGAACCCA 
AGGCAGCCAG 
CTAAACTCTG 



31 
I 

TTCTTCTCCA 
AGCAACACTG 
GGGACGTTTT 
AAAAACAGAA 
TTCTTCGCTG 
TTGACTGTCA 
CTCACCAACT 
CATAGCAACC 
ATTTTTATGG 
TCTTCCCTGG 
ATGGCCTTCC 
ACCAACGTCT 
CTAATGAAGA 
GTGGTTCTGC 
TGGTTCCTGC 
GACGAGGACA 
GAGAGGCGTC 
TACATAGAGG 
GATGCCTCTC 



41 
1 

CAATGAATGA 
ATACTGTCGA 
TCTGCCTGTT 
AATTTCATTT 
GAATTGCCTA 
ACCGCTGGTT 
TGCTGGTTAT 
TGACCAAAAA 
GGGCGGTCCC 
CCCCCATTTA 
TCATCATGGT. 
TGTCTCCGCA' 
OGGTGATGAC 
TCCTCGACGG 
TGCTGGCGCT 
TGTATGGCAC 
CCTCTCGCAT 
ATAGT ATT AG 
GGCCCACCCA 



VAAWAVARAT 
GHDTRKQYRK ' 
IIYCYCNGEV 
MVLISGKAAK 
RPMESNPDTE 



SI 
I 

GTGTCACTAT 
TGACTGGACA 
TATTTTTTTT 
CCCCTTCTAC 
TGTATTCCTG 
TCTCCGTCAG 
CGCCGTGGAG. 
GAGGGTGACA 
CACACTGGGC 
CAGCAGGAGT 
TGTGGTGTAC. 
TACAAGTGGG . 
. TGTCTTAGGG ' 
CCTGAACTGC 
GCTCAACTCC 
CATGAAGAAG -. 
CCCCTCCACA 
CCAAGGTGCA 
GGTGATGACT 



Seq ID NO: 665 Protein sequence 
Protein Accession #: NP_036284 



MNECHYDKHM 
FHPPFYYLLA 
LVIAVERHMS 
PIYSRSYLVF 
VMTVLGAFW 
YGTMKKMICC 



11 

I ' 

DFFYNRSNTD- 
NLAAADFFAG 
IMRMRVHSNL 
WTVSNLMAFL 
CWTPGLWLL 
FSQENPERRP 



21 • • - 
I ' 

TVDDWTGTKL 
IAYVFLMFNT 
TKKRVTIiLIL 
IMVWYLRIY 
LDGLNCRQCG 
SRIPSTVLSR 



31 
I 

VrVLCVGTFF 
GPVSKTLTVN 
LVWAIAIFMG 
VYVKRKTNVL 
VQHVKRWFLL 
SDTGSQYIED 



41 51. 

I I ■ 

CLFIFFSNSL VIAAVIKNRK 
RWPLRQGIiLD SSLTASLTNL 
AVPTLGWNCL CNISACSSIiA 
SPHTSGSISR RRTPMKLMKT 
LALLNSWNP IIYSYKDEDM 
SISQGAVCNK STS 



Seq ID NO: 666 DNA sequence 
Nucleic Acid Accession ft: NMJ)02821 
Coding sequence: 150.. 3362 



1 
I 

AACTCCCGCC 
GCGCTCCGGT 
CCTCAGCTCC" 
CCCGCCGGTT 
CCATTGTCTT 
TTCGCTGTGA 
CTGTCCAGGA 
ACCGGCTGCA 
AAGCCCGCAG 
TGAAGCATCC 
ACATTGATGG 
ATGGTCAGAG 
GTCCTGAGCA 
GCAGCCAGAA 
CCCAGGACGT 
AGCCACCCCC 
GCCCCCCACA 
TCCGGCCACG' 
TCATCCTGGA' 
GGGTGTTTAC 
AGCCCAGCGT 
AGAAGGGCCA: 
GCCACGCGGC 
TGCCCTCCTG 
TGGATTGCCT 
TCATCTCAGA 
TGGAGGTGTA 
AGGCGCAAGC 
AGCAGTGCAT 
AGCCCACTAT 
ACGCTGGGAC 
TTGCCTCCAA 
TTATCACCTT 
TGCAGTGCGA 
TCCTGGACCC 



11 
I 

TCGGGACGCC 
GCGTCCGCCT 

ttttcctgag: 
gcctctgctc 
catcaagcag-. 

GGTTGAGGCT-. 
CACGGAGCGG 
GGACTCTGGC. 
TGCCAACGCC 
AGCCTCGGAA 
GCACCCTCGG 
CAACCACACA 
TAGTGGGCTG 
CTTCACCTTG 
GGTAGTAGCG 
GAGCCTGCAG 
CCTCCGCAGA 
CAATGCAGGG 
AGCCACACTT 
AGCTGGCAGC 
GTGGTGGGAG 
CGAGCTGGTG' 
CAACCTGGCT 
GCTGAAGAAG 
GACCCAGGCC 
GGACTCACGG 
TGATGGGACA 
CCGTGTCCAA 
GGAGTTTGAC 
TAAGTGGGAA 
CCTGCATTTT 
CGGGCCGCAG 
CAAAGTGGAA 
GGCCCAGGGG 
-CACCAAGCTG 



21 
I 

TCGGGGTCGG 
CCTGTGCCCG 
CCCGCCGCGA 
AGCGTCCTGC 
CCGTCCTCCC 
CCGGGCCCGG 
CGTTTCGCCC 
ACCTTCCAGT 
TCCTTCAACA' 
GCTGAGATCC 
CCCACCTACC 
GTCAGCAGCA 
TATTCCTGCT 
AGCATTGCTG 
AGGTATGAGG 
TGGCTCTTTG 
GCCACAGTGT 
ATCTACCGCT 
CACCTAGCAG 
GAGGAGCGTG 
CACGCGGGAG 
TTGGCCAATA 
GGTCAGCGGA 
CCCCAAGACA 
ACACCAAAAC 
TTCGAGGTCT 
TGGTACCGTT 
GTGCTGGAAA 
AAGGAGGCCA 
CGGGCAGATG 
GCCCGGGTGA 
GGCCAGATTC 
CCAGAGCGTA 
GACCCCAAGC 
GGACCCAGGA 



31 
I 

GCTCCGGCTG 
CCGCGGAGCA 
TGGGAGCTGC 
TGCTGCCGCT. 
AGGATGCACT 
TACATGTGTA 
AGGGCAGCAG 
GTGTGGCTCG 
TCAAATGGAT 
AGCCACAGAC 
AATGGTTCCG 
AGGAGCGGAA 1 
GCGCCCACAG- 
ATGAAAGCTT 
AGGCCATGTT 
AGGATGAGAC 
TTGCCAACGG 
GCATTGGCCA 
AGATTGAAGA 
TGACCTGCCT 
TCCGGCTGCC 
TTGCTGAAAG- 
GACAGGATGT 
GCCAGCTGGA 
CTACAGTTGT 
TCAAGAATGG 
GTATGAGCAG 
AGCTCAAGTT 
CGGTGCCCTG 
GGAGCAGCCT 
CTCGAGATGA 
GTGCCCATGT 
CGACTGTGTA 
CGCTGATTCA 
TGCACATCTT 



41 
i 

CGGCTGCTGC 
GTCTGCGGCC 
GCGGGGATCC 
GCTGGGCGGT 
GCAGGGGCGC 
CTGGCTGCTC 
CCTGAGCTTT 
GGATGATGTC 
TGAGGCAGGT 
CCAGGTCACA 
AGATGGGACC 
CCTGACGCTC 
TGCTTTTGGC 
TGCCAGGGTG 
CCATTGCCAG 
TCCCATCACT 
GTCTCTGCTG 
GGGGCAGAGG 
CATGCCGCTA 
TCCCCCCAAG 
CACCCATGGC 
TGATGCTGGT 
CAACATCACT 
GGAGGGCAAA 
CTGGTACAGA 
GACCTTGCGC 
CACCCCAGCC 
CACACCACCA 
TTCAGCCACA 
CCCAGAGTGG 
CGCTGGCAAC 
CCAGCTCACT 
CCAGGGCCAC 
GTGGAAAGGC 
CCAGAATGGC 



51 
I 

TGCGGCGCCC 
CGCCGTGCGC 
CCGGCCAGAC 
ACCCAGACAG 
CGGGCGCTGC 
GATGGGGCCC 
GCAGCTGTGG 
ACTGGAGAAG 
CCTGTGGTCC' 
CTTCGTTGCC 
CCCCTTTCTG 
CGGCCAGCTG 
CAGGCTTGCA 
GTGCTGGCAC 
TTCTCAGCCC 
AACCGCAGTC 
CTGACCCAGG 
GGCCCACCCA 
TTTGAGCCAC 
GGTCTGCCAG 
AGGGTCTACC 
GTCTACACCT 
GTGGCCACTG 
CCCGGCTACT 
AACCAGATGC 
ATCAACAGCG 
GGCAGCATCG 
CCCCAGCCAC. 
GGCCGAGAGA 
GTGACAGACA 
TACACTTGCA 
GTGGCAGTTT 
ACAGCCCTAC 
AAGGACCGCA 
TCCCTGGTGA 



300 
360 
420 
480 
540 
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WO 02/086443 

TCCATGACGT GGCCCCTGAG 
ACATCAAGCA CAOGGAGGCC 
AGGGCCCTGG CAGCCCTCCC 
COGCTGTGGC CTACATCATT 
AAGCCAAGCG GCTGCAGAAG 
GAGGGCCTTT GCAGAACGGG. 
GCTTGGGCTC" CGGCCCCGCG 
TCCCACGGTC TAGCCTGCAG 
TCCTGGCAAA GGCTCAGGGC 
GCCTGCAGAC GAAGGATGAG 
GGAAGCTGAA CCACGCCAAC. 
ACTACATGGT GCTGGAATAT 
AGAGCAAGGA TGAAAAATTG 
GCACCCAGGT AGCCCTGGGC 
TGGCTGCGCG TAACTGCCTG 
TCAGCAAGGA TGTGTACAAC 
GCTGGATGTC CCCCGAGGCC 
OCTTCGGTGT GCTGATGTGG 
CAGATGATGA AGTACTGGCA 
GCTGCCCTTC CAAACTCTAT 
GGCCCTCCTT CAGTGAGATT. 
GAGGAGGGAG CCCGCTCAGG 
CAGCATGATG GGCAAGATCC 
TTGCTGAGGT CTGAGCAGGG 
GGCTGACTTG. GACCCAAACT 
CTCTTCCTCT ATCAGGGACA 
TTCTCCCCTT GACCGGGTCC 
AGGCTTGGGA TGAGCTGGGT 
AGGGTTAATG- AGTCTCTTGC 
ACACAGCAAG TGAGTCCTCC 
CCCCACCCTT- CTCTCCTTTC. 
CTTTTGACAC TATATAAACC. 
TGCAGCGTGG GGTGGGTGGG: 
GCCATCCTTA CCCCACACTT 
TGTTTTTGTT TTTACACTCG 



GACTCAGGCC GCTACACCTG CATTGCAGGC 
CCCCTCTATG TCGTGGACAA GCCTGTGCCG 
CCCTACAAGA TGATCCAGAC CATTGGGTTG 
GCCGTGCTGG GCCTCATGTT CTACTGCAAG 
CAGCCCGAGG GCGAGGAGCC. AGAGATGGAA 
CAGCCCTCAG CAGAGATCCA AGAAGAAGTG 
GCCACCAACA AACGCCACAG CACAAGTGAT 
CCCATCACCA CGCTGGGGAA GAGTGAGTTT 
TTGGAGGAGG: GAGTGGCAGA GACCCTGGTA 
CAGCAGCAGC TGGACTTCCG GAGGGAGTTG 
GTGGTGCGGC TCCTGGGGCT GTGCCGGGAG 
GTGGATCTGG GAGACCTCAA GCAGTTCCTG 
AAGTCACAGC CCCTCAGCAC CAAGCAGAAG 
ATGGAGCACC TGTCCAACAA CCGCTTTGTG 
GTCAGTGCCC AGAGACAAGT GAAGGTGTCT 
AGTGAGTACT ACCACTTCCG CCAGGCCTGG 
ATCCTGGAGG 1 GTGACTTCTC TACCAAGTCT 
GAAGTGTTTA CACATGGAGA GATGCCCCAT- 
GATTTGCAGG CTGGGAAGGC. TAGACTTCCT 
CGGCTGATGC AGCGCTGCTG GGCCCTCAGC. 
GCCAGCGCCC TGGGAGACAG CACCGTGGAC 
ATGGCCTGGG CAGGGGAGGA CATCTCTAGA 
CTGTCCTCCT- GGGCCCTGAG GTGCCCTAGT 
CCTGGCCTTT CCTCCTCTTC CTCACCCTCA 
GGGCGACTAG GGCTTTGAGC TGGGCAGTTT 
GTGTGGGTGC CACAGGTAAC CCCAATTTCT 
AACTCTGCCA CTCATCTGCC AACTTTGCCT 
TTGTGGGGAG. TTCCTTAATA TTCTCAAGTT 
CCACTGGTCC ACTTGGGGGT CTAGACCAGG 
CCACTCTGGG. CTTGTGCACA CTGACCCAGA 
CTCATCCTAA GTGCCTGGCA GATGAAGGAG 
GCCCTTTTTG TATGCACCAC GGGCGGCTTT 
CATGGGAGGT AGGGGTGGGC CCTGGAGATG 
TTATTGTTGT CGTTTTTTGT TTGTTTTGTT 
CTGCTCTCAA TAAATAAGCC TTTTTTA 



AACAGCTGCA 

GAGGAGTCGG 

TCGGTGGGTG 

AAGCGCTGCA 

TGCCTCAACG ' 

GCCTTGACCA 

AAGATGCACT 

GGGGAGGTGT ■ 

CTTGTGAAGA 

GAGATGTTTG 

GCTGAGCCCC 

AGGATTTCCA 

GTGGCCCTAT 

CATAAGGAC7 

GCCCTGGGCC 



GATGTCTGGG 
GGTGGGCAGG 
CAGCCCGAGG 
CCCAAGGACC 
AGCAAGCCGT 
GGGAAGCTCA 
GCAACAGGCA 
TCCTTTGGGA 
CCCCTGCCAC 
GGCCTTCAAC 
GGGGAGGGCT 
CTGGGCACAC 
ATTATAGAGG 
CCCACGTCTT 
TTTTCAGGAG 
TATATGTAAT 
AGGAGGGTGG 
TTTTTGTTTT 



Seg ID NO: 667 Protein sequence 
Protein Accession 8: NP 002812 



1 
I 

MGAARGSPAR 
VHVYWLLDGA 
IKWIEAGPW 
KERNLTLRPA 
EAMPHCQFSA" 
CTGQGQRGPP 
VRIiPTHGRVY- 
SQLEEGKPGY 
CMSSTPAGSI- 
GSSLPEWVTD 
TTVYOGHTAL 
RYTCIAGWSC 
GLMPYCKKRC 



LDFRRELEMF 
PLSTKQKVAL 
VHFRQAWVPL 
AGKARLPQPE 



11 
I 

PRRLPLLSVL 
PVQDTERRPA 
LKHPASEAEI 
GPEHSGLYSC 
QPPPSLQWLF 
IILBATLHLA 
QKGHELVLAN 
LDCLTQATPK 
EAQARVQVLE 
NAGTLHFARV 
IiQCEAQGDPK 
MIKHTEAPLY 
KAKRLQKQPE 
PPRSSLQPIT 
GKLNHANWR 
CTQVALGMEH-. 
RWMSPSAILB 
GCPSKLYRLM 



21 
I 

LLPLLGGTQT 
QGSSLSFAAV. 
QPQTQVTLRC 
CAHSAPGQAC 
EDETPITNRS 
EIEDMPLFEP 
IAESDAGVYT 
PTWWYRNQM 
KLKFTPPPQP 
TRDDAGNYTC 
PLXQWKGKDR 
WDKPVPEES 
GEEPEMECLN^ 
TLGKSEFGEV.- 
LLGLCREAEP 
LSNNRFVHKD 
GDFSTKSDVW 
QRCWALSPKD 



31 
I 

AIVFIKQPSS 
DRLQDSGTFQ 
HIDGHPRPTY 
SSQNFTLSIA 
RPPHLRRATV 
RVFTAGSEER 
CHAANLAGQR 
IjISEDSRFEV 
QQCMEFDKEA 
IASNGPQGQI 
ILDPTKLGPR 
EGPGSPPPYK 
GGPLQNGQPS 
FLAKAQGLEE 
HYMVLEYVDL 
LAARNCLVSA' 
AFGVLMWEVF. 
RPSFSEIASA 



41 
I 

QDALQGRRAL 
CVARDDVTGE 
QWFRDGTPLS 
DESFARWLA 
FANGSLhLTQ 
VTCLPPKGLP 
RQDVNITVAT 
FKNGTLRINS 
TVPCSATGRE 
RAHVQLTVAV 
MHIFQNGSLV 
MIQTIGLSVG 
AEIQEEVALT 
GVAETLVLVK 
GDLKQFLRIS 
QRQVKVSALG 
THGEMPHGGQ 
LGDSTVDSKP 



51 
I 

LRCEVEAPGP 

EARSANASFN 

DGQSNHTVSS 

PQDWVARYE 

VRPRNAGIYR 

EPSVWWEHAG 

VPSWLKKPQD 

VEVYDGTWYR ' 

KPTIKWERAD 

FITFKVEPER 

IHDVAPEDSG 

AAVAYIIAVL 

SLGSGPAATN. 

SLQTKDEQQQ. 

KSKDEKLKSQ. 

LSKDVYNSEY 

ADDEVLADLQ 



Seq ID NO i 668 DNA sequence 
Nucleic Acid Accession #: Eos 
Coding sequence: 1..1389 



sequence 



ATGGGCTACC 
ACCCTTGTTT 
GTTGTCAACT 
GGGTTTCCTT 
GTTTTATTGA 
AAAACTTTCG 
ATAGCAATGA 
ATCCCAGGAG 
ACAGTTACCT. 
TCCCTpVTCT 
TCACTGGGTC 
ATTCAAGCGG 
TACAGTTCTC 
GTGATTTCTG 
TTCACCCAAG 
AGATTTTGTT 
GAGGTAATTG 
ACAGTGATGG 
GTTCTAGAAC 
TGTTATCTGA 
ATGCTTCCCA 



11 

I 

AGAGGCAGGA 
CTGAACATGA 
CGATTATAGG 
TGGGAATATT 
TAAAAGGAGG- 
GCTTTCCAGG 
TAAGTTACAA- 
TTGATCCTGA- 
TTACTCTGCC 
CTACAGGTTT 
CACACATACC 
TCGGGGTTAT 
TAGAAGAACC 
TATTTATCTG 
GGGACTTATT 
ATGGTGTCAC 
CCAATGTGTT 
TCATCACTGT 
TCAATGGTGT 
AACTGTCTGA 
TTGGTGCTGT 



21 
I 

GCCTGTCATC 
GTATAAAGAG 
ATCTGGTATA 
GCTTTTATTC 
GGCCCTCTCT 
GTATCTGCTC 
TATAATAGCT 
AAACGTGTTT 
TTTATCCTTG 
AACAACTCTG 
AAAAACAGAA 
GTCTTTTGCA 
CACAGTAGCT 
TATATTCTTT 
TGAAAATTAC 
TGTCATTTTG 
TTTTGGTGGG 
AGCCACGCTT 
GCTCTGTGCA 
AGAACCAAGG 
GGTGATGGTT 



31 
I 

CCGCCGCAGA 
AAAACCTGTC 
ATAGGATTGC 
TGGGTTTCAT 
GGAACAGATA 
CTCTCTGTTC 
GGAGATACTT 
ATTGGTCGCC 
TACCGAAATA 
ATTCTTGGAA 
GACGCTTGGG 
TTTATTTGCC 
AAGTGGTCCC 
GCTACATGTG 
TGCAGAAATG 
ACATACCCTA 
AATCTTTCAT 
GTGTCATTGC 
ACTCCCCTCA 
ACACACTCCG 
TTTGGATTCG 



41 
I 

GAGATTTAGA 
AGTCTGCTGC 
CTTATTCAAr 
ATGTTACGGA 
CCTACCAGTC 
TTCAGTTTTT 
TGAGCAAAGT 
ACTTCATTAT 
TAGCAAAGCT 
TTGTAATGGC 
TATTTGCAAA 
ACCATAACTC 
GCCTTATCCA 
GATACTTGAC 
ATGACCTGGT 
TGGAATGCTT 
CGGTTTTCCA 
TGATTGATTG 
TTTTTATCAT 
ATAAGATTAT 
TCATGGCTAT 



51 
1 

TGACAGAGAA 

TCTTTTTAAT 

GAAGCAAGCT 

CTTTTCCCTT 

TTTGGTCAAT 

GTATCCTTTT 

TTTTCAAAGA 

TGGACTTTCC 

TGGAAAGGTC 

AAGGGCAATT 

GCCCAATGCC 

CTTCTTAGTT • 

TATGTCCATC; 

ATTTACTGGC' 

AACATTTGGA 

TGTGACAAGA 

CATTGTTGTA 

CCTCGGGATA 

TCCATCAGCC 

GTCTTGTGTC 

TACAAATACT 



2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600: 

3660 

3720 

3780 

3840 

3900. 

3960 

4020. 

4080 

4140 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



441 



caa^c^^o^gIsca ggaaatgttc tactgctttc CTGACAATTT CTCTCTCACA 1320 
AATACCTCAS AGTCTCATGT TCAGCAGACA ACACAACTTT CTACTTTAAA TATTAGTATC 1380 
TTTCAATGA 

Seq ID NO: 669 Protein sequence 
Protein Accession 8: Eos sequence 

1 11 21 j 1 l 1 f 1 

MGYQRQEPVT PPQRDLDDRE TLV5EHEYKE KTCQrSAALFN- WNSIIGSGI IGLPYSMKOA 60 

gfpEgilllf wvsyvtdfsl vllixggals gtdtyqslvn ktfgfpgyll lsvlqflypf 120 

IAMISYNIIA GDTLSKVFQR IPGVDPENVF IGRHFIIGLS TVTFTLPLSL YRNIAKLGKV 180 

SLISTGLTTL ILGIVMARAI SLGPHIPKTE DAWVFAKPNA IQAVGVMSFA FICHHNSFLV 240 

YSSLEEPTVA KWSRLIHMSI VISVFICIFF ATCGYLTFTG FTQGDLFENY CRNDDLVTFG 300 

RFCYGVTVIL TYPMECFVTR EVIANVFFGG ^SSVFHIW TVMVITVATL VSLLIDCLGI 360 

VLELNGVLCA TPLIFIIPSA CYLKLSEEPR THSDXIMSCV MLPIGAWMV FGFVKAITNT 420 
QDCTHGQEMF YCFPDNFSLT NTSESHVQOT TQLSTLNISI FQ' 

Seq ID NO : 670 DNA sequence 

Nucleic Acid Accession 8: Eos sequence 

Coding sequence : 1 . . 1284 

1 11 .21 31 |1 51 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGA GAGGATTGCC TTATTCAATG 60 

AAGCAAGCTG GGTTTCCTTT. GGGAATATTG CTTTTATTCT- GGGTTTCATA TGTTACAGAC 120 

TTTTCCCTTG" TTTTATTGAT- AAAAGGAGGG GCCCTCTCTG- GAACAGATAC CTACCAGTCT- 180 

TTGGTCAATA AAACTTTCGG CTTTCCAGGG; TATCTGCTCC TCTCTGTTCT TCAGITTTTG 240 

TATCCTTTTA TAGCAATGAT. AAGTTACAAT ATAATAGCTG GAGATACTTT C5AGCAAAGTT 300 

TTTCAAAGAA TCCCAGGAGT TGATCCTGAA AACGTGTTTA TTGGTCGCCA CTTCATTATT 360 

GGACTTTCCA CAGTTACCTT, TACTCTGCCT' TTATCCTTGT^ ACCGAAATAT- AGCAAAGCTT 420 

GGAAAGGTCT CCCTCATCTC TACAGGTTTA ACAACTCTGA TTCTTGGAAT TGTAATGGCA 480 

AGGGCAATTT CACTGGGTCC ACACATACCA- AAAACAGAAG ACGCTTGGGT ATTTGCAAAG 540 

CCCAATGCCA TTCAAGCGGT CGGGGTTATG" TCTTTTGCAT. TTATTTGCCA QCATAACTCC . 600 

TTCTTAGTTT- ACAGTTCTCT AGAAGAACCC ACAGTAGCTA AGTGGTCCCG CCTTATCCAT .660 

ATGTCCATCG TGATTTCTGT ATTTATCTGT ATATTCTTTG CTACATGTGG ATACTTGACA, 720 

TtScTGGCT TCACCCAAGG GGACTTATTT GAAAATTACT GCAGAAATGA- TGACCTGGTA 780 

ACAtSgGAA GATTTTGTTA TGGTGTCACT.GTCATTTTGA- CATACCCTAT GGAATGCTTT 840 

OTGACAAGAG AGGTAATTGC CAATGTGTTT TTTGGTGGGA ATCTTTCATC GGTTTTCCAC 900 

A^T^TAA- CAGTGATGGT CATCACTGTA- GCCACGCTTG TGTCATTGCT GATTGATTGC 960 

CTCGGGATAG TTCTAGAACT CAATGGTGTG CTCTGTGCAA CTCCCCTCAT TTTTATCATT 1020 

CCATCAGCCT GTTATCTGAA' ACTGTCTGAA GAACCAAGGA CACACTCCGA TAAGATTATC 1080 

TCTTGTGTCA--TGCTTCCCAT. TGGTGCTGTG GTGATGGTTT TTGGATTCGT CATGGCTATT 1140 

ACAAATACTC AAGACTGCAC CCATGGGCAG GAAATGTTCT ACTGCTTTCC TGACAATTTC 1200 

TCTCTCACAA ATACCTCAGA- GTCTCATGTT CAGCAGACAA CACAACTTTC TACTTTAAAT 1260 
ATTAGTATCT TTCAACTCGA GTAA 

Seq ID NO: 671 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

LyORQEPVI PPQRGLPYSM KQAGFPLGIL LLFWVSYVTD FSLVLLIKGG ALSGTDTYQS 60 
"S PG YLLLSVLQFI> YPFIAMISYN- IIAGDTLSKV FQRIPGVDPE NVFIGRHFII 120 
LSLYRNIAKL GKVSLISTGL TTIiILGIVMA RAISLGPHIP KTEDAWVFAK 180 
PNAIQAVGVM SFAFICHHNS FLVYS SLEEP TVAKWSRLIH- MSIVISVFIC I FFATCGYLT 240 
™f?^dlf ENYCRNDDLV. TFGRFCYGVT VILTYPMECF VTREVIANVF FGGNLSSVFH 300 
IWTVMVITV ATLVSLLIDC IX3IVLELNGV- LCATPLIFII PSACYLKLSE EPRTHSDKIM 360 
SC^GAV CmVFGFVMAI TNTQDCTHGQ EMFYCFPDNF SLTNTSESKV QQTTQLSTLN 420 
ISIFQLE 

Seq ID NO: 672 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 1..1203 

1 11 l 1 I' I' 

' ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGT TTTCCCTTGT mATTGATA 
AAAGGAGGGG CCCTCTCTGG AACAGATACC TACCAGTCTT TGC^CAATAA AACTTTCGGC 120 
TTTCCAGGGT ATCTGCTCCT CTCTGTTCTT CAGTTTTTGT ATCCTTTTAT AGCAATGATA 180 
AGTTACAATA TAATAGCTGG AGATACTTTG AGCAAAGTTT- TTCAAAGAAT CCCAGGAGTT 240 
GATCCTGAAA ACGTGTTTAT TGGTCGCCAC TTCATTATTG GACTTTCCAC AGTTACCTTT 
ACTCTGCCTT TATCCTTGTA CCGAAATATA GCAAAGCTTG GAAAGGTCTC CCTCATCTCT 
ACAGGTTTAA CAACTCTGAT TCTTGGAATT GTAATGGCAA GGGCAATTTC ACTGGGTCCA 
CACATACCAA AAACAGAAGA CGCTTGGGTA TTTGCAAAGC CCAATGCCAT TCAAGCGGTC 
GGGGTTATGT CTTTTGCATT- TATTTGCCAC CATAACTCCT TCTTAGTTTA CAGTTCTCTA 
GAAGAACCCA CAGTAGCTAA GTGGTCCCGC CTTATCCATA TGTCCATCGT GATTTCTGTA 
TTTATCTGTA TATTCTTTGC TACATGTGGA TACTTGACAT TTACTGGCTT C^CCAAGGG 
GACTTATTTG AAAATTACTG CAGAAATGAT GACCTGGTAA CATTTGGAAG ATTTTGTTAT 
GGTGTCACTG TCATTTTGAC ATACCCTATG GAATGCTTTG TGACAAGAGA 
AATGTGTTTT- TTGGTGGGAA TCTTTCATCG GTTTTCCACA TTGTTGTAAC AGTGATGGTC 
ATCACTGTAG CCACGCTTGT. GTCATTGCTG- ATTGATTGCC TCGGGATAGT rCTAGAACTC 
AATGGTGTGC TCTGTGCAAC TCCCCTCATT TTTATCATTC CATCAGCCTG ™CTCAAA 
CTGTCTGAAG AACCAAGGAC ACACTCCGAT AAGATTATGT CTTGTGTCAT GCTTCCCATT 
GGTGCTGTGG TGATGGTTTT TGGATTCGTC ATGGCTATTA CAAATACTCA AGACTGCACC 
CATGGGCAGG AAATGTTCTA CTGCTTTCCT GACAATTTCT CTCTCACAAA ^CCTCAGAG 
TCTCATGTTC AGCAGACAAC ACAACTTTCT ACTTTAAATA TTAGTATCTT TCAACTCGAG 



60 



240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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Seq ID NO: 673 Protein sequence 
protein Accession 8: Eos sequence 



10 
15 
20 
25 
30 
35 
40 

45 

50 

55 

60 

65 

70 

75 

80 

85 



1 
I 

MGYQRQEPVI 
SYN1IAGDTL 
TGLTTLILGI 
EEPTVAKWSR 
GVTVILTYPM 
NGVLCATPLI 
HGQEMPYCFP 



11 
I 

PPQFSLVLLI 
SKVFQRIPGV 
VMARAISLGP 
LIHMSIVISV 
ECFVTREVIA 
FIIPSACYUC 
DNFSLTNTSE 



21 
I 

KGGALSGTDT 
DPENVFIGRH 
HIPKTEDAWV 
PICIFFATCG 
NVFFGGNLSS 
LSEEPRTHSD 
SHVQQTTQLS 



31 

I 

YQSLVNKTFG 
FIIGLSTVTF 
FAKPNAIQAV. 
YLTFTGPTQG 
VFHIWTVMV 
KIMSCVMLPI 
TLNISIPQLS 



41 
I 

FPGYLLLSVL 
TLPLSLYRNI 
GVMSPAFICH 
DliFENYCRND 
ITVATLVSLL 
GAWMVFGFV 



51 

I ' 

QPLYPFIAMI 
AKLGKVSLIS 
HNSFLVYSSL 
DLVTFGRFCY 
IDCLGIVLEL 
MAITNTQDCT 



Seq ID NO; 674 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1. .1140 



1 

I 

ATGGGCTACC 
CCAGGGTATC 
TACAATATAA 
CCTGAAAACG 
CTGCCTTTAT 
GGTTTAACAA 
ATACCAAAAA 
GTrATGTCTT 
GAACCCACAG 
ATCTGTATAT 
TTATTTGAAA' 
GTCACTGTCA 
GTGTTTTTTG 
ACTGTAGCCA 
GGTGTGCTCT 
TCTGAAGAAC 
GCTGTGGTGA 
GGGCAGGAAA 
CATGTTCAGC 



11 
I 

AGAGGCAGGA 
TGCTCCTCTC 
TAGCTGGAGA 
TGTTTATTGG 
CCTTGTACCG 
CTCTGATTCT 
CAGAAGACGC 
TTGCATTTAT 
TAGCTAAGTG 
TCTTTGCTAC 
ATTACTGCAG 
TTTTGACATA 
GTGGGAATCT 
CGCTTGTGTC 
GTGCAACTCC 
CAAGGACACA 
TGGTTTTTGG 
TGTTCTACTG 
AGACAACACA 



21 
I 

GCCTGTCATC 
TGTTCTTCAG 
TACTTTGAGC 
TCGCCACTTC 
AAATATAGCA. 
TGGAATTGTA 1 
TTGGGTATTT 
TTGCCACCAT 
GTCCCGCCTT 
ATGTGGAT AC 1 
AAATGATGAC 
CCCTATGGAA 
TTCATCGGTT 
ATTGCTGATT 
CCTCATTTTT 
CTCCGATAAG 
ATTCGTCATG 
CTTTCCTGAC 
ACTTTCTACT 



31 
I 

CCGCCGCAGG 
TTTTTGTATC 
AAAGTTTTTC 
ATTATTGGAC 
AAGCTTGGAA 
ATGGCAAGGG 
GCAAAGCCCA 
AACTCCTTCT 
ATCCATATGT 
TTGACATTTA 
CTGGTAACAT 
TGCTTTGTGA 
TTCCACATTG 
GATTGCCTCG 
ATCATTCCAT 
ATTATGTCTT 
GCTATTACAA 
AATTTCTCTC 
TTAAATATTA 



41 
I 

TCAATAAAAC 
CTTTTATAGC 
AAAGAATCCC 
TTTCCACAGT 
AGGTCTCCCT 
CAATTTCACT 
ATGCCATTCA 
TAGTTTACAG 
CCATCGTGAT 
CTGGCTTCAC 
TTGGAAGATT- 
CAAGAGAGGT 
TTGTAACAGT 
GGATAGTTCT 
CAGCCTGTTA 
GTGTCATGCT 
ATACTCAAGA 
TCACAAATAC 
GTATCTTTCA 



51 
I 

TTTCGGCTTT 

AATGATAAGT 

AGGAGTTGAT 

TACCTTTACT 

CATCTCTACA 

GGGTCCACAC 

AGCGGTCGGG 

TTCTCTAGAA 

TTCTGTATTT 

CCAAGGGGAC 

TTGTTATGGT. 

AATTGCCAAT 

GATGGTCATC 

AGAACTCAAT 

TCTGAAACTG 

TCCCATTGGT 

CTGCACCCAT 

CTCAGAGTCT 

ACTCGAGTAA 



Seq ID NO: 675 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 

(111 
MGYQRQEPVI PPQVNKTFGF PGYLLLSVLQ FLYPFIAMIS 
PENVFIGRHF IIGLSTVTFT- LPLSLYRNIA KLGKVSLIST 
IPKTEDAWVF AKPNAIQAVG VMSFAFICHH NSFLVYSSLE 
ICIFFATCGY LTFTGFTQGD LFENYCRNDD LVTFGRFCYG 
VFFGGNLSSV FHIWTVMVI TVAThVShLI DCLGIVLELN 
SEEPRTHSDK IMSCVMLPIG AWMVFGFVM AITNTQDCTH 
HVQQTTQLST LNISIFQLE 

Seq ID NO: 676 DNA sequence 

Nucleic Acid Accession #:. NMJ>06a53.1 

Coding sequence : 26 . .874 ; 



41 
I 

YNI IAGDTIiS 
GLTTLILGIV 
EPTVAKWSRI. 
VTVILTYPME 
GVLCATPLIP 
GQEMFYCFPD 



51 

I 

KVFQRIPGVD 
MARAISLGPH 
IHMSIVISVF 
CFVTREVIAN 
IIPSACYLKL 
NFSIiTNTSES 



1 
I 

AGGAATCTGC 
ATCGGGCAGA 
CATGAGGATT 
CAGGATCATC 
CGAGAAGACG 
AGCCCACTGC 
GGAGGGCTGT 
CAGCCTCCCC 
CTCCATCACC 
CAGCTGCCTC 
CTTGCGATGC 
CAACATCACA 
GGGTGACTCC 
CCAGGATCCG 
GGACTGGATC 
ACCCTCCATT 
CAAGACCCTC 
AATCAACCTG 
GACTCTGGGA 
TCCTGGCCAT 



11 
I 

GCTCGGGTTC 
GGTCTCACAG 
CTGCAGTTAA 
AAGGGGTTCG 
CGGCTACTCT 
CTCAAGCCCC 
GAGCAGACCC 
AACAAAGACC 
TGGGCTGTGC. 
ATTTCCGGCT 
GCCAACATCA 
GACACCATGG 
GGGGGCCCTC 
TGTGCGATCA 
CAGGAGACGA 
TCCACTTGGT 
TACGAACATT 
GGGTTCGAAA 
"ATGACAACAC 
ATATCAAGGT 



21 
I 

CGCAGATGCA 
CAGCCAAGGA 
TCCTGCTTGC 
AGTGCAAGCC 
GTGGGGCGAC 
GCTACATAGT 
GGACAGCCAC 
ACCGCAATGA 
GACCCCTCAC 
GGGGCAGCAC 
CCATCATTGA 
TGTGTGCCAG 
TGGTCTGTAA- 
CCCGAAAGCC 
TGAAGAACAA 
GTTTGGTTCC 
CTTTGGGCCT 
TCAGTGAGAC 
CTGGTTTGTT 
TTCAATAAAT 



31 



41 



GAGGTTGAGG 
ACCTGGGGCC 
TCTGGCAACA 
TCACTCCCAG 
GCTCATCGCC 
TCACCTGGGG 
TGAGTCCTTC 
CATCATGCTG 
CCTCTCCTCA 
GTCCAGCCCC 
GCACCAGAAG 
CGTGCAGGAA 
CCAGTCTCTT 
TGGTGTCTAC 
TTAGACTGGA 
TGTTCACTCT 
CCTGGACTAC 
CTGGATTCAA 
CTCTGTTGTA 
ATTPGCTAAA 



TGGCTGCGGG 
CGCTCCTCCC 
GGGCTTGTAG 
CCCTGGCAGG 
CCCAGATGGC 
CAGCACAACC 
CCCCACCCCG 
GTGAAGATGG 
CGCTGTGTCA 
CAGTTACGCC 
TGTGAGAACG 
GGGGGCAAGG 
CAAGGCATTA 
ACGAAAGTCT 
CCCACCCACC 
GTTAATAAGA 
AGGAGATGCT 
ATTCTGCCTT 
TCCCCAGCCC 
TGAGTG 



51 

r 

ACTGGAAGTC 
CCCTCCAGGC 
GGGGAGAGAC 
CAGCCCTGTT 
TCCTGACAGC 
TCCAGAAGGA 
GCTTCAACAA 
CATCGCCAGT 
CTGCTGGCAC 
TGCCTCACAC 
CCTACCCCGG 
ACTCCTGCCA 
TCTCCTGGGG 
GCAAATATGT 
ACAGCCCATC 
AACCCTAAGC 
GTCACTTAAT 
GAAATATTGT 
CAAAGACAGC 



Seq, ID NOT 677 Protein sequence 
protein Accession §: NP_006844.1 
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60 
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60 
120 
180 
240 
300 
3 60 
420 
480 
540 
600 
. 660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



1 ' ; ".11 .-. : 21 31 41 51 % 

JlRIWLILLA LATGLVGGET RIIKGFECKP HSQPWQAALF EKTRLIiCGAT LIAPRWIiLTA 



60 



443 



WO 02/086443 

AHCUCPRYIV KLOQHNLQKE EGCEQTRTAT ESFPHPG^ ■»»» ^0 
DWIQETMKNN 

Seq ID NO: 678 UNA sequence 

Nucleic Acid Accession # : 1 Eos sequence 

Coding sequence r 1 .. 9 3 3 

i »' ;• f "• P I 1 K 

L * L : i^^r^ r-rn^rRRGCT GCCTGACTGC 60 

120 



EE ass.ssss ssss == 

GACACCAGCC ACAGCCCGGG GCAGCCTGGC CCCCAGGAGG GCACTGCTGA GCCCAGGGAC 
TCTGAGCCGA. GCCAGGGCAC TGAAGAAGTA TAA 



180 
240 
300 
360 
420 
480 
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780 
840 
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Seq lb NO: -' 679 Protein sequence 
Protein Accession 8: Eos sequence 



JcSNGHdPC LoCCCPPC PDKSOmCHCP FPOSGIHCI ^ « 

SEPSQGTEEV 

Seq ID NO: 680 DNA sequence 
Nucleic Acid Accession #: S78203.1 
Coding sequence: 1;-2190 

====== s 

GCAGCCATTG CTGACTCGTG GTTGGGAAAA SScS 420 

iiiii I 

SSSS ~»s jr.? 

• 3sss iil lib sssek. - 

GCTGTTGGTA TGATCCTAGC G^CTGGCA " aKgAAT 1260 

ATAAAT(^.T(^CCAGC.CC^TCAGGT. CCCCAGGAGG tcTGTTGATA. 1320 

CTGGCAGATG ATGAGGTGAA GGTGACAGTG GTGGGAAATC JAAACAA ^ 13B0 

GAGTCCATCA. AATCCTTTCA GAAAACACCA «CTATTCCA tgAGCATTCT 1440 

AGCCAGGATT TTCAOTTCCA CCTGAAATAT. CACAATTTGT CTCTCTA lg00 

• GTGCAGGAGA AGAACTGGTA CAGTCTTC-TC a £G°£™~ TGACAACCGT GAGGTTTGTT 1SS0 
ATGATGGTAA AGGATACAGA AAGCAAAACA ACCAATGGGA ^ACAACCGT ^ 
AACACTTTGC ATAAAGATGT- * CAACATCTCC CTGAGTACAG ATACCTCTCT CA^GTTGG 
GAAGACTATG GTGTGTCTGC rTATAGAACT GTGCAAAGAG GAOAATACCC 

TGTAGAACAG AAGATAAGAA CTTTTCTCTG ifir prTRGAA GATTGAAGAC 1800 

TATCTGTTTG TTATTACTAA TAACACCAAT ^TCTTC ££^ GCT 1860 

ATTCCAGCCA ACAAAATGTC GATTGCGTGG GAGCTACCAC ^ATGC^ ? CCCSCIMC 1920 

GGGGAGGTCA TGTTCTCTGT CACAGGTCTT- GAGTTTTCTT ^CTCA^ lgg0 

SS = 5SSSS = -SSS5S -ssS SS 

™ = = = 2,0 

AAACTAGAGA CCAAGAAGAC AAAACTCTGA 
Seq ID NO: 681 Protein sequence 

•Protein -Accession ft: AAB34388.1 - 
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15 

20 

25 

30 

35 
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1 
I 

KNPFQKNSSK 
YGMKAVLILY 
YVLGHVIKSL 
TRYFSVFYLS 
IYNKPPPEGN 
RVLPLYIPLP 
FVIYRLVSKC 
LADDEVKVTV 
VQEKNWYSLV 
EDYGVSAYRT 
IPANKMSIAW 
LWAQFSGLV 
KLETKKTKL 



11 

r- 

ETLFSPVSIB 
FLYFLHWNED 
GALPILGGQV. 
INAGSLISTF 
IVAQVFKCIW 
MFWALLDQCG 
GINFSSLRKM 
VGNENNSLLI 
IREDGNSISS 
VQRGEYPAVH 
QLPQYAIiVTA 
QWAEFILFSC 



21 

I- 

EVPPRPPSPP 
TSTSIYHAFS 
VHTVLSLIGL 
ITPMLRGDVQ 
FAISNRFKNR 
SRWTLQAIRM 
AVGMI LACLA- 
ESIKSFQKTP 
KMVKDTESKT 
CRTEDKKFSL 
GEVMFSVTGL 
LLLVTCLIFS 



31 
I 

KKPSPTICGS 
SLCYFTPIUG 
SLIALGTGGI 
CFGEDCYALA 
SGDIPKRQHW 
NRNLGFFVUQ 
FAVAAAVEIK 
HYSKLHLKTX 
TNGMTTVRFV- 
NLGIiLDFGAA 
EFSYSQAPSS 
IMGYYYVPVK 



41 

NYPLSIAFIV 
AAIADSWLGK 
KPCVAAFGGD 
FGVPGLLMVI 
LDKAAEKYPK- 
PDQMQVLNPF 
INEMAPAQSG 
SQDFHFHMCY 
NTLHKDVNIS 
YLFVITNNTN 
MKSVLQAAWL 
TEDMRGPADK 



51 
I 

VNEFCERFSY 
FKTIIYLSLV 
QFEEKHAEER 
ALWFAMGSK 
QLIMDVKALT 
LVLIFIPLFD" 
PQEVFLQVUJ 
HNLSLYTEHS 
LSTDTSLNVG 
QGLQAWKIED 
LTIAVGNIIV 
HIPHIQGNMI 



Seq ID NO:- 682 DNA sequence 

Nucleic Acid Accession #: NM_016077.1 

Coding sequence: 128.. 667 



TCGCTTTGTG 
CGCGATAGAA 
ACTGTAGATG 
CTTGGCTGTT 
GATGCTCCCC 
CTTGGGAGAC; 
AAAAGGGAAA- 
AAGAAGAAAT 
CAAAGCTCCT 
GACTGTAAGT 
CCTAGGGATT 
TTACTAGGTG 
GATTCTAACA 
AAACCTATTC 



11 
I 

ATTCTTGATC 
ACGTGTTCGC 
CCCTCCAAAT 
GGAGTTGCTT. 
AAAAGCAAGA 
AGCGGGGAGT 
GTGGCTGCCC 
CCTGAAATGC 
GATGAAGAAA' 
TTAATTCAAG 
GGGCCAGGAC 
GACTTTGATA 
ACAAAAGCTG 
CCATGTTCTA 



21 

I • 

CGGAACTTTG 
TTGCCCAGAA 
CCTTGGTTAT 
GTGGCATGTG 
CGAGCAAGAC 
ACAAGATGAT 
AGTGCTCTCA 
TCAAACAATG 
CCCTGATTGC 
ATGCTGGACG 
CAGCAGACCT 
TGACAACAAC 
AATTTCTTCA 
AAAAAA 



31 
I 

TCACCCAGGA 
GAAGGGAAGG 
GGAATATTTG 
CCTGGGCTGG 
ACACACAGAT 
TCTTGTGGTT 
TGCTGCTGTT 
GGAATACTGT 
ATTATTGGCC 
TACTCAGATT 
AATTGACAAA 
CCCTCCATCA 
CCCAACTTAA 



41 

I 

ACCCCGGAAG 
CGCGAGTGAG 
GCTCATCCCA 
AGCCTTCGAG 
ACTGAAAGTG 
OGAAATGACT 
TCAGCCTACA 
GGCCAGCCCA 
CATGCAAAAA 
GCACCAGGCT 
GTCACTGGTC 
CAAGTGTTTG 
ATGTTCTTGA 



51 



1 

AGGTAGCTCA 
GAAAGGAGGT 
GTACACTCGG 
TATGCTTTGG 
AAGCAAGCAT 
TAAAGATGGG 
AGCAGATTCA 
AGGTGGTGGT 
TGCTGGGACT 
CTCAAACTGT 
ACCTAAAACT 
AAGCCTGTCA 
GATGAAAATA 



Seq ID NOs 683 Protein sequence. 
Protein Accession &: NP_0 57161.1 



11 



21 .31 41 51 

I I I I ■ 

MPSKSLVMEY LAHPSTLGLA VGVACGMCLG WSLRVCFGML PKSKTSKTHT DTE SEAS I LG 
DSGEYKMILV VRNDliKMGKG" KVAAQCSHAA VSAYKQIQRR NPEMLKQWBY CGQPKVWKA 
PDEETLIALL AHAKMLGLTV SLIQDAGRTQ IAPGSQTVLG IGPGPADLID KVTGHLKLY 

Seq ID NO: 684 DNA sequence 

Nucleic Acid Accession U: NM_004864.1 

Coding sequence: 26.. 952 



CGGAACGAGG 
TCAGATGCTC 
GGCCGAGGCG 
ATTCCGAGAG 
CTGGGAAGAT 
AGTGCGGCTG 
GGGGCTCCCC 
AAGGTCGTGG 
GCCCGCGCTG 
ATCTTCGTCC 
CCGCAGAGCG 
TCTGCACACG 
ACGGGAGGTG 
CATGCACGCG 
CTGCTGCGTG 
GTCGCTCCAG 
GGTCCTTCCA 
GGGCTCAAGG 
TTATTTATTA 
ACTGTGTATT 



11 

! 

GCAACCTGCA 
CTGGTGTTGC 
AGCCGCGCAA 
TTGCGGAAAC 
TCGAACACCG 
GGATCCGGCG 
GAGGCCTCCC 
GACGTGACAC 
CACCTGCGAC 
GCACGGCCCC 
CGTGCGCGCA 
GTCCGCGCGT 
CAAGTGACCA 
CAGATCAAGA 
CCCGCCAGCT 
ACCTATGATG 
CTGTGCACCT 
TTCCTGAGAC 
TTAATTTATT 
TATTTAAAAC 



21 
I 

CAGCCATGCC 
TGGTGCTCTC 
GTTTCCCGGG 
GCTACGAGGA 
ACCTCGTCCC 
GCCACCTGCA 
GCCTTCACCG 
GACCGCTGCG 
TGTCGCCGCC 
AGCTGGAGTT 
ACGGGGACGA 
CGCTGGAAGA 
TGTGCATCGG 
CGAGCCTGCA 
ACAATCCCAT 
ACTTGTTAGC 
GCGCGGGGGA 
ACCCGATTCC 
GGGGTGACCT 
TCTGGTGATA 



31 

! 

CGGGCAAGAA 
GTGGCTGCCG 
ACCCTCAGAG 
CCTGCTAACC 
GGCCCCTGCA 
CCTGCGTATC. 
GGCTCTGTTC 
GCGTCAGCTC 
GCCGTCGCAG 
GCACTTGCGG 
CTGTCCGCTC 
CCTGGGCTGG 
CGCGTGCCCG 
CCGCCTGAAG 
GGTGCTCATT 
CAAAGACTGC 
GGCGACCTCA 
TGCCCAAACA 
TCTTGGGGAC 
AAAATAAAGC 



41 
I 

CTCAGGACGG 
CATGGGGGCG 
TTGCACTCCG 
AGGCTGCGGG 
GTCCGGATAC 
TCTCGGGCCG 
CGGCTGTCCC 
AGCCTTGCAA 
TCGGACCAAC 
CCGCAAGCCG 
GGGCCCGGGC 
GCCGATTGGG 
AGCCAGTTCC 
CCCGACACGG 
CAAAAGACCG 
CACTGCATAT 
GTTGTCCTGC 
GCTGTATTTA 
TCGGGGG CTG 
TGTCTGAACT 



51 
I 

TGAATGGCTC 
CCCTGTCTCT 
AAGACTCCAG 
CCAACCAGAG. 
TCACGCCAGA 
CCCTTCCCGA 
CGACGGCGTC 
GACCCCAAGC 
TGCTGGCAGA 
CCAGGGGGCG 
GTTGCTGCCG. 
TGCTGTCGCC. 
GGGCGGCAAA 
AGCCAGCGCC 
ACACCGGGGT 
GAGCAGTCCT 
CCTGTGGAAT 
TATAAGTCTG 
GTCTGATGGA 
GTTAAAAAAA 



Seq ID NO: 685 Protein sequence 
Protein Accession ftt NP_004855.1 



1 • 11- 

I I 
KPGQELRTVN GSQMLLVLLV 
EDLLTRLRAN QSWEDSNTDL 
HRALFRLSPT ASRSWDVTRP 
ELHLRPQAAR GRRRARARNG 
IGACPSQPRA ANMHAQIKTS 
LAKDCHCI 



21 31 41 51 

I " I i I 

LSWLPHGGAL SLAEASRASF PGPSELHSED SRFRELRKRY 
VPAPAVRILT PEVRLGSGGH LHLRISRAAL PEGLPEASRL 
LRRQLSLARP QAPALHLRLS PPPSQSDQLL AESSSARPQL 
DDCPLGPGRC CRLHTVRASL EDLGWADWVL SPREVQVTMC 
LHRLKPDTEP APCCVPASYN PMVLIQKTDT GVSLQTYDDL 



60 
120 
180 
240 
.300 
360 
420 
480 
540 
600 
660 
720- 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 
240 
■300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900. 
960 
1020 
1080 
1140 
1200 



60 
120 
180 
240 
300 



Seq ID NO: 686 DNA sequence 
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Nucleic Acid Accession #: NM_002423.2 
Coding sequence: 48.-851 

i • ii 21 31 41 51 

| 1 ' ! I I I 

ACCAAATCAA CCATAGGTCC AAGAACAATT GTCTCTGGAC GGCAGCTATG CGACTCACCG 60 

TGCTGTGTGC TGTGTGCCTG CTGCCTGGCA GCCTGGCCCT GCCGCTGCCT CAGGAGGCGG 120- 

GAGGCATGAG TGAGCTACAG' TGGGAACAGG CTCAGGACTA TCTCAAGAGA TTTTATCTCT 180 

ATGACTCAGA AACAAAAAAT GCCAACAGTT TAGAAGCCAA ACTCAAGGAG ATGCAAAAAT 240 

TCTTTGGCCT ACCTATAACT GGAATGTTAA ACTCCCGCGT CATAGAAATA ATGCAGAAGC : 300 

CCAGATGTGG -AGTGCCAGAT GTTGCAGAAT ACTCACTATT TCCAAATAGC CCAAAATGGA 360 

CTTCCAAAGT GGTCACCTAC. AGGATCGTAT CAT A TACT CG AGACTTACCG .CArATTACAG.- 420 

TGGATCGATT AGTGTCAAAG" GCTTTAAACA TGTGGGGCAA AGAGATCCCC CTGCATTTCA 480 

GGAAAGTTGT ATGGGGAACT GCTGACATCA TGATTGGCTT TGCGCGAGGA GCTCATGGGG 540 

ACTCCTACCC ATTTGATGGG CCAGGAAACA CGCTGGCTCA TGCCTTTGCG CCTGGGACAG 600 

GTCTCGGAGG AGATGCTCAC TTCGATGAGG" ATGAACGCTG " GA CGGA TGGT . AGCAGTCTAG : 660 

GGATTAACTT CCTGTATGCT GCAACTCATG AACTTGGCCA TTCTTTGGGT ATGGGACATT 720 

CCTCTGATCC TAATGCAGTG ATGTATCCAA CCTATGGAAA TGGAGATCCC CAAAATTTTA 780 

AACTTTCCCA GGATGATATT AAAGGCATTC AGAAACTATA TGGAAAGAGA - AGTAATTCAA 840 

GAAAGAAATA GAAACTTCAG GCAGAACATC CATTCATTCA TTCATTGGAT TGTATATCAT 900 

TGTTGCACAA- TCAGAATTGA TAAGCACTGT TCCTCCACTC CATTTAGCAA TTATGTCACC 960 

CTTTTTTATT GCAGTTGGTT TTTGAATGTC TTTCACTCCT TTTATTGGTT AAACTCCTTT 1020 

ATGGTGTGAC TGTGTCTTAT TCCATCTATG AGCTTTGTCA GTGCGCGTAG ATGTCAATAA 1080 
ATGTTACATA CACAAATAAA TAAAATGTTT ATTCCATGGT AAATTTA 

Seq ID NO: 687 Protein sequence 
Protein Accession fl: NP_002414.1 

1 11 21 

I I I 

MRLTVLCAVC LLPGSlALPIi PQEAGGMSEL 
EMQKPFGLPI TGMLNSRVIE IMQKPRCGVP 
PHITVDRLVS KALNMWGKEI PLHFRKWWG 
APGTGLGGDA HFDEDERWTD GSSIiGINFLY 
PQNFKLSQDD IKGIQKLYGK RSNSRKK 



31 41 51 

| I I 

QWEQAQDYLK RPYLYDSETK NANSLEAKLK 60 

DVAEYSLFPN SPKHTSK\WT YRIVSYTRDL 120 

TADIMIGFAR GAHGDSYPPD GPGNTLAHAF 180 

AATHBIiGHSL GMGHSSDPNA VKYPTYGNGD 240 



Seq ID NO: 688 DNA sequence 

Nucleic Acid Accession ft: NM_005221.3 

Coding sequence: 1..870 

1 11 21 31 41 51 

ATGACAGGAG TGTTTGACAG AAGGGTCCCC AGCATCCGAT CCGGCGACTT CCAAGCTCCG 60 

TTCCAGACGT CCGCAGCTAT GCACCATCCG- TCTCAGGAAT CGCCAACTTT GCCCGAGTCT 120 

TCAGCTACCG ATTCTGACTA CTACAGCCCT ACGGGGGGAG CCCCGCACGG CTACTGCTCT 180 

CCTACCTCGG CTTCCTATGG CAAAGCTCTC AACCCCTACC AGTATCAGTA- TCACGGCGTG 240 

AACGGCTCCG CCGGGAGCTA- CCCAGCCAAA GCTTATGCCG ACT AT AG CT A CGCTAGCTCC 300 

TACCACCAGT. ACGGCGGCGC CTACAACCGC GTCCCAAGCG CCACCAACCA GCCAGAGAAA 360 

GAAGTGACCG AGCCCGAGGT- GAG AATGGTG AATGGCAAAC CAAAGAAAGT TCGTAAACCC 420 

AGGACTATTT ATTCCAGCTT TCAGCTGGCC GCATTACAGA GAAGGTTTCA GAAGACTCAG 480 

TACCTCGCCT TGCCGGAACG- CGCCGAGCTG GCCGCCTCGC TGGGATTGAC ACAAACACAG 540 

GTGAAAATCT GGTTTCAGAA CAAAAGATCC AAGATCAAGA AGATCATGAA AAACGGGGAG- 600 

ATGCCCCCGG AGCACAGTCC CAGCTCCAGC GACCCAATGG CGTGTAACTC GCCGCAGTCT 660 

CCAGCGGTGT GGGAGCCCCA GGGCTCGTCC" CGCTCGCTCA GCCACCACCC TCATGCCCAC 720 

CCTCCGACCT CCAACCAGTC CCCAGCGTCC AGCTACCTGG AGAACTCTGC ATCCTGGTAC 780 

ACAAGTGCAG CCAGCTCAAT CAATTCCCAC CTGCCGCCGC CGGGCTCCTT ACAGCACCCG 840. 
CTGGCGCTGG CCTCCGGGAC ACTCTATTAG 

Seq ID NO: 689^ Protein sequence 
Protein Accession tt: NP_005212.1 

1 ll 21 31 41 51 

MTGVFDRRVP STRSGDFQAP FQTSAAMHHP SQESPTLPES SATDSDYYSP TGGAPHGYCS 60 

PTSASYGKAL NPYQYQYHGV NGSAGSYPAK AYADYSYASS YHQYGGAYNR VPSATNQPEK 120 

EVTEPEVRMV NGXPKKVRKP RTIYSSPQLA ALQRRFQKTQ YLALPERAEL AASLGLTQTQ 180 

VKIWFQNKRS KIKKIMKNGE MPPEHSPSSS DPMACNSPQS PAVWEPQGSS RSLSHHPHAH 240 
PPTSNQSPAS SYLENSASWY TSAASSINSH LPPPGSLQHP LALASGTLY 



446 



WO 02/086443 



PCT/US02/12476 



It is understood that the examples described above in no way serve to limit the true 
jcope of this invention, but rather are presented for illustrative purposes. All publications, 
sequences of accession numbers, and patent applications cited in this specification are herein 
jicorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 

1 L A method of detecting a lung cancer-associated transcript in a cell 

2 from a patient, the method comprising contacting a biological sample from the patient witha 

3 polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1 A- 1 6. : 

1 2 . The method of claim 1 , wherein the polynucleotide selectively 

2 hybridizes to a sequence at least 95% identical to a sequence as shown in Tables 1 A-16: 

1 3 . The method of claim 1 , wherein the biological sample is a tissue 

2 sample. ... 

1 4. The method of claim 1 , wherein the biological sample comprises 

2 isolated nucleic acids. 

1 5. The method of claim 4, wherein the nucleic acids are mRNA. 

1 6. The method of claim 4, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 7. The method of claim 1, wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1 A-16. 

1 8. The method of claim 1 , wherein the polynucleotide is labeled. 

1 9. The method of claim 8, wherein the label is a fluorescent label. 

1 1 0. The method of claim 1 , wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 1 1 . The method of claim 1 , wherein the patient is undergoing a therapeutic 

2 regimen to treat lung cancer. . 

1 12. The method of claim 1, wherein the patient is suspected of having lung 

2 cancer. 

1 13. A method of monitoring the efficacy of a therapeutic treatment of lung 

2 cancer, the method comprising the steps of: 
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3 (i) providing a biological sample from a patient undergoing the therapeutic 

4 , treatment; and '..V 

5 (ii) determining the level of a lung cancer-associated transcript in the 

6 biological sample by contacting the biological sample with a polynucleotide that selectively 

7 hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1 A- 16, 

8 thereby monitoring the efficacy of the therapy. 

1 14. The method of claim 13, further comprising the step of: (iii) comparing 

2 the level of the lung cancer-associated transcript to a level of the lung cancer-associated 

3 transcript in.a biological sample from the patient prior to, or earlier in, the. therapeutic 

4 treatment. 

1 15. The method of claim 13, wherein the patient is a human. 

1 1 6. A method of monitoring the efficacy of a therapeutic treatment of lung 

2 cancer, the method comprising the steps of: 

3 (i) providing a biological sample from a patient undergoing the therapeutic 

4 treatment; arid 

5 (ii) determining the level of a lung cancer-associated antibody in the biological 

6 sample by contacting the biological sample with a polypeptide encoded by a polynucleotide 

7 that selectively hybridizes to a sequence at least 80% identical to a sequence as shown in 

8 Tables 1 A-l 6, wherein the polypeptide specifically binds to the lung cancer-associated 

9 antibody, thereby monitoring the efficacy of the therapy. 



1 17. The method of claim 16, further comprising the step of: (iii) comparing 

2 the level of the lung cancer-associated antibody to a level of the lung cancer-associated 

3 antibody in a biological sample from the patient prior to, or earlier in, the therapeutic 

4 treatment. 

1 18. The method of claim 1 6, wherein the patient is a human. 

1 .19. A method of monitoring the efficacy of a therapeutic treatment of lung 

2 cancer, the method comprising the steps of: 

3 (i) providing a biological sample from a patient undergoing the therapeutic 

4 treatment; and 
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5 (ii) determining the level of a lung cancer-associated polypeptide in the 

6 . biological sample by contacting the biological sample with an antibody, wherein the antibody 

7 specifically binds to a polypeptide encoded by a polynucleotide that selectively hybridizes to 

8 a sequence at least 80% identical to a sequence as shown in Tables 1A-16, thereby 

9 monitoring the efficacy of the therapy. 

1 20. The method of claim 19, further comprising the step of: (iii) comparing 

2 the level of the. lung cancer-associated polypeptide to a level of the lung cancer-associated 

3 polypeptide in a biological sample from the patient prior to, or earlier in, the therapeutic 

4 treatment. ' 

21 . The method of claim 19, wherein the patient is a human. 

22. An isolated nucleic acid molecule consisting of a polynucleotide 
sequence as shown in Tables 1A-16. 

23. The nucleic acid molecule of claim 22, which is labeled. 

24. The nucleic acid of claim 23, wherein the label is a fluorescent label 

25. An expression vector comprising the nucleic acid of claim 22. 

26. A host cell comprising the expression vector of claim 25 . 

27. An isolated polypeptide which is encoded by a nucleic acid molecule 
having polynucleotide sequence as shown in Tables 1 A-l 6. 

28. An antibody that specifically binds a polypeptide of claim 27. 

29. The antibody of claim 28, further conjugated to an effector component. 

30. The antibody of claim 29, wherein the effector component is a 
fluorescent label. 

31. The antibody of claim 29, wherein the effector component is a 
radioisotope or a cytotoxic chemical. 

32. The antibody of claim 29, which is an antibody fragment. 
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1 33 . The antibody of claim 29, which is a humanized antibody 

1 34, A method of detecting a lung cancer cell in a biological sample from a 

2 patient, the method comprising contacting the biological sample with an antibody of claim 

3 28. 

1 35. The method of claim 34, wherein the antibody is further conjugated to 

2 an effector component. 



1 36. The method of claim 35, wherein the effector component is a 

2 fluorescent label. 

1 37. A method of detecting antibodies specific to lung cancer in a patient, 

2 the method comprising contacting a biological sample from the patient with a polypeptide 

3 encoded by a nucleic acid comprises a sequence from Tables 1 A-16. 

1 38. A method for identifying^ compound that modulates a lung cancer- 

2 associated polypeptide, the method comprising the steps of: 

3 (i) contacting the compound with a lung cancer-associated polypeptide, the 

4 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 

5 80% identical to a sequence as shown in Tables 1 A-16; and 

6 (ii) determining the functional effect of the compound upon the polypeptide. 

1 39. The method of claim 38, wherein the functional effect is a physical 

2 effect. * 

1 40. The method of claim 38, wherein the functional effect is a chemical 

2 effect. ■ 

1 4 1 . The method of claim 38, wherein the polypeptide is expressed in a 

2 eukaryotic host cell or cell membrane. 

1 42. The method of claim 38, wherein the functional effect is determined by 

2 measuring ligand binding to the polypeptide. 

1 43. The method of claim 38, wherein the polypeptide is recombinant. 
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1 44. A method of inhibiting proliferation of a lung cancer-associated cell to 

2 treat lung cancer in a patient, the method comprising the step of administering to the subject a 

3 therapeutically effective amount of a compound identified using the method of claim 38. 

1 45 . The method of claim 44, wherein the compound is an antibody. 

1 46. The method of claim 45, wherein the patient is a human. 

1 47. A drug screening assay comprising the steps of 

2 (i) administering a test compound to a mammal having lung cancer or a cell 

3 isolated therefrom; 

4 (ii) comparing the level of gene expression of a polynucleotide that selectively 

5 hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1 A-l 6 in a 

6 treated cell or mammal with the level of gene expression of the polynucleotide in a .control 

7 cell or mammal, wherein a test compound that modulates the level of expression of the 

8 polynucleotide is a candidate for the treatment of lung cancer. 

1 . 48. The assay of claim 47, wherein the control is a mammal with lung 

2 cancer or a cell therefrom that has not been treated with the test compound. 

1 49. The assay of claim 47, wherein the control is a normal cell or mammal. 

1 . 50. A method for treating a mammal having lung cancer comprising 

2 administering a compound identified by the assay of claim 47. 

1 5 1 . . A pharmaceutical composition for treating a mammal having lung 

2 cancer, the composition comprising a compound identified by the assay of claim 47 and a 

3 physiologically acceptable excipient. 
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methods for treatment of the human body by surgery or therapy, 
methods for treatment of the animal body by surgery or therapy, 
diagnostic methods practised on the human or animal body, 
mere presentations of information. 

computer programs for which this International Searching Authority is not equipped to search prior art. 

The failure of the following parts of the international application to comply with prescribed requirements prevents a 
meaningful search from being carried out: 

the description Q the claims ■ Q the drawings 

13 The failure bTthe nucleotide and/or amino acid sequence listing to comply with the standard provided for in Annex C 
of the Administrative Instructions prevents a meaningful search from being carried out: 
the written form has not been furnished or does not comply with the standard, 
^ the computer readable form has not been furnished or does not comply with the standard. 

4. Further comments: 



a. 


□ 


b. 


□ 


C 


□ 


d. 


□ 


e. 


□ 


f. 




g- 


= 


h. 




i. 




J. 




k. 




1. 




m. 





Name and mailing address of the ISA/US 
Mail Stop PCT, Attn: ISA/US 
Commissioner for Patents 
P.O. Box 1450 

Alexandria, Virginia 22313-1450 
Facsimile No. (703)305-3230 



PRIMARY EXAMINER 

Telephone No. 703-308-0196 



Authorized officer 
Carla Myers 



Form PCT/ISA/203 (Jury 1998) 



PATENT COOPERATION TREATY 



Prom Che INTERNATIONAL SEARCHING AUTHORITY 



To: 

TOWNSEND AND TOWNSEND AND CREW LLP 
TWO EMBARCADERO CENTER 
EIGHTH FLOOR 

SAN FRANCISCO, CA 94111-3834 



PCT 



NOTIFICATION OF TRANSMITTAL OF 
THE INTERNATIONAL SEARCH REPORT 
OR THE DECLARATION 

(PCT Rule 44. 1) 



Date of Mailing 
{day/month/year) 



15 AUG 2003 
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International application No. 
PCT/US02/12476 



International filing date 
(day/month/year) 

18 April 2002 (18.04.2002) 



Applicant 

EOS BIOTECHNOLOGY, INC 



The applicant is hereby notified that the international search report has been established and is transmitted herewith. 
Kling of amendments and statement under Article 19: 

The applicant is entitled, if he so wishes, to amend the claims of the international application (see Rule 46): 

When? The time limit for filing such amendments is normally two months from the date of transmittal of the 
international search report. 

Where? Directly to the International Bureau of WZPO, 34, chemin des Colombettes 
1211 Geneva 20, Switzerland, Facsimile No.: (41-22) 740.14.35 

For more detailed instructions, see the notes on the accompanying sheet. 

The applicant Is hereby notified that no international search report will be established and that the declaration under 
Article 17(2)(a) to that effect is transmitted herewith. 

With regard to the protest against payment of (an) additional fee(s) under Rule 40.2, the applicant is notified mat: 

the protest together with the decision thereon has been transmitted to the International Bureau together with the 
applicant's request to forward the texts of both the protest and the decision thereon to the designated Offices, 
no decision has been made yet on the protest; the applicant will be notified as soon as a decision is made. 

4. Reminders 

Shortly after 18 months from the priority date, the international application will be published by the International Bureau. If the 
applicant wishtrto avoid or postpone. publication, a notice of withdrawal of the internationai application, or of the priority claim, must 
reach the International Bureau as provided in Rules 90 bis. 1 and 90 bis. 3, respectively, before the completion of the technical 
preparations for international publication. 

Within 19 months from the priority date, bnt only in respect of some designated Offices, a demand for international preliminary 
examination must bo filed if the applicant wishes to postpone the entry into the national phase unto 30 months from the priority date 
(in some Offices even later); otherwise the applicant must, mthin 20 months from the priority date, perform the prescribed acts for 
entry into the national phase before those designated Offices. 

In respect of other designated Offices, the time limit of 30 months (or later) will apply even if no demand is filed within 1 9 months 

See the Annex to Form PCT/IB/3bl and, for details about the applicable time limits. Office by Office, see the PCT Applicant's Guide, 
Volume II, National Chapters and the WTPO Internet site. 
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